Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / dask/dask issues and pull requests

#8437 - When `divisions` has repeats, `set_index` puts all data in the last partition instead of balancing it

Issue - State: open - Opened by gjoseph92 almost 3 years ago - 1 comment
Labels: dataframe, needs attention

#8435 - [Discussion] Don't compute divisions by default in `set_index`?

Issue - State: open - Opened by gjoseph92 almost 3 years ago - 8 comments
Labels: dataframe, discussion, needs attention

#8430 - Make sort on groupby also affect the groupby sort on the chunk

Pull Request - State: open - Opened by jsignell almost 3 years ago - 3 comments
Labels: dataframe, needs attention

#8421 - DataFrame.groupby sorts group keys even with sort=False

Issue - State: open - Opened by ghost almost 3 years ago - 1 comment
Labels: dataframe, needs attention, bug, p3

#8415 - Documentation for `set_index(col, compute=True)` is unclear/inaccurate

Issue - State: open - Opened by DahnJ almost 3 years ago - 13 comments
Labels: dataframe, documentation, needs attention

#8380 - da.store loses dependency information

Issue - State: open - Opened by djhoese almost 3 years ago - 12 comments
Labels: array, needs attention, bug

#8361 - Optimized groupby aggregations when grouping by a sorted index

Issue - State: open - Opened by gjoseph92 almost 3 years ago - 11 comments
Labels: dataframe, needs attention

#8355 - Could not deserialize task when using `npartitions="auto"` in `DataFrame.set_index()`

Issue - State: open - Opened by aloysius-lim almost 3 years ago - 5 comments
Labels: array, needs attention, bug

#8353 - ignore_index is not used in dd.concat

Issue - State: open - Opened by boazmohar almost 3 years ago - 2 comments
Labels: dataframe, needs attention

#8335 - `aiobotocore` releated test failures

Issue - State: closed - Opened by jrbourbeau almost 3 years ago - 9 comments

#8334 - Use map instead of batch submit in local.get_async

Issue - State: open - Opened by SebastienDorgan almost 3 years ago - 1 comment
Labels: needs attention

#8294 - Shuffle prototype: Feedback (disk usage + workers dying)

Issue - State: open - Opened by DahnJ almost 3 years ago - 6 comments
Labels: needs attention

#8292 - graph became invalid in 2021.10.0

Issue - State: open - Opened by chrisroat almost 3 years ago - 20 comments
Labels: dataframe

#8291 - Fix test_describe_empty to work without global -Werror

Pull Request - State: closed - Opened by mgorny almost 3 years ago - 4 comments
Labels: tests, almost done

#8289 - #4012 for read_csv?

Issue - State: open - Opened by y-he2 almost 3 years ago - 5 comments
Labels: dataframe, io, needs attention

#8280 - computing std of sparse matrix produces an error

Issue - State: open - Opened by vttrifonov almost 3 years ago - 3 comments
Labels: array, needs attention

#8262 - Use Rich more broadly?

Issue - State: open - Opened by mrocklin almost 3 years ago - 1 comment
Labels: discussion, needs attention

#8247 - [WIP] fix OOM error of dask-glm with cupy on GPU

Pull Request - State: open - Opened by daxiongshu almost 3 years ago - 10 comments
Labels: array

#8245 - Implement `unstack()` and/or `pivot()`

Issue - State: open - Opened by DahnJ almost 3 years ago - 4 comments
Labels: dataframe, needs attention

#8233 - Monthly community meeting

Issue - State: open - Opened by jrbourbeau almost 3 years ago - 4 comments
Labels: community

#8229 - Unexpected behaviour with out-of-bound indices

Issue - State: open - Opened by fnattino almost 3 years ago - 3 comments
Labels: array, needs attention

#8216 - Pandas 1.2.0 compatibility - column reductions are applied column-wise (when possible)

Issue - State: open - Opened by jsignell almost 3 years ago - 1 comment
Labels: dataframe

#8196 - Remove `try..except` block in `set_partitions_pre`

Pull Request - State: open - Opened by charlesbluca almost 3 years ago - 2 comments
Labels: dataframe, needs attention

#8172 - botocore error when writing parquet to S3

Issue - State: open - Opened by cliffplaysdrums about 3 years ago - 4 comments
Labels: dataframe, io, parquet, needs attention

#8147 - Unified data reader / writer interfaces

Issue - State: open - Opened by MrPowers about 3 years ago - 3 comments
Labels: io, parquet

#8143 - A few threaded scheduler fixups

Pull Request - State: open - Opened by jcrist about 3 years ago - 7 comments

#8062 - Flaky `test_create_metadata_file`

Issue - State: closed - Opened by jrbourbeau about 3 years ago - 10 comments
Labels: dataframe, io, tests, parquet, needs attention

#8058 - [Discussion] Improve Parquet-Metadata Processing in read_parquet

Issue - State: open - Opened by rjzamora about 3 years ago - 8 comments
Labels: dataframe, io, discussion, parquet, needs attention

#8020 - Use uniform distribution in `timeseries` demo

Pull Request - State: open - Opened by jrbourbeau about 3 years ago - 3 comments
Labels: dataframe, io, needs attention

#8001 - in code suggestion of when to use split_out in dask.dataframe.groupby

Issue - State: open - Opened by raybellwaves about 3 years ago - 6 comments
Labels: dataframe, documentation, needs attention

#7999 - Possible bug when using dask.array.bincount and dask.array.apply_along_axis

Issue - State: open - Opened by miguelcarcamov about 3 years ago - 4 comments
Labels: array, needs attention

#7996 - Flaky `test_setitem_extended_API_2d[index13-value13]`

Issue - State: open - Opened by pentschev about 3 years ago - 2 comments
Labels: tests, needs attention

#7977 - Pyarrow metadata `RuntimeError` in `to_parquet`

Issue - State: open - Opened by jrbourbeau about 3 years ago - 26 comments
Labels: dataframe, io, parquet, needs attention

#7859 - mysterious rechunking error

Issue - State: closed - Opened by d-v-b about 3 years ago - 6 comments
Labels: array, tests

#7718 - Frobenius norm promotes float32 matrix to float64 norm

Issue - State: closed - Opened by RogerMoens over 3 years ago - 4 comments
Labels: array

#7708 - Delayed method in a 'delayed_pure' context produces a different Delayed key every time it is evaluated

Issue - State: closed - Opened by kdebrab over 3 years ago - 5 comments
Labels: delayed

#7708 - Delayed method in a 'delayed_pure' context produces a different Delayed key every time it is evaluated

Issue - State: closed - Opened by kdebrab over 3 years ago - 5 comments
Labels: delayed

#6723 - Switch to different, stable hash algorithm in Bag

Issue - State: closed - Opened by itamarst almost 4 years ago - 14 comments
Labels: bag

#6691 - Support from_pandas with known divisions

Issue - State: open - Opened by syagev almost 4 years ago - 3 comments
Labels: dataframe

#6525 - Make sure dask array are writable via the __array__ protocol.

Pull Request - State: closed - Opened by Carreau about 4 years ago - 11 comments

#6329 - add mode example to custom aggregation

Issue - State: open - Opened by raybellwaves over 4 years ago - 6 comments
Labels: documentation

#6280 - doc: add nunique example to custom aggregation

Issue - State: closed - Opened by raybellwaves over 4 years ago - 7 comments
Labels: dataframe, documentation

#6272 - Expose intermediate rechunking logic da.reshape

Issue - State: closed - Opened by TomAugspurger over 4 years ago - 3 comments
Labels: array

#5794 - Mean implementation for datetime series

Pull Request - State: open - Opened by exemplary-citizen over 4 years ago - 12 comments

#5679 - da.std and da.var handle complex values incorrectly

Issue - State: closed - Opened by TAdeJong almost 5 years ago - 9 comments
Labels: array

#5633 - Add partition lengths to DataFrame metadata.

Issue - State: open - Opened by hadim almost 5 years ago - 13 comments
Labels: dataframe

#5610 - Clean up warnings in doc build

Issue - State: open - Opened by TomAugspurger almost 5 years ago - 8 comments
Labels: good first issue, documentation

#5544 - chunks get combined in 4d array reshape

Issue - State: closed - Opened by rabernat almost 5 years ago - 23 comments
Labels: array

#5432 - Proxying Dask (Bokeh) Web Interface on AWS SageMaker

Issue - State: open - Opened by davidtwomey about 5 years ago - 27 comments

#5051 - Implement mean for Series[datetime64[ns]] and DatetimeIndex

Issue - State: open - Opened by TomAugspurger about 5 years ago - 11 comments
Labels: good first issue, dataframe

#4974 - dd.Series.isin() broken for dict views in distributed

Issue - State: closed - Opened by gsakkis over 5 years ago - 7 comments
Labels: dataframe

#4869 - Groupby NUnique is slow and possibly buggy

Issue - State: open - Opened by bluecoconut over 5 years ago - 17 comments
Labels: dataframe

#4845 - ValueError: cannot handle a non-unique multi-index!

Issue - State: open - Opened by marberi over 5 years ago - 27 comments
Labels: dataframe

#4368 - Sort dask array

Issue - State: open - Opened by Pierre-Bartet over 5 years ago - 13 comments
Labels: array

#4299 - Bug in array.map_blocks when providing chunks and UDF adjusts shape

Issue - State: closed - Opened by TomAugspurger almost 6 years ago - 15 comments

#4170 - Implement Dask equivalent of pandas.io.json.json_normalize

Issue - State: open - Opened by nsadeh almost 6 years ago - 19 comments
Labels: dataframe, io

#3650 - Performance issue on `compute` with reshaped arrays

Issue - State: closed - Opened by nbren12 over 6 years ago - 10 comments
Labels: array

#3280 - Support norm in dask.array.fft

Issue - State: closed - Opened by jakirkham over 6 years ago
Labels: array

#3258 - Documenting NumPy functions that work fine with Dask Arrays

Issue - State: closed - Opened by jakirkham over 6 years ago - 8 comments
Labels: good first issue, array, documentation

#3245 - Silencing warnings issued by numpy functions within dask.array

Issue - State: open - Opened by shoyer over 6 years ago - 30 comments
Labels: array

#2824 - Add axis= keyword to percentile

Issue - State: open - Opened by mrocklin almost 7 years ago - 8 comments
Labels: array

#2802 - Keep original filenames in dask.dataframe.read_csv

Issue - State: closed - Opened by kmader almost 7 years ago - 19 comments

#2489 - Left side of old and new divisions are different

Issue - State: closed - Opened by shughes-uk over 7 years ago - 8 comments

#1493 - Full support for multiindex in dataframes

Issue - State: open - Opened by dirkbike about 8 years ago - 55 comments
Labels: dataframe

#1259 - Add missing methods to Series

Issue - State: open - Opened by mrocklin over 8 years ago - 23 comments
Labels: good first issue, dataframe