Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / dask/dask issues and pull requests
#8437 - When `divisions` has repeats, `set_index` puts all data in the last partition instead of balancing it
Issue -
State: open - Opened by gjoseph92 almost 3 years ago
- 1 comment
Labels: dataframe, needs attention
#8435 - [Discussion] Don't compute divisions by default in `set_index`?
Issue -
State: open - Opened by gjoseph92 almost 3 years ago
- 8 comments
Labels: dataframe, discussion, needs attention
#8430 - Make sort on groupby also affect the groupby sort on the chunk
Pull Request -
State: open - Opened by jsignell almost 3 years ago
- 3 comments
Labels: dataframe, needs attention
#8421 - DataFrame.groupby sorts group keys even with sort=False
Issue -
State: open - Opened by ghost almost 3 years ago
- 1 comment
Labels: dataframe, needs attention, bug, p3
#8415 - Documentation for `set_index(col, compute=True)` is unclear/inaccurate
Issue -
State: open - Opened by DahnJ almost 3 years ago
- 13 comments
Labels: dataframe, documentation, needs attention
#8380 - da.store loses dependency information
Issue -
State: open - Opened by djhoese almost 3 years ago
- 12 comments
Labels: array, needs attention, bug
#8361 - Optimized groupby aggregations when grouping by a sorted index
Issue -
State: open - Opened by gjoseph92 almost 3 years ago
- 11 comments
Labels: dataframe, needs attention
#8355 - Could not deserialize task when using `npartitions="auto"` in `DataFrame.set_index()`
Issue -
State: open - Opened by aloysius-lim almost 3 years ago
- 5 comments
Labels: array, needs attention, bug
#8353 - ignore_index is not used in dd.concat
Issue -
State: open - Opened by boazmohar almost 3 years ago
- 2 comments
Labels: dataframe, needs attention
#8335 - `aiobotocore` releated test failures
Issue -
State: closed - Opened by jrbourbeau almost 3 years ago
- 9 comments
#8334 - Use map instead of batch submit in local.get_async
Issue -
State: open - Opened by SebastienDorgan almost 3 years ago
- 1 comment
Labels: needs attention
#8294 - Shuffle prototype: Feedback (disk usage + workers dying)
Issue -
State: open - Opened by DahnJ almost 3 years ago
- 6 comments
Labels: needs attention
#8292 - graph became invalid in 2021.10.0
Issue -
State: open - Opened by chrisroat almost 3 years ago
- 20 comments
Labels: dataframe
#8291 - Fix test_describe_empty to work without global -Werror
Pull Request -
State: closed - Opened by mgorny almost 3 years ago
- 4 comments
Labels: tests, almost done
#8289 - #4012 for read_csv?
Issue -
State: open - Opened by y-he2 almost 3 years ago
- 5 comments
Labels: dataframe, io, needs attention
#8280 - computing std of sparse matrix produces an error
Issue -
State: open - Opened by vttrifonov almost 3 years ago
- 3 comments
Labels: array, needs attention
#8262 - Use Rich more broadly?
Issue -
State: open - Opened by mrocklin almost 3 years ago
- 1 comment
Labels: discussion, needs attention
#8247 - [WIP] fix OOM error of dask-glm with cupy on GPU
Pull Request -
State: open - Opened by daxiongshu almost 3 years ago
- 10 comments
Labels: array
#8245 - Implement `unstack()` and/or `pivot()`
Issue -
State: open - Opened by DahnJ almost 3 years ago
- 4 comments
Labels: dataframe, needs attention
#8233 - Monthly community meeting
Issue -
State: open - Opened by jrbourbeau almost 3 years ago
- 4 comments
Labels: community
#8229 - Unexpected behaviour with out-of-bound indices
Issue -
State: open - Opened by fnattino almost 3 years ago
- 3 comments
Labels: array, needs attention
#8216 - Pandas 1.2.0 compatibility - column reductions are applied column-wise (when possible)
Issue -
State: open - Opened by jsignell almost 3 years ago
- 1 comment
Labels: dataframe
#8196 - Remove `try..except` block in `set_partitions_pre`
Pull Request -
State: open - Opened by charlesbluca almost 3 years ago
- 2 comments
Labels: dataframe, needs attention
#8172 - botocore error when writing parquet to S3
Issue -
State: open - Opened by cliffplaysdrums about 3 years ago
- 4 comments
Labels: dataframe, io, parquet, needs attention
#8147 - Unified data reader / writer interfaces
Issue -
State: open - Opened by MrPowers about 3 years ago
- 3 comments
Labels: io, parquet
#8143 - A few threaded scheduler fixups
Pull Request -
State: open - Opened by jcrist about 3 years ago
- 7 comments
#8062 - Flaky `test_create_metadata_file`
Issue -
State: closed - Opened by jrbourbeau about 3 years ago
- 10 comments
Labels: dataframe, io, tests, parquet, needs attention
#8058 - [Discussion] Improve Parquet-Metadata Processing in read_parquet
Issue -
State: open - Opened by rjzamora about 3 years ago
- 8 comments
Labels: dataframe, io, discussion, parquet, needs attention
#8020 - Use uniform distribution in `timeseries` demo
Pull Request -
State: open - Opened by jrbourbeau about 3 years ago
- 3 comments
Labels: dataframe, io, needs attention
#8001 - in code suggestion of when to use split_out in dask.dataframe.groupby
Issue -
State: open - Opened by raybellwaves about 3 years ago
- 6 comments
Labels: dataframe, documentation, needs attention
#7999 - Possible bug when using dask.array.bincount and dask.array.apply_along_axis
Issue -
State: open - Opened by miguelcarcamov about 3 years ago
- 4 comments
Labels: array, needs attention
#7996 - Flaky `test_setitem_extended_API_2d[index13-value13]`
Issue -
State: open - Opened by pentschev about 3 years ago
- 2 comments
Labels: tests, needs attention
#7977 - Pyarrow metadata `RuntimeError` in `to_parquet`
Issue -
State: open - Opened by jrbourbeau about 3 years ago
- 26 comments
Labels: dataframe, io, parquet, needs attention
#7859 - mysterious rechunking error
Issue -
State: closed - Opened by d-v-b about 3 years ago
- 6 comments
Labels: array, tests
#7718 - Frobenius norm promotes float32 matrix to float64 norm
Issue -
State: closed - Opened by RogerMoens over 3 years ago
- 4 comments
Labels: array
#7708 - Delayed method in a 'delayed_pure' context produces a different Delayed key every time it is evaluated
Issue -
State: closed - Opened by kdebrab over 3 years ago
- 5 comments
Labels: delayed
#7708 - Delayed method in a 'delayed_pure' context produces a different Delayed key every time it is evaluated
Issue -
State: closed - Opened by kdebrab over 3 years ago
- 5 comments
Labels: delayed
#6723 - Switch to different, stable hash algorithm in Bag
Issue -
State: closed - Opened by itamarst almost 4 years ago
- 14 comments
Labels: bag
#6691 - Support from_pandas with known divisions
Issue -
State: open - Opened by syagev almost 4 years ago
- 3 comments
Labels: dataframe
#6525 - Make sure dask array are writable via the __array__ protocol.
Pull Request -
State: closed - Opened by Carreau about 4 years ago
- 11 comments
#6354 - Cannot compute min or max of dates in dask array when converted from dask dataframe using to_dask_array
Issue -
State: open - Opened by kylejn27 about 4 years ago
- 18 comments
Labels: array
#6329 - add mode example to custom aggregation
Issue -
State: open - Opened by raybellwaves over 4 years ago
- 6 comments
Labels: documentation
#6280 - doc: add nunique example to custom aggregation
Issue -
State: closed - Opened by raybellwaves over 4 years ago
- 7 comments
Labels: dataframe, documentation
#6272 - Expose intermediate rechunking logic da.reshape
Issue -
State: closed - Opened by TomAugspurger over 4 years ago
- 3 comments
Labels: array
#5794 - Mean implementation for datetime series
Pull Request -
State: open - Opened by exemplary-citizen over 4 years ago
- 12 comments
#5679 - da.std and da.var handle complex values incorrectly
Issue -
State: closed - Opened by TAdeJong almost 5 years ago
- 9 comments
Labels: array
#5633 - Add partition lengths to DataFrame metadata.
Issue -
State: open - Opened by hadim almost 5 years ago
- 13 comments
Labels: dataframe
#5610 - Clean up warnings in doc build
Issue -
State: open - Opened by TomAugspurger almost 5 years ago
- 8 comments
Labels: good first issue, documentation
#5544 - chunks get combined in 4d array reshape
Issue -
State: closed - Opened by rabernat almost 5 years ago
- 23 comments
Labels: array
#5432 - Proxying Dask (Bokeh) Web Interface on AWS SageMaker
Issue -
State: open - Opened by davidtwomey about 5 years ago
- 27 comments
#5051 - Implement mean for Series[datetime64[ns]] and DatetimeIndex
Issue -
State: open - Opened by TomAugspurger about 5 years ago
- 11 comments
Labels: good first issue, dataframe
#4974 - dd.Series.isin() broken for dict views in distributed
Issue -
State: closed - Opened by gsakkis over 5 years ago
- 7 comments
Labels: dataframe
#4959 - How do I fill any value to a dask dataframe cell (cell having specific row number and column number)?
Issue -
State: closed - Opened by 3ggaurav over 5 years ago
- 7 comments
#4869 - Groupby NUnique is slow and possibly buggy
Issue -
State: open - Opened by bluecoconut over 5 years ago
- 17 comments
Labels: dataframe
#4845 - ValueError: cannot handle a non-unique multi-index!
Issue -
State: open - Opened by marberi over 5 years ago
- 27 comments
Labels: dataframe
#4368 - Sort dask array
Issue -
State: open - Opened by Pierre-Bartet over 5 years ago
- 13 comments
Labels: array
#4299 - Bug in array.map_blocks when providing chunks and UDF adjusts shape
Issue -
State: closed - Opened by TomAugspurger almost 6 years ago
- 15 comments
#4170 - Implement Dask equivalent of pandas.io.json.json_normalize
Issue -
State: open - Opened by nsadeh almost 6 years ago
- 19 comments
Labels: dataframe, io
#3650 - Performance issue on `compute` with reshaped arrays
Issue -
State: closed - Opened by nbren12 over 6 years ago
- 10 comments
Labels: array
#3280 - Support norm in dask.array.fft
Issue -
State: closed - Opened by jakirkham over 6 years ago
Labels: array
#3258 - Documenting NumPy functions that work fine with Dask Arrays
Issue -
State: closed - Opened by jakirkham over 6 years ago
- 8 comments
Labels: good first issue, array, documentation
#3245 - Silencing warnings issued by numpy functions within dask.array
Issue -
State: open - Opened by shoyer over 6 years ago
- 30 comments
Labels: array
#2824 - Add axis= keyword to percentile
Issue -
State: open - Opened by mrocklin almost 7 years ago
- 8 comments
Labels: array
#2802 - Keep original filenames in dask.dataframe.read_csv
Issue -
State: closed - Opened by kmader almost 7 years ago
- 19 comments
#2489 - Left side of old and new divisions are different
Issue -
State: closed - Opened by shughes-uk over 7 years ago
- 8 comments
#1493 - Full support for multiindex in dataframes
Issue -
State: open - Opened by dirkbike about 8 years ago
- 55 comments
Labels: dataframe
#1259 - Add missing methods to Series
Issue -
State: open - Opened by mrocklin over 8 years ago
- 23 comments
Labels: good first issue, dataframe