Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / dask/dask issues and pull requests

#11407 - Update gpuCI `RAPIDS_VER` to `24.12`

Pull Request - State: open - Opened by github-actions[bot] 4 days ago

#11406 - unable to compute dataframe after sorting

Issue - State: open - Opened by Cognitus-Stuti 5 days ago - 1 comment
Labels: needs triage

#11405 - Bump jacobtomlinson/gha-anaconda-package-version from 0.1.3 to 0.1.4

Pull Request - State: closed - Opened by dependabot[bot] 5 days ago - 1 comment
Labels: dependencies

#11404 - Invalid `validate` argument in `dask.dataframe.merge`

Issue - State: open - Opened by noahblakesmith 7 days ago
Labels: needs triage

#11402 - Is there any way to have the finalize task be distributed across workers too

Issue - State: open - Opened by Cognitus-Stuti 8 days ago - 1 comment
Labels: needs triage

#11401 - pandas & dask metadata mismatch after .unique()

Issue - State: open - Opened by JoranDox 8 days ago
Labels: needs triage

#11400 - Add tests for row-wise mode functionality in DataFrame in test_core.py

Pull Request - State: open - Opened by thyripian 9 days ago - 3 comments

#11399 - ENH: relax conditions on boolean index assignment

Pull Request - State: closed - Opened by lucascolley 9 days ago - 3 comments

#11398 - Boolean index assignment fails for values of `ndim>1`

Issue - State: open - Opened by lucascolley 10 days ago
Labels: needs triage

#11397 - Improve error message for boolean index assignment with `nan` shape

Issue - State: open - Opened by lucascolley 10 days ago - 1 comment
Labels: needs triage

#11394 - Discrepancy in column property with actual structure after grouping

Issue - State: open - Opened by dbalabka 11 days ago
Labels: needs triage

#11393 - Improve error message for incorrect columns order in meta information

Pull Request - State: open - Opened by dbalabka 11 days ago - 2 comments

#11392 - Improve error message for incorrect columns order in meta information (#11390)

Pull Request - State: closed - Opened by dbalabka 11 days ago - 3 comments

#11391 - Circular imports in dask-histogram/dask-awkward

Issue - State: open - Opened by martindurant 11 days ago - 6 comments
Labels: needs triage

#11390 - "Order of columns does not match" error should give an extra info about expected order

Issue - State: open - Opened by dbalabka 11 days ago - 2 comments
Labels: dataframe

#11389 - mode on `axis=1`

Issue - State: open - Opened by marcdelabarrera 12 days ago - 4 comments
Labels: dataframe, enhancement

#11388 - [WIP] Zarr-Python 3 compatibility

Pull Request - State: open - Opened by jhamman 14 days ago - 2 comments
Labels: upstream

#11386 - Memory issues with slicing

Issue - State: open - Opened by csbrown 16 days ago - 3 comments
Labels: needs triage

#11385 - Revert "Improve normalize_chunks calculation for "auto" setting"

Pull Request - State: closed - Opened by jrbourbeau 16 days ago - 3 comments
Labels: needs triage

#11384 - dask.dataframe can't read_csv

Issue - State: open - Opened by FredaXYu 17 days ago - 3 comments
Labels: needs triage

#11383 - New "auto" rechunking can break with Zarr

Issue - State: closed - Opened by jrbourbeau 18 days ago
Labels: array, bug

#11382 - when using max/min as first expression for new collumn dataframe will not compute

Issue - State: open - Opened by JavrelWork 18 days ago - 1 comment
Labels: needs triage

#11380 - Bump peter-evans/create-pull-request from 6 to 7

Pull Request - State: closed - Opened by dependabot[bot] 19 days ago - 1 comment
Labels: dependencies

#11378 - Use TaskSpec in local dask execution

Pull Request - State: open - Opened by fjetter 22 days ago - 1 comment

#11377 - Tasks - Remove sequence dict classes

Pull Request - State: open - Opened by fjetter 22 days ago - 1 comment

#11376 - Slicing an array on the last chunk of an axis duplicates the number of chunks

Issue - State: closed - Opened by josephnowak 23 days ago - 3 comments
Labels: needs triage

#11375 - Bump ``bokeh`` minimum version to 3.1.0

Pull Request - State: closed - Opened by jrbourbeau 23 days ago - 2 comments

#11374 - New account registrations not allowed on Discourse

Issue - State: open - Opened by tiarap00 24 days ago
Labels: needs triage

#11373 - Reduce overhead in tokenize

Pull Request - State: closed - Opened by fjetter 24 days ago - 3 comments

#11372 - Update Dask copyright in the docs to 2024

Pull Request - State: open - Opened by krishanbhasin-px 24 days ago - 2 comments

#11371 - Move ``tokenize`` to dedicated submodule

Pull Request - State: closed - Opened by fjetter 24 days ago - 5 comments

#11369 - Use ``np.min_scalar_type`` in shuffle

Pull Request - State: closed - Opened by jrbourbeau 25 days ago - 1 comment

#11368 - Client submit with workers doesn’t handle new joining workers correctly

Issue - State: open - Opened by YuriFeigin 25 days ago
Labels: needs triage

#11367 - Ensure process_runnables is not too eager in the presence of multiple splits

Pull Request - State: closed - Opened by fjetter 25 days ago - 2 comments

#11366 - Bump JamesIves/github-pages-deploy-action from 4.6.3 to 4.6.4

Pull Request - State: closed - Opened by dependabot[bot] 26 days ago - 1 comment
Labels: dependencies

#11365 - test_tokenize failures in 2024.8.1

Issue - State: open - Opened by QuLogic 26 days ago - 5 comments
Labels: needs triage

#11365 - test_tokenize failures in 2024.8.1

Issue - State: open - Opened by QuLogic 26 days ago
Labels: needs triage

#11364 - Cast indexer to minimal dtype in shuffle

Pull Request - State: closed - Opened by phofl 26 days ago - 1 comment

#11362 - Write indexing arrays into dask graph to reduce size for multiple xarray variables

Pull Request - State: closed - Opened by phofl 26 days ago - 2 comments

#11361 - Reduce memory usage of dask.order

Pull Request - State: closed - Opened by fjetter 26 days ago - 1 comment

#11360 - precommit autoupdate

Pull Request - State: closed - Opened by fjetter 26 days ago - 1 comment

#11359 - Release 2024.8.2

Pull Request - State: closed - Opened by jrbourbeau 29 days ago

#11358 - zipfile.BadZipFile: Overlapped entries (possible zip bomb)

Issue - State: open - Opened by leonardozilli 29 days ago - 1 comment
Labels: dataframe, needs info

#11357 - Update zoom link for dask meeting

Pull Request - State: closed - Opened by scharlottej13 30 days ago - 2 comments

#11356 - Add option to automatically compute chunk sizes in dask

Issue - State: closed - Opened by lithomas1 about 1 month ago - 4 comments
Labels: needs triage

#11355 - Ensure tokenize is thread safe

Pull Request - State: closed - Opened by fjetter about 1 month ago - 4 comments

#11354 - Improve normalize_chunks calculation for "auto" setting

Pull Request - State: closed - Opened by phofl about 1 month ago - 2 comments

#11353 - Futures not always resolved when using dataframe.reduction

Issue - State: open - Opened by DaniJG about 1 month ago
Labels: needs triage, dask-expr

#11352 - Parquet read with `filesystem="arrow"` fails when `distributed` isn't imported first

Issue - State: open - Opened by rjzamora about 1 month ago - 3 comments
Labels: bug, dask-expr

#11351 - Deprecate legacy DataFrame implementation

Issue - State: open - Opened by phofl about 1 month ago - 1 comment
Labels: dataframe, deprecation

#11350 - Add changelor entries for shuffle, vindex and blockwise_reshape

Pull Request - State: closed - Opened by phofl about 1 month ago - 1 comment

#11349 - map_overlap passes wrong block_info[:]['array-location']

Issue - State: open - Opened by bnavigator about 1 month ago - 1 comment
Labels: array

#11348 - Ensure persisted collections are released without GC

Pull Request - State: closed - Opened by fjetter about 1 month ago - 2 comments

#11347 - Full support for task spec in dask.order

Pull Request - State: open - Opened by fjetter about 1 month ago - 1 comment

#11346 - KilledWorker (exceeded 95% memory budget) with new optimizer

Issue - State: open - Opened by noreentry about 1 month ago - 5 comments
Labels: needs triage

#11345 - Increase visibility of GPU CI updates

Pull Request - State: closed - Opened by charlesbluca about 1 month ago - 1 comment

#11344 - Weird RecursionError during `tokenize`

Issue - State: closed - Opened by hanjinliu about 1 month ago - 5 comments
Labels: needs info

#11343 - Bug: Can't perform a (meaningful) "outer" concatenation with dask-expr on `axis=1`

Issue - State: closed - Opened by benrutter about 1 month ago - 1 comment
Labels: dask-expr

#11342 - Better chunk size value for chunks=auto setting

Issue - State: open - Opened by phofl about 1 month ago
Labels: array

#11341 - Improve how normalize_chunks selects chunk sizes if auto is given

Issue - State: closed - Opened by phofl about 1 month ago - 2 comments
Labels: array

#11340 - Update ``numpy`` and ``pyarrow`` versions in install docs

Pull Request - State: closed - Opened by jrbourbeau about 1 month ago

#11339 - Suggesting updates on the doc of `dask.dataframe.read_sql_query`

Issue - State: open - Opened by ParsifalXu about 1 month ago - 2 comments
Labels: dataframe, documentation

#11338 - Fixup dask and distributed dependencies

Pull Request - State: closed - Opened by phofl about 1 month ago

#11337 - Choose automatically between tasks-based and p2p rechunking

Pull Request - State: closed - Opened by hendrikmakait about 1 month ago - 7 comments

#11336 - An inconsistency between the documentation of `dask.array.percentile` and code implementation

Issue - State: open - Opened by ParsifalXu about 1 month ago - 2 comments
Labels: array, documentation

#11335 - Add ``crick`` back to Python 3.11+ CI builds

Pull Request - State: closed - Opened by jrbourbeau about 1 month ago - 2 comments

#11334 - gpuCI failing

Issue - State: closed - Opened by jrbourbeau about 1 month ago - 3 comments
Labels: tests, gpu

#11333 - order: Run ordering test on distributed cluster and compare against local ordering

Issue - State: open - Opened by phofl about 1 month ago
Labels: dask-order

#11332 - Fix docstring formatting for map_overlap

Pull Request - State: closed - Opened by Tao-VanJS about 1 month ago - 3 comments

#11331 - Bump `numpy>=1.24` and `pyarrow>=14.0.1` minimum versions

Pull Request - State: closed - Opened by jrbourbeau about 1 month ago - 4 comments

#11330 - Preserve chunksizes in vindex

Pull Request - State: closed - Opened by phofl about 1 month ago - 2 comments

#11329 - `map_blocks()` with `new_axis` output has incorrect shape

Issue - State: open - Opened by dstansby about 1 month ago - 3 comments
Labels: array

#11328 - Implement blockwise reshape

Pull Request - State: closed - Opened by phofl about 1 month ago - 2 comments

#11327 - Fix NumPy overflowing for prod on 2.0

Pull Request - State: closed - Opened by phofl about 1 month ago - 2 comments

#11326 - Make rechunking in shuffle more intelligent to distribute unevenly if necessary

Pull Request - State: closed - Opened by phofl about 1 month ago - 1 comment

#11325 - read_sql_table would throw an exception when calling for unique values of a column

Issue - State: closed - Opened by phalvesmbai about 1 month ago
Labels: dataframe, io

#11324 - Add changelog entry for reshape and ordering improvements

Pull Request - State: closed - Opened by phofl about 1 month ago - 2 comments

#11323 - Bump mindeps for pyarrow and numpy

Issue - State: closed - Opened by fjetter about 1 month ago - 3 comments
Labels: needs triage

#11322 - Avoid casting arrow dtypes to numpy object for tokenize

Pull Request - State: closed - Opened by phofl about 1 month ago - 2 comments

#11321 - Revert "Test ordering on distributed scheduler (#11310)"

Pull Request - State: closed - Opened by fjetter about 1 month ago - 2 comments

#11320 - Ensure pickle does not change tokens

Pull Request - State: closed - Opened by fjetter about 1 month ago - 12 comments

#11319 - Pass additional parameters to `rechunk_p2p`

Pull Request - State: closed - Opened by hendrikmakait about 1 month ago - 1 comment

#11318 - cannot access local variable 'divisions' where it is not associated with a value

Issue - State: open - Opened by Cognitus-Stuti about 1 month ago - 1 comment
Labels: needs triage

#11317 - Rename chunksize-tolerance option

Pull Request - State: closed - Opened by phofl about 1 month ago - 5 comments

#11316 - Requested dask.distributed scheduler but no Client active

Issue - State: open - Opened by Cognitus-Stuti about 2 months ago - 5 comments
Labels: needs info

#11315 - ⚠️ Upstream CI failed ⚠️

Issue - State: closed - Opened by github-actions[bot] about 2 months ago - 3 comments
Labels: upstream

#11314 - Expose a blockwise - reshape operation that doesn't guarantee to keep the ordering consistent for downstream libraries

Issue - State: closed - Opened by phofl about 2 months ago - 5 comments
Labels: array, array-expr

#11313 - Add tests to cover more cases of new reshape implementation

Pull Request - State: closed - Opened by phofl about 2 months ago - 1 comment

#11312 - gpuCI broken

Issue - State: open - Opened by fjetter about 2 months ago - 7 comments
Labels: needs triage

#11311 - Implement automatic rechunking for shuffle

Pull Request - State: closed - Opened by phofl about 2 months ago - 2 comments

#11310 - Test ordering on distributed scheduler

Pull Request - State: closed - Opened by fjetter about 2 months ago - 3 comments

#11309 - Upgrade gpuCI and fix Dask Array failures with "cupy" backend

Pull Request - State: closed - Opened by rjzamora about 2 months ago - 3 comments
Labels: array, bug, gpu

#11308 - Unexpected Behavior When Using `dask.delayed` with `xarray` to Load a Chunked Dataset

Issue - State: open - Opened by Eis-ba-er about 2 months ago - 3 comments
Labels: needs triage