Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / dask/dask issues and pull requests
#11407 - Update gpuCI `RAPIDS_VER` to `24.12`
Pull Request -
State: open - Opened by github-actions[bot] 4 days ago
#11406 - unable to compute dataframe after sorting
Issue -
State: open - Opened by Cognitus-Stuti 5 days ago
- 1 comment
Labels: needs triage
#11405 - Bump jacobtomlinson/gha-anaconda-package-version from 0.1.3 to 0.1.4
Pull Request -
State: closed - Opened by dependabot[bot] 5 days ago
- 1 comment
Labels: dependencies
#11404 - Invalid `validate` argument in `dask.dataframe.merge`
Issue -
State: open - Opened by noahblakesmith 7 days ago
Labels: needs triage
#11403 - Delayed function with string argument value matching dask_key_name causes a circular reference detection
Issue -
State: open - Opened by syagev 7 days ago
- 1 comment
#11402 - Is there any way to have the finalize task be distributed across workers too
Issue -
State: open - Opened by Cognitus-Stuti 8 days ago
- 1 comment
Labels: needs triage
#11401 - pandas & dask metadata mismatch after .unique()
Issue -
State: open - Opened by JoranDox 8 days ago
Labels: needs triage
#11400 - Add tests for row-wise mode functionality in DataFrame in test_core.py
Pull Request -
State: open - Opened by thyripian 9 days ago
- 3 comments
#11399 - ENH: relax conditions on boolean index assignment
Pull Request -
State: closed - Opened by lucascolley 9 days ago
- 3 comments
#11398 - Boolean index assignment fails for values of `ndim>1`
Issue -
State: open - Opened by lucascolley 10 days ago
Labels: needs triage
#11397 - Improve error message for boolean index assignment with `nan` shape
Issue -
State: open - Opened by lucascolley 10 days ago
- 1 comment
Labels: needs triage
#11396 - Are there any workarounds for dask breaking altogether with higher amounts of load than what fits into a worker
Issue -
State: open - Opened by Cognitus-Stuti 10 days ago
- 2 comments
Labels: needs triage
#11395 - When adding collumns from 2 dataframes will not compute in some instances, fix for one instance seems to break the other
Issue -
State: open - Opened by JavrelWork 10 days ago
- 1 comment
Labels: needs triage
#11394 - Discrepancy in column property with actual structure after grouping
Issue -
State: open - Opened by dbalabka 11 days ago
Labels: needs triage
#11393 - Improve error message for incorrect columns order in meta information
Pull Request -
State: open - Opened by dbalabka 11 days ago
- 2 comments
#11392 - Improve error message for incorrect columns order in meta information (#11390)
Pull Request -
State: closed - Opened by dbalabka 11 days ago
- 3 comments
#11391 - Circular imports in dask-histogram/dask-awkward
Issue -
State: open - Opened by martindurant 11 days ago
- 6 comments
Labels: needs triage
#11390 - "Order of columns does not match" error should give an extra info about expected order
Issue -
State: open - Opened by dbalabka 11 days ago
- 2 comments
Labels: dataframe
#11389 - mode on `axis=1`
Issue -
State: open - Opened by marcdelabarrera 12 days ago
- 4 comments
Labels: dataframe, enhancement
#11388 - [WIP] Zarr-Python 3 compatibility
Pull Request -
State: open - Opened by jhamman 14 days ago
- 2 comments
Labels: upstream
#11387 - Switch to using ``zarr.open_array`` instead of using the ``zarr.Array`` constructor
Pull Request -
State: closed - Opened by jhamman 15 days ago
- 1 comment
#11386 - Memory issues with slicing
Issue -
State: open - Opened by csbrown 16 days ago
- 3 comments
Labels: needs triage
#11385 - Revert "Improve normalize_chunks calculation for "auto" setting"
Pull Request -
State: closed - Opened by jrbourbeau 16 days ago
- 3 comments
Labels: needs triage
#11384 - dask.dataframe can't read_csv
Issue -
State: open - Opened by FredaXYu 17 days ago
- 3 comments
Labels: needs triage
#11383 - New "auto" rechunking can break with Zarr
Issue -
State: closed - Opened by jrbourbeau 18 days ago
Labels: array, bug
#11382 - when using max/min as first expression for new collumn dataframe will not compute
Issue -
State: open - Opened by JavrelWork 18 days ago
- 1 comment
Labels: needs triage
#11381 - Appending to partitioned parquet with metadata throws appended dtypes differ even though they should be the same
Issue -
State: open - Opened by paluchs 18 days ago
Labels: needs triage
#11380 - Bump peter-evans/create-pull-request from 6 to 7
Pull Request -
State: closed - Opened by dependabot[bot] 19 days ago
- 1 comment
Labels: dependencies
#11379 - different `run_spec` between consecutive calls to `update_graph` | zarr-formatted xarray
Issue -
State: open - Opened by templiert 21 days ago
Labels: needs triage
#11378 - Use TaskSpec in local dask execution
Pull Request -
State: open - Opened by fjetter 22 days ago
- 1 comment
#11377 - Tasks - Remove sequence dict classes
Pull Request -
State: open - Opened by fjetter 22 days ago
- 1 comment
#11376 - Slicing an array on the last chunk of an axis duplicates the number of chunks
Issue -
State: closed - Opened by josephnowak 23 days ago
- 3 comments
Labels: needs triage
#11375 - Bump ``bokeh`` minimum version to 3.1.0
Pull Request -
State: closed - Opened by jrbourbeau 23 days ago
- 2 comments
#11374 - New account registrations not allowed on Discourse
Issue -
State: open - Opened by tiarap00 24 days ago
Labels: needs triage
#11373 - Reduce overhead in tokenize
Pull Request -
State: closed - Opened by fjetter 24 days ago
- 3 comments
#11372 - Update Dask copyright in the docs to 2024
Pull Request -
State: open - Opened by krishanbhasin-px 24 days ago
- 2 comments
#11371 - Move ``tokenize`` to dedicated submodule
Pull Request -
State: closed - Opened by fjetter 24 days ago
- 5 comments
#11369 - Use ``np.min_scalar_type`` in shuffle
Pull Request -
State: closed - Opened by jrbourbeau 25 days ago
- 1 comment
#11368 - Client submit with workers doesn’t handle new joining workers correctly
Issue -
State: open - Opened by YuriFeigin 25 days ago
Labels: needs triage
#11367 - Ensure process_runnables is not too eager in the presence of multiple splits
Pull Request -
State: closed - Opened by fjetter 25 days ago
- 2 comments
#11366 - Bump JamesIves/github-pages-deploy-action from 4.6.3 to 4.6.4
Pull Request -
State: closed - Opened by dependabot[bot] 26 days ago
- 1 comment
Labels: dependencies
#11365 - test_tokenize failures in 2024.8.1
Issue -
State: open - Opened by QuLogic 26 days ago
- 5 comments
Labels: needs triage
#11365 - test_tokenize failures in 2024.8.1
Issue -
State: open - Opened by QuLogic 26 days ago
Labels: needs triage
#11364 - Cast indexer to minimal dtype in shuffle
Pull Request -
State: closed - Opened by phofl 26 days ago
- 1 comment
#11363 - order: not optimal scheduling for patterns where we slice subsets into 2 different datasets and then combine them again
Issue -
State: open - Opened by phofl 26 days ago
- 1 comment
Labels: dask-order
#11362 - Write indexing arrays into dask graph to reduce size for multiple xarray variables
Pull Request -
State: closed - Opened by phofl 26 days ago
- 2 comments
#11361 - Reduce memory usage of dask.order
Pull Request -
State: closed - Opened by fjetter 26 days ago
- 1 comment
#11360 - precommit autoupdate
Pull Request -
State: closed - Opened by fjetter 26 days ago
- 1 comment
#11359 - Release 2024.8.2
Pull Request -
State: closed - Opened by jrbourbeau 29 days ago
#11358 - zipfile.BadZipFile: Overlapped entries (possible zip bomb)
Issue -
State: open - Opened by leonardozilli 29 days ago
- 1 comment
Labels: dataframe, needs info
#11357 - Update zoom link for dask meeting
Pull Request -
State: closed - Opened by scharlottej13 30 days ago
- 2 comments
#11356 - Add option to automatically compute chunk sizes in dask
Issue -
State: closed - Opened by lithomas1 about 1 month ago
- 4 comments
Labels: needs triage
#11355 - Ensure tokenize is thread safe
Pull Request -
State: closed - Opened by fjetter about 1 month ago
- 4 comments
#11354 - Improve normalize_chunks calculation for "auto" setting
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 2 comments
#11353 - Futures not always resolved when using dataframe.reduction
Issue -
State: open - Opened by DaniJG about 1 month ago
Labels: needs triage, dask-expr
#11352 - Parquet read with `filesystem="arrow"` fails when `distributed` isn't imported first
Issue -
State: open - Opened by rjzamora about 1 month ago
- 3 comments
Labels: bug, dask-expr
#11351 - Deprecate legacy DataFrame implementation
Issue -
State: open - Opened by phofl about 1 month ago
- 1 comment
Labels: dataframe, deprecation
#11350 - Add changelor entries for shuffle, vindex and blockwise_reshape
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 1 comment
#11349 - map_overlap passes wrong block_info[:]['array-location']
Issue -
State: open - Opened by bnavigator about 1 month ago
- 1 comment
Labels: array
#11348 - Ensure persisted collections are released without GC
Pull Request -
State: closed - Opened by fjetter about 1 month ago
- 2 comments
#11347 - Full support for task spec in dask.order
Pull Request -
State: open - Opened by fjetter about 1 month ago
- 1 comment
#11346 - KilledWorker (exceeded 95% memory budget) with new optimizer
Issue -
State: open - Opened by noreentry about 1 month ago
- 5 comments
Labels: needs triage
#11345 - Increase visibility of GPU CI updates
Pull Request -
State: closed - Opened by charlesbluca about 1 month ago
- 1 comment
#11344 - Weird RecursionError during `tokenize`
Issue -
State: closed - Opened by hanjinliu about 1 month ago
- 5 comments
Labels: needs info
#11343 - Bug: Can't perform a (meaningful) "outer" concatenation with dask-expr on `axis=1`
Issue -
State: closed - Opened by benrutter about 1 month ago
- 1 comment
Labels: dask-expr
#11342 - Better chunk size value for chunks=auto setting
Issue -
State: open - Opened by phofl about 1 month ago
Labels: array
#11341 - Improve how normalize_chunks selects chunk sizes if auto is given
Issue -
State: closed - Opened by phofl about 1 month ago
- 2 comments
Labels: array
#11340 - Update ``numpy`` and ``pyarrow`` versions in install docs
Pull Request -
State: closed - Opened by jrbourbeau about 1 month ago
#11339 - Suggesting updates on the doc of `dask.dataframe.read_sql_query`
Issue -
State: open - Opened by ParsifalXu about 1 month ago
- 2 comments
Labels: dataframe, documentation
#11338 - Fixup dask and distributed dependencies
Pull Request -
State: closed - Opened by phofl about 1 month ago
#11337 - Choose automatically between tasks-based and p2p rechunking
Pull Request -
State: closed - Opened by hendrikmakait about 1 month ago
- 7 comments
#11336 - An inconsistency between the documentation of `dask.array.percentile` and code implementation
Issue -
State: open - Opened by ParsifalXu about 1 month ago
- 2 comments
Labels: array, documentation
#11335 - Add ``crick`` back to Python 3.11+ CI builds
Pull Request -
State: closed - Opened by jrbourbeau about 1 month ago
- 2 comments
#11334 - gpuCI failing
Issue -
State: closed - Opened by jrbourbeau about 1 month ago
- 3 comments
Labels: tests, gpu
#11333 - order: Run ordering test on distributed cluster and compare against local ordering
Issue -
State: open - Opened by phofl about 1 month ago
Labels: dask-order
#11332 - Fix docstring formatting for map_overlap
Pull Request -
State: closed - Opened by Tao-VanJS about 1 month ago
- 3 comments
#11331 - Bump `numpy>=1.24` and `pyarrow>=14.0.1` minimum versions
Pull Request -
State: closed - Opened by jrbourbeau about 1 month ago
- 4 comments
#11330 - Preserve chunksizes in vindex
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 2 comments
#11329 - `map_blocks()` with `new_axis` output has incorrect shape
Issue -
State: open - Opened by dstansby about 1 month ago
- 3 comments
Labels: array
#11328 - Implement blockwise reshape
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 2 comments
#11327 - Fix NumPy overflowing for prod on 2.0
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 2 comments
#11326 - Make rechunking in shuffle more intelligent to distribute unevenly if necessary
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 1 comment
#11325 - read_sql_table would throw an exception when calling for unique values of a column
Issue -
State: closed - Opened by phalvesmbai about 1 month ago
Labels: dataframe, io
#11324 - Add changelog entry for reshape and ordering improvements
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 2 comments
#11323 - Bump mindeps for pyarrow and numpy
Issue -
State: closed - Opened by fjetter about 1 month ago
- 3 comments
Labels: needs triage
#11322 - Avoid casting arrow dtypes to numpy object for tokenize
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 2 comments
#11321 - Revert "Test ordering on distributed scheduler (#11310)"
Pull Request -
State: closed - Opened by fjetter about 1 month ago
- 2 comments
#11320 - Ensure pickle does not change tokens
Pull Request -
State: closed - Opened by fjetter about 1 month ago
- 12 comments
#11319 - Pass additional parameters to `rechunk_p2p`
Pull Request -
State: closed - Opened by hendrikmakait about 1 month ago
- 1 comment
#11318 - cannot access local variable 'divisions' where it is not associated with a value
Issue -
State: open - Opened by Cognitus-Stuti about 1 month ago
- 1 comment
Labels: needs triage
#11317 - Rename chunksize-tolerance option
Pull Request -
State: closed - Opened by phofl about 1 month ago
- 5 comments
#11316 - Requested dask.distributed scheduler but no Client active
Issue -
State: open - Opened by Cognitus-Stuti about 2 months ago
- 5 comments
Labels: needs info
#11315 - ⚠️ Upstream CI failed ⚠️
Issue -
State: closed - Opened by github-actions[bot] about 2 months ago
- 3 comments
Labels: upstream
#11314 - Expose a blockwise - reshape operation that doesn't guarantee to keep the ordering consistent for downstream libraries
Issue -
State: closed - Opened by phofl about 2 months ago
- 5 comments
Labels: array, array-expr
#11313 - Add tests to cover more cases of new reshape implementation
Pull Request -
State: closed - Opened by phofl about 2 months ago
- 1 comment
#11312 - gpuCI broken
Issue -
State: open - Opened by fjetter about 2 months ago
- 7 comments
Labels: needs triage
#11311 - Implement automatic rechunking for shuffle
Pull Request -
State: closed - Opened by phofl about 2 months ago
- 2 comments
#11310 - Test ordering on distributed scheduler
Pull Request -
State: closed - Opened by fjetter about 2 months ago
- 3 comments
#11309 - Upgrade gpuCI and fix Dask Array failures with "cupy" backend
Pull Request -
State: closed - Opened by rjzamora about 2 months ago
- 3 comments
Labels: array, bug, gpu
#11308 - Unexpected Behavior When Using `dask.delayed` with `xarray` to Load a Chunked Dataset
Issue -
State: open - Opened by Eis-ba-er about 2 months ago
- 3 comments
Labels: needs triage