Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / dask/dask issues and pull requests

#10584 - SchemaError / NotImplementedError: The python type string is not implemented (yet)

Issue - State: closed - Opened by orlandombaa about 1 year ago - 1 comment
Labels: needs triage

#10583 - Handle date and decimal dtypes in parquet data

Issue - State: open - Opened by mrocklin about 1 year ago - 3 comments
Labels: needs attention, needs triage

#10582 - Reading and writing data from cloud storage or bigquery

Issue - State: closed - Opened by orlandombaa about 1 year ago - 5 comments
Labels: needs triage

#10581 - delete setup.py

Pull Request - State: closed - Opened by graingert about 1 year ago - 1 comment

#10580 - switch to PyTables HEAD [test-upstream]

Pull Request - State: closed - Opened by graingert about 1 year ago - 1 comment

#10579 - Speed up ``normalize_chunks`` for common case

Pull Request - State: closed - Opened by martindurant about 1 year ago - 6 comments
Labels: array

#10577 - unignore and fix deprecated freq aliases

Pull Request - State: closed - Opened by graingert about 1 year ago - 1 comment
Labels: dataframe, io, documentation

#10576 - Groupby "observed" being ignored

Issue - State: closed - Opened by frbelotto about 1 year ago - 1 comment
Labels: needs triage

#10575 - Upgrade ``versioneer`` to 0.29

Pull Request - State: closed - Opened by graingert about 1 year ago

#10574 - change test_concat_categorical xfail to be non-strict [test-upstream]

Pull Request - State: closed - Opened by graingert about 1 year ago - 1 comment
Labels: dataframe

#10573 - dask.dataframe groupby cov warns with DataFrameGroupBy.apply operated on the grouping columns FutureWarning

Issue - State: open - Opened by graingert about 1 year ago
Labels: needs attention, needs triage

#10572 - dask.dataframe groupby results in pandas FutureWarning: When grouping with a length-1 list-like, ...

Issue - State: open - Opened by graingert about 1 year ago
Labels: needs attention, needs triage

#10570 - scipy supports numpy v2 [test-upstream]

Pull Request - State: closed - Opened by graingert about 1 year ago
Labels: array

#10569 - scikit-image v0.22.0 merged numpy v2 support [test-upstream]

Pull Request - State: closed - Opened by graingert about 1 year ago - 1 comment
Labels: array

#10567 - Bug in the graph optimization causing exception to be thrown

Issue - State: open - Opened by AnsgarSchuffenhauer about 1 year ago
Labels: needs attention, needs triage

#10566 - DataFrameGroupBy.agg numeric_only incompatible with pd.NamedAgg

Issue - State: open - Opened by Martmists-GH about 1 year ago - 5 comments
Labels: dataframe, needs info, needs attention

#10565 - TypeError: 'type' object is not subscriptable

Issue - State: closed - Opened by ballegri about 1 year ago - 2 comments
Labels: needs info

#10564 - to_csv doesn't support file-like objects

Issue - State: open - Opened by Martmists-GH about 1 year ago - 3 comments
Labels: needs attention, needs triage

#10563 - Better name for "Fused Operation"

Issue - State: closed - Opened by mrocklin about 1 year ago - 7 comments
Labels: needs triage

#10562 - [RFC] Order assignment groups

Pull Request - State: closed - Opened by fjetter about 1 year ago - 4 comments

#10561 - Infer the shape only if chunks is not supplied by the caller of to_zarr

Pull Request - State: open - Opened by bartbroere about 1 year ago - 2 comments
Labels: array, needs attention

#10559 - Release 2023.10.0

Pull Request - State: closed - Opened by jrbourbeau about 1 year ago
Labels: documentation

#10558 - test_concat_categorical fails on pandas >= 2.2.0.dev0 or 3.12

Issue - State: open - Opened by graingert about 1 year ago - 1 comment
Labels: needs attention, needs triage

#10557 - [WIP] Rewrite dask order

Pull Request - State: closed - Opened by fjetter about 1 year ago - 8 comments

#10556 - Fix Precision Handling in `array.linalg.norm`

Pull Request - State: closed - Opened by joanrue about 1 year ago - 7 comments
Labels: array, needs attention

#10555 - Selecting an empty slice with negative step size from a dask array returns a non-empty array.

Issue - State: open - Opened by marcoheisig about 1 year ago
Labels: needs attention, needs triage

#10554 - Broken static typing of `Array.shape`

Issue - State: open - Opened by headtr1ck about 1 year ago
Labels: needs attention, needs triage

#10553 - Add typing to dask.order

Pull Request - State: closed - Opened by fjetter about 1 year ago - 1 comment

#10552 - Dispatch the ``partd.Encode`` class used for disk-based shuffling

Pull Request - State: closed - Opened by rjzamora about 1 year ago - 1 comment
Labels: dataframe, dispatch

#10551 - Support ``cudf`` backend in ``dd.DataFrame.sort_values``

Pull Request - State: closed - Opened by rjzamora about 1 year ago - 1 comment
Labels: dataframe, dispatch

#10550 - Nunique dropping NaN values

Issue - State: open - Opened by manschoe about 1 year ago - 2 comments
Labels: dataframe, needs attention

#10549 - upstream fixes

Pull Request - State: closed - Opened by graingert about 1 year ago - 8 comments
Labels: dataframe, array

#10548 - Config file path can not be updated using DASK_CONFIG

Issue - State: open - Opened by adrien-berchet about 1 year ago
Labels: needs attention, needs triage

#10547 - Improve cache hits for tuple keys in `key_split` and intern results

Pull Request - State: closed - Opened by fjetter about 1 year ago

#10546 - Unexpected behavior with Pyarrow string conversion

Issue - State: open - Opened by JaguarPaw2409 about 1 year ago - 1 comment
Labels: dataframe, needs attention, bug, convert-string

#10545 - "Failed to deserialize" when using dd.from_delayed on pd.read_sql-generated DFs

Issue - State: open - Opened by arjun-shanmugam about 1 year ago
Labels: needs attention, needs triage

#10544 - Support for Python 3.12

Pull Request - State: closed - Opened by graingert about 1 year ago - 2 comments
Labels: dataframe, array

#10543 - Add typing to empty_like

Pull Request - State: open - Opened by dstansby about 1 year ago - 2 comments
Labels: array, needs attention

#10542 - Add optimized code paths for ``drop_duplicates``

Pull Request - State: closed - Opened by rjzamora about 1 year ago - 6 comments
Labels: dataframe

#10541 - Support Dask collections in `dd.from_dict`

Issue - State: open - Opened by jrbourbeau about 1 year ago - 1 comment
Labels: dataframe, needs attention, feature

#10540 - Add `DataFrame.equals` method

Issue - State: open - Opened by jrbourbeau about 1 year ago - 2 comments
Labels: dataframe, needs attention, feature

#10539 - [WIP] Upstream fixes

Pull Request - State: open - Opened by jrbourbeau about 1 year ago - 3 comments
Labels: dataframe, array

#10538 - Dask Task Graph is choosing the wrong path, which makes me run out of RAM.

Issue - State: open - Opened by raphael7777777 about 1 year ago - 5 comments
Labels: needs attention, needs triage

#10537 - `dask.config.write` API

Issue - State: open - Opened by fjetter about 1 year ago - 1 comment
Labels: needs attention, needs triage

#10536 - Add ~/.dask to list of directories to search for config

Pull Request - State: open - Opened by mrocklin about 1 year ago - 1 comment
Labels: needs attention

#10535 - [dask.order] Reduce memory pressure for multi array reductions by releasing splitter tasks more eagerly

Pull Request - State: closed - Opened by fjetter about 1 year ago - 27 comments
Labels: needs review, enhancement

#10534 - Allow partial graph computation without full materialization

Pull Request - State: open - Opened by BrandonSmithJ about 1 year ago - 5 comments
Labels: needs attention

#10533 - astype('bool') gives wrong result, astype('boolean') yields TypeError

Issue - State: open - Opened by ligonliu about 1 year ago - 3 comments
Labels: needs triage, convert-string

#10532 - Bump actions/checkout from 4.0.0 to 4.1.0

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago
Labels: dependencies

#10531 - Match `pandas` reverting `apply` deprecation

Pull Request - State: closed - Opened by jrbourbeau about 1 year ago - 3 comments
Labels: dataframe

#10530 - Run tests with ``crick=0.0.4`` release

Pull Request - State: open - Opened by jrbourbeau about 1 year ago - 5 comments
Labels: needs attention

#10528 - Dask SVD For Matrices > 3 GB

Issue - State: closed - Opened by tommysisk about 1 year ago - 2 comments
Labels: needs attention, needs triage

#10527 - DataFrame sum produces incorrect results under specific conditions

Issue - State: closed - Opened by fbunt about 1 year ago - 3 comments

#10526 - Update gpuCI `RAPIDS_VER` to `23.12`

Pull Request - State: closed - Opened by github-actions[bot] about 1 year ago - 1 comment

#10525 - read_parquet() filters not applied for pivot_table()

Issue - State: open - Opened by kwodzicki about 1 year ago
Labels: needs attention, needs triage

#10524 - Tighten HighLevelGraph annotations

Pull Request - State: closed - Opened by crusaderky about 1 year ago - 2 comments
Labels: array

#10523 - PyArrow 12 is 50x slower on `read_parquet`

Issue - State: open - Opened by crusaderky about 1 year ago - 2 comments
Labels: dataframe, convert-string

#10522 - `dask.array.to_zarr()` does not parse for `chunks` kwarg

Issue - State: open - Opened by aaptss about 1 year ago - 4 comments
Labels: needs attention, needs triage

#10521 - Revert changes to `config.get(..., override_with=None)`

Pull Request - State: closed - Opened by crusaderky about 1 year ago - 1 comment

#10520 - Temporarily skip failing tests with ``fsspec==2023.9.1``

Pull Request - State: closed - Opened by jrbourbeau about 1 year ago - 2 comments
Labels: dataframe, io

#10519 - Incorrect behavior of override_with argument in dask.config.get

Issue - State: closed - Opened by bnaul about 1 year ago - 1 comment
Labels: needs triage

#10518 - JIT graph building

Issue - State: open - Opened by BrandonSmithJ about 1 year ago - 8 comments
Labels: needs attention, needs triage

#10517 - Groupby aggregation with custom aggregation and median causes TypeError

Issue - State: open - Opened by fbunt about 1 year ago
Labels: needs attention, needs triage

#10516 - Unexpected DataFrame map_overlap timedelta post-filtering

Issue - State: open - Opened by epizut about 1 year ago
Labels: needs attention, needs triage

#10515 - test_to_csv_nodir and test_to_json_results fail with fsspec-2023.9.1

Issue - State: closed - Opened by mgorny about 1 year ago - 4 comments
Labels: io, tests, bug

#10514 - Release 2023.9.2

Pull Request - State: closed - Opened by jrbourbeau about 1 year ago
Labels: documentation

#10513 - list dtype in from_pandas() is casted as string[pyarrow]

Issue - State: closed - Opened by VTonelli about 1 year ago - 1 comment
Labels: needs triage

#10512 - doc: add missing detail to deploying-cli.rst

Pull Request - State: open - Opened by SultanOrazbayev about 1 year ago - 1 comment
Labels: documentation, needs attention

#10511 - Dask dataframe from_map uses too much scheduler memory when enforce_metadata=True

Issue - State: closed - Opened by miltava about 1 year ago - 1 comment
Labels: needs attention, needs triage

#10510 - performance issue with da.unique

Issue - State: open - Opened by MariusCausemann about 1 year ago - 3 comments
Labels: needs triage

#10509 - Deprecation cycle for `admin.traceback.shorten`

Pull Request - State: closed - Opened by crusaderky about 1 year ago - 1 comment

#10508 - General updates to the HPC docs page

Pull Request - State: open - Opened by jacobtomlinson about 1 year ago - 4 comments
Labels: documentation, needs attention

#10507 - Breaking change of dask config schema

Issue - State: closed - Opened by fjetter about 1 year ago
Labels: needs triage

#10506 - Bugfix for writing empty array to zarr

Pull Request - State: closed - Opened by benrutter about 1 year ago - 1 comment
Labels: array, needs attention

#10505 - [WIP] Fixes for dask.order - Remove change of tactical goal in single dep path

Pull Request - State: closed - Opened by fjetter about 1 year ago - 6 comments

#10504 - Dask reads list-type parquet columns as strings

Issue - State: closed - Opened by lessthantrue about 1 year ago - 3 comments
Labels: dataframe

#10503 - Use ``register_plugin`` in tests

Pull Request - State: closed - Opened by jrbourbeau about 1 year ago

#10502 - CI failing due to plugin deprecations

Issue - State: closed - Opened by jrbourbeau about 1 year ago - 1 comment
Labels: tests

#10501 - Make `preserve_index` explicit in `pyarrow_schema_dispatch`

Pull Request - State: closed - Opened by hendrikmakait about 1 year ago - 2 comments
Labels: dataframe, dispatch

#10500 - Add `**kwargs` support for `pyarrow_schema_dispatch`

Pull Request - State: closed - Opened by hendrikmakait about 1 year ago - 4 comments
Labels: dataframe, dispatch

#10499 - Overhaul deprecations system in ``dask.config``

Pull Request - State: closed - Opened by crusaderky about 1 year ago - 3 comments

#10498 - Avoid materializing all iterator in delayed tasks

Pull Request - State: closed - Opened by jrbourbeau about 1 year ago - 2 comments

#10497 - File objects fully read when embedded in `delayed` tasks

Issue - State: closed - Opened by jrbourbeau about 1 year ago - 4 comments
Labels: io, delayed

#10496 - `get_default_shuffle_method` raises if `pyarrow` is outdated

Pull Request - State: closed - Opened by hendrikmakait about 1 year ago - 6 comments

#10495 - Centralize and type ``no_default``

Pull Request - State: closed - Opened by crusaderky about 1 year ago
Labels: dataframe

#10494 - Incompatibility with package db-dtypes results in casts of object to string types

Issue - State: open - Opened by HHarald99 about 1 year ago - 2 comments
Labels: needs attention, needs triage, convert-string

#10493 - Fix `_partitions` `dtype` in `meta` for `DataFrame.set_index` and `DataFrame.sort_values`

Pull Request - State: closed - Opened by hendrikmakait about 1 year ago - 6 comments
Labels: dataframe

#10492 - Bump actions/checkout from 3.6.0 to 4.0.0

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago
Labels: dependencies

#10490 - Handle `cached_property` decorators in `derived_from`

Pull Request - State: closed - Opened by wence- about 1 year ago

#10489 - Test documentation build

Pull Request - State: closed - Opened by jrbourbeau about 1 year ago

#10488 - Skip failing tests for ``pandas=2.1.0``

Pull Request - State: closed - Opened by phofl about 1 year ago
Labels: dataframe, io

#10487 - Special handling for None in DASK_ environment variables

Pull Request - State: closed - Opened by crusaderky about 1 year ago

#10486 - Change `to_parquet` when `compute=False` so that we allow more user control

Pull Request - State: open - Opened by hyenal about 1 year ago - 1 comment
Labels: dataframe, io, needs attention

#10485 - Stricter data type for dask keys

Pull Request - State: closed - Opened by crusaderky about 1 year ago
Labels: bag, dataframe, array, documentation, delayed

#10484 - Simplify some tests that import distributed

Pull Request - State: closed - Opened by crusaderky about 1 year ago
Labels: tests

#10483 - Remove support for ``np.int64`` in keys

Pull Request - State: closed - Opened by crusaderky about 1 year ago