Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / dask/dask issues and pull requests

#11038 - Incompatibility with python 3.11.9

Issue - State: closed - Opened by briceruzand 8 months ago - 2 comments
Labels: dataframe, needs triage

#11036 - Remove skips for named aggregations

Pull Request - State: closed - Opened by phofl 8 months ago - 1 comment

#11035 - Fix ``dask.dataframe`` import error for Python 3.11.9

Pull Request - State: closed - Opened by rjzamora 8 months ago - 7 comments
Labels: bug

#11034 - dask-expr with drop_duplicates messes with dtypes

Issue - State: closed - Opened by aimran-adroll 8 months ago - 4 comments
Labels: needs triage

#11033 - Column aggregation with "list" produces incorrect output

Issue - State: closed - Opened by aimran-adroll 8 months ago - 2 comments
Labels: needs triage

#11031 - Print functions are wrong inside of map_blocks

Issue - State: closed - Opened by leo333000 8 months ago - 2 comments
Labels: array, needs triage

#11030 - Does not work with AWS - aiobotocore related error

Issue - State: closed - Opened by openSourcerer9000 8 months ago - 2 comments
Labels: needs triage

#11029 - Adjust `test_set_index` for "cudf" backend

Pull Request - State: closed - Opened by rjzamora 8 months ago - 2 comments

#11028 - Remove xfail tracebacks from testsuite

Pull Request - State: closed - Opened by phofl 8 months ago - 1 comment

#11027 - Fix ci for upstream pandas changes

Pull Request - State: closed - Opened by phofl 8 months ago - 1 comment

#11026 - Poor scheduling with `flox`, leading to high memory usage and eventual failure

Issue - State: closed - Opened by ivirshup 8 months ago - 16 comments
Labels: needs triage

#11025 - Use ``to/from_legacy_dataframe`` instead of ``to/from_dask_dataframe``

Pull Request - State: closed - Opened by rjzamora 8 months ago - 2 comments
Labels: dataframe, dask-expr

#11024 - Friendly import error message for dask-expr

Pull Request - State: closed - Opened by benrutter 8 months ago - 5 comments

#11023 - Fix value_counts raising if branch exists of nans only

Pull Request - State: closed - Opened by phofl 8 months ago - 1 comment

#11021 - Preserving divisions when reading/loading dataframes with structs containing multiple fields

Issue - State: open - Opened by PhilippeMoussalli 8 months ago - 1 comment
Labels: dataframe, io

#11019 - Hash join transfer with error cannot pickle '_contextvars.ContextVar' object

Issue - State: open - Opened by guozhans 8 months ago - 5 comments
Labels: dataframe, p2

#11018 - `vindex` as outer indexer: memory and time performance

Issue - State: open - Opened by ilan-gold 8 months ago
Labels: array, needs triage

#11017 - ``new_dd_object``'s array logic always assumes the metadata is ``numpy``

Issue - State: open - Opened by rjzamora 8 months ago
Labels: dataframe, array

#11016 - Minimal dd.to_datetime to convert a string column no longer works

Issue - State: closed - Opened by benrutter 8 months ago
Labels: needs triage

#11015 - .loc fails to select columns from boolean array (after dask-exp update)

Issue - State: closed - Opened by benrutter 8 months ago
Labels: needs triage

#11014 - Build nightlies on tag releases

Pull Request - State: closed - Opened by charlesbluca 8 months ago - 1 comment

#11013 - Enable custom expressions in ``dask_cudf``

Pull Request - State: closed - Opened by rjzamora 8 months ago - 1 comment

#11012 - [Docs] Add Hugging Face `hf://` to the list of `fsspec` compatible remote services

Pull Request - State: closed - Opened by lhoestq 8 months ago - 6 comments
Labels: documentation

#11011 - value_counts with NaN sometimes raises ValueError: No objects to concatenate

Issue - State: closed - Opened by m-rossi 8 months ago - 2 comments
Labels: needs triage

#11010 - Update gpuCI `RAPIDS_VER` to `24.06`

Pull Request - State: open - Opened by github-actions[bot] 8 months ago - 1 comment

#11009 - Bump actions/checkout from 4.1.1 to 4.1.2

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago - 1 comment
Labels: dependencies

#11008 - Add HypersSpy to ecosystem.rst

Pull Request - State: closed - Opened by jlaehne 8 months ago - 2 comments
Labels: documentation

#11007 - raise ImportError instead of ValueError when dask-expr cannot be imported

Pull Request - State: closed - Opened by jameslamb 8 months ago - 1 comment

#11006 - as of v2024.3.1, comparing a 1D dask.array.Array to a dask.dataframe.Series fails

Issue - State: closed - Opened by jameslamb 8 months ago - 1 comment
Labels: bug, dask-expr

#11005 - dask.dataframe.DataFrame.reduction fails on`split_every=False` if query planning is in effect

Issue - State: closed - Opened by cbourjau 8 months ago - 1 comment
Labels: needs triage

#11004 - Ensure that repack collections only return tuple if necessary

Pull Request - State: open - Opened by fjetter 8 months ago - 3 comments

#11003 - Only warn if dask-expr is not installed

Pull Request - State: closed - Opened by fjetter 8 months ago - 1 comment

#11002 - Dataframe constructed from single partition bag cannot be shuffled with query planning enabled

Issue - State: closed - Opened by b-phi 8 months ago - 2 comments
Labels: bug, dask-expr

#11001 - Dask query planning string column unique bug

Issue - State: closed - Opened by b-phi 8 months ago - 2 comments
Labels: needs triage

#11000 - dask.dataframe.Series.reduction is not available when using query planning

Issue - State: closed - Opened by cbourjau 8 months ago - 4 comments
Labels: bug, dask-expr

#10999 - TypeError: float() argument must be a string or a real number, not 'csr_matrix'

Issue - State: closed - Opened by erico-imgproj 8 months ago - 1 comment
Labels: needs triage

#10998 - dask.bag.Bag.to_dataframe behavior change in 2024.3.0 - setting dtype to string rather than object by default

Issue - State: open - Opened by kbuma 8 months ago - 4 comments
Labels: dataframe, convert-string

#10997 - Dumb code error in the Example code in Dask-SQL Homepage

Issue - State: closed - Opened by tiraldj 8 months ago - 3 comments
Labels: needs triage

#10996 - importing dask.dataframe changes pandas behaviour in 2024.3.0

Issue - State: closed - Opened by ivirshup 8 months ago - 11 comments
Labels: dask-expr

#10995 - Feedback - DataFrame query planning

Issue - State: open - Opened by fjetter 8 months ago - 7 comments
Labels: dataframe, discussion, dask-expr

#10992 - Implement setting config variables that contain the dot in name

Pull Request - State: open - Opened by dbalabka 8 months ago - 2 comments

#10991 - Combined save and calculation is using excessive memory

Issue - State: open - Opened by pp-mo 8 months ago - 3 comments
Labels: needs triage

#10986 - CI is printing tracebacks for all xfailed tests which can be very confusing

Issue - State: closed - Opened by phofl 8 months ago
Labels: needs triage

#10982 - Dask Nunique bug under dask 2024.2.1

Issue - State: open - Opened by frbelotto 8 months ago - 7 comments
Labels: dataframe

#10962 - Drop pandas 1.X support?

Issue - State: open - Opened by fjetter 9 months ago - 1 comment
Labels: dataframe, discussion

#10951 - UnicodeDecodeError when using a Dataframe with byte data and pandas 2

Issue - State: closed - Opened by danmar3 9 months ago - 2 comments
Labels: needs triage

#10949 - Issue repartitioning a time series by frequency when loaded from parquet file

Issue - State: open - Opened by pvaezi 9 months ago - 5 comments
Labels: dataframe

#10934 - [DISCUSSION] What is the timeline for `dask.dataframe` deprecation

Issue - State: closed - Opened by rjzamora 9 months ago - 9 comments
Labels: dataframe, discussion, deprecation

#10934 - [DISCUSSION] What is the timeline for `dask.dataframe` deprecation

Issue - State: closed - Opened by rjzamora 9 months ago - 9 comments
Labels: dataframe, discussion, deprecation

#10906 - Rename futures to tasks

Pull Request - State: open - Opened by milesgranger 9 months ago - 1 comment

#10896 - [DNM] Test numba tokenization

Pull Request - State: open - Opened by crusaderky 9 months ago

#10895 - Bump codecov/codecov-action from 3 to 4

Pull Request - State: open - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#10894 - Bump peter-evans/create-pull-request from 5 to 6

Pull Request - State: open - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#10893 - Test against pandas 2.0

Pull Request - State: open - Opened by crusaderky 9 months ago - 2 comments

#10893 - Test against pandas 2.0

Pull Request - State: open - Opened by crusaderky 9 months ago - 2 comments

#10892 - Fix dask-expr tests after singleton pr

Pull Request - State: closed - Opened by phofl 10 months ago - 1 comment

#10892 - Fix dask-expr tests after singleton pr

Pull Request - State: closed - Opened by phofl 10 months ago - 1 comment

#10891 - Use 3.12 as the canonical environment

Pull Request - State: open - Opened by crusaderky 10 months ago - 3 comments
Labels: upstream

#10890 - Set upper bound version for numba when pandas<2.1

Pull Request - State: closed - Opened by milesgranger 10 months ago - 1 comment

#10890 - Set upper bound version for numba when pandas<2.1

Pull Request - State: closed - Opened by milesgranger 10 months ago - 1 comment

#10889 - Set lower bound version for s3fs

Pull Request - State: closed - Opened by milesgranger 10 months ago

#10889 - Set lower bound version for s3fs

Pull Request - State: closed - Opened by milesgranger 10 months ago

#10888 - Fix mimesis API >=13.1.0 - use random.randint

Pull Request - State: open - Opened by milesgranger 10 months ago - 3 comments

#10887 - max number of tasks per dask worker

Issue - State: closed - Opened by llodds 10 months ago - 1 comment
Labels: needs triage

#10886 - Fix inplace modification on read-only arrays for string conversion

Pull Request - State: closed - Opened by phofl 10 months ago - 1 comment

#10885 - Pickle da.argwhere and da.count_nonzero

Pull Request - State: closed - Opened by crusaderky 10 months ago - 2 comments
Labels: array, bug

#10884 - [DNM] Remove redundant normalize_token variants

Pull Request - State: open - Opened by crusaderky 10 months ago - 1 comment

#10883 - [DNM] Deterministic hashing for almost everything

Pull Request - State: open - Opened by crusaderky 10 months ago - 1 comment

#10882 - Update deployment documentation

Pull Request - State: closed - Opened by mrocklin 10 months ago - 1 comment

#10881 - applying tuple with pyarrow

Issue - State: open - Opened by SurkynRik 10 months ago - 2 comments
Labels: convert-string

#10880 - A couple of dask-expr fixes for new parquet cache

Pull Request - State: closed - Opened by fjetter 10 months ago - 2 comments

#10879 - Start with dask-expr doc build

Pull Request - State: closed - Opened by phofl 10 months ago - 2 comments

#10878 - Add ``distributed.print`` and ``distributed.warn`` to API docs

Pull Request - State: closed - Opened by jrbourbeau 10 months ago - 2 comments

#10877 - Run macos ci on M1 architecture

Pull Request - State: closed - Opened by phofl 10 months ago - 1 comment

#10876 - Make tokenization more deterministic

Pull Request - State: open - Opened by crusaderky 10 months ago - 1 comment

#10875 - ``test_tokenize_function_cloudpickle`` is very flaky

Issue - State: open - Opened by phofl 10 months ago
Labels: needs triage

#10874 - Deterministic tokenize for pyarrow datatypes

Pull Request - State: closed - Opened by crusaderky 10 months ago

#10873 - Fix regression in test_graph_manipulation

Pull Request - State: closed - Opened by crusaderky 10 months ago - 3 comments

#10872 - Test tokenization of static and class methods

Pull Request - State: closed - Opened by crusaderky 10 months ago - 3 comments

#10871 - Adjust pytest errors for dask-expr ci

Pull Request - State: closed - Opened by phofl 10 months ago

#10869 - Moto 5 results in timeouts in s3 tests:

Issue - State: open - Opened by phofl 10 months ago
Labels: needs triage

#10868 - Fix pytest 8 issues

Pull Request - State: closed - Opened by phofl 10 months ago - 5 comments

#10867 - Remove warning filter from pyproject.toml

Pull Request - State: closed - Opened by phofl 10 months ago - 1 comment

#10866 - Add recommended deployment options to deployment docs

Pull Request - State: closed - Opened by jrbourbeau 10 months ago - 1 comment

#10865 - Deterministic tokenize() for global random functions

Pull Request - State: closed - Opened by crusaderky 10 months ago - 2 comments

#10864 - Allow length of ascending to be larger than one in sort_values

Pull Request - State: closed - Opened by fjetter 10 months ago - 1 comment

#10863 - Refactor: move tests for tokenize() to its own module

Pull Request - State: closed - Opened by crusaderky 10 months ago - 3 comments

#10862 - Adjust test for support of `median` in Groupby.aggregate in `dask-expr`

Pull Request - State: closed - Opened by hendrikmakait 10 months ago - 2 comments

#10860 - Temporarily pin ``mimesis<13.1.0``

Pull Request - State: closed - Opened by jrbourbeau 10 months ago

#10859 - pyright: "read_parquet" is not exported from module "dask.dataframe"

Issue - State: closed - Opened by dluks 10 months ago - 1 comment
Labels: needs triage

#10858 - Tests for dummy data generation failing

Issue - State: open - Opened by fjetter 10 months ago - 1 comment
Labels: tests, bug

#10857 - Trivial cosmetic tweaks to _testing.py

Pull Request - State: closed - Opened by crusaderky 10 months ago

#10856 - Update DataFrame examples section

Pull Request - State: closed - Opened by jrbourbeau 10 months ago - 1 comment

#10855 - ⚠️ Upstream CI failed ⚠️

Issue - State: open - Opened by github-actions[bot] 10 months ago
Labels: upstream

#10854 - numpy 2.0: fix slicing by uint64 array

Pull Request - State: closed - Opened by crusaderky 10 months ago - 1 comment
Labels: upstream