Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / dask/dask issues and pull requests

#7941 - DOC: mention tqdm-dask integration in the docs?

Issue - State: open - Opened by raybellwaves over 3 years ago - 2 comments
Labels: documentation, needs attention

#7933 - High Level Expressions

Issue - State: open - Opened by mrocklin over 3 years ago - 65 comments
Labels: highlevelgraph, needs attention

#7930 - Issue opening h5py arrays

Issue - State: open - Opened by N4321D over 3 years ago - 6 comments
Labels: array, io, needs attention

#7923 - Binops between `dd.Series` and NumPy arrays fail while inferring meta

Issue - State: open - Opened by gjoseph92 over 3 years ago - 2 comments
Labels: dataframe, array, needs attention

#7911 - Deterministic dict order

Issue - State: open - Opened by jsignell over 3 years ago - 7 comments
Labels: core, needs attention

#7899 - [FEA] API to write dask dataframes to local storage of each node in multi-node cluster

Issue - State: open - Opened by VibhuJawa over 3 years ago - 10 comments
Labels: io, needs attention

#7885 - Mixed dataframe - array graph optimization (column projection pushdown not working if final result is an array)

Issue - State: open - Opened by jorisvandenbossche over 3 years ago - 4 comments
Labels: dataframe, highlevelgraph, needs attention

#7860 - map_partitions or map_blocks with large objects eats up scheduler memory

Issue - State: open - Opened by rikturr over 3 years ago - 5 comments
Labels: scheduler, needs attention

#7859 - mysterious rechunking error

Issue - State: closed - Opened by d-v-b over 3 years ago - 6 comments
Labels: array, tests

#7850 - Unable to load ORC table using `read_orc`

Issue - State: open - Opened by lucharo over 3 years ago - 6 comments
Labels: dataframe, io, needs attention

#7828 - Updating GPU documentation

Pull Request - State: open - Opened by jnolis over 3 years ago
Labels: needs attention

#7791 - get_output_keys method for ArrayOverlapLayer that won't materialize the graph

Issue - State: open - Opened by GenevieveBuckley over 3 years ago - 1 comment
Labels: array, highlevelgraph, needs attention

#7789 - Implement `cull` method for ArrayOverlapLayer

Issue - State: open - Opened by GenevieveBuckley over 3 years ago
Labels: array, highlevelgraph, needs attention

#7788 - Find number of tasks in overlap layer without materializing the layer

Issue - State: open - Opened by GenevieveBuckley over 3 years ago - 3 comments
Labels: array, highlevelgraph, needs attention

#7781 - #7779 added Type annotations for dask.delayed

Pull Request - State: open - Opened by BrianArbuckle over 3 years ago - 12 comments
Labels: needs attention

#7779 - Type annotations for dask.delayed

Issue - State: open - Opened by twoertwein over 3 years ago - 2 comments
Labels: delayed, needs attention

#7767 - Very high peak memory-usage on "compute_chunk_sizes"

Issue - State: open - Opened by Hoeze over 3 years ago - 2 comments
Labels: array, needs attention

#7755 - Document dev process around high level graphs

Issue - State: open - Opened by mrocklin over 3 years ago - 1 comment
Labels: documentation, highlevelgraph, needs attention

#7727 - `dask.array.store` with `compute=False` and `return_stored=True`

Issue - State: open - Opened by fnattino over 3 years ago - 2 comments
Labels: array, io, needs attention

#7722 - [Discussion] Proposed layer reorganization

Issue - State: open - Opened by ian-r-rose over 3 years ago - 19 comments
Labels: discussion, highlevelgraph, needs attention

#7718 - Frobenius norm promotes float32 matrix to float64 norm

Issue - State: closed - Opened by RogerMoens over 3 years ago - 4 comments
Labels: array

#7709 - Update HighLevelGraph documentation

Issue - State: open - Opened by jsignell over 3 years ago - 24 comments
Labels: documentation, highlevelgraph, needs attention

#7708 - Delayed method in a 'delayed_pure' context produces a different Delayed key every time it is evaluated

Issue - State: closed - Opened by kdebrab over 3 years ago - 5 comments
Labels: delayed

#7708 - Delayed method in a 'delayed_pure' context produces a different Delayed key every time it is evaluated

Issue - State: closed - Opened by kdebrab over 3 years ago - 5 comments
Labels: delayed

#7702 - Allow encoding execution priorities / order in delayed objects / graph

Issue - State: open - Opened by mlondschien over 3 years ago - 3 comments
Labels: delayed, needs attention

#7702 - Allow encoding execution priorities / order in delayed objects / graph

Issue - State: open - Opened by mlondschien over 3 years ago - 3 comments
Labels: delayed, needs attention

#7686 - Use BlockwiseDep for map_blocks with block_id or block_info

Pull Request - State: open - Opened by bmerry over 3 years ago - 7 comments
Labels: array, needs attention

#7677 - Add a terminology section to the docs?

Issue - State: open - Opened by jrbourbeau over 3 years ago - 6 comments
Labels: documentation, needs attention

#7673 - da.from_zarr ignores Zarr's dimension_separator

Issue - State: open - Opened by joshmoore over 3 years ago - 9 comments
Labels: array, io, needs attention

#7655 - Array slicing HighLevelGraph layer

Pull Request - State: open - Opened by GenevieveBuckley over 3 years ago - 22 comments
Labels: array, needs attention

#7652 - Sparsely blocked/chunked arrays

Issue - State: open - Opened by system123 over 3 years ago - 15 comments
Labels: array, highlevelgraph, needs attention

#7650 - `dumps_task` in `SimpleShuffleLayer` and `BroadcastJoinLayer` unpack

Issue - State: open - Opened by gjoseph92 over 3 years ago - 1 comment
Labels: highlevelgraph, needs attention

#7639 - svd_compressed() fails for complex input

Issue - State: open - Opened by nicrie over 3 years ago - 17 comments
Labels: array, needs attention, bug

#7613 - Dask crashes or hangs during out-of-core dataframes sort

Issue - State: open - Opened by stephanie-wang over 3 years ago - 19 comments
Labels: dataframe, needs attention

#7587 - Optimization turned off when using delayed

Issue - State: open - Opened by chrisroat over 3 years ago - 4 comments
Labels: delayed, needs attention

#7574 - Duplicated keys in da.linalg functions

Issue - State: open - Opened by crusaderky over 3 years ago - 7 comments
Labels: array, needs attention

#7554 - Blockwise chunk alignment metadata incorrect

Issue - State: open - Opened by jsignell over 3 years ago - 5 comments
Labels: array, needs attention

#7550 - Slicing with broadcastable subarray fails

Issue - State: open - Opened by jakirkham over 3 years ago - 2 comments
Labels: array, needs attention

#7545 - Duplicated computations with mixed array/dataframe delayed output

Issue - State: open - Opened by chrisroat over 3 years ago - 3 comments
Labels: dataframe, array, needs attention

#7510 - Local scheduler parameter `chunksize`

Issue - State: open - Opened by jakirkham over 3 years ago - 4 comments
Labels: scheduler, needs attention

#7482 - Array unique fails with cupy backed arrays during cpu/gpu setitem

Issue - State: open - Opened by beckernick over 3 years ago - 6 comments
Labels: array, needs attention

#7482 - Array unique fails with cupy backed arrays during cpu/gpu setitem

Issue - State: open - Opened by beckernick over 3 years ago - 6 comments
Labels: array, needs attention

#7459 - [DNM] Guide updates for PipInstall changes

Pull Request - State: open - Opened by gjoseph92 over 3 years ago - 6 comments
Labels: needs attention

#7459 - [DNM] Guide updates for PipInstall changes

Pull Request - State: open - Opened by gjoseph92 over 3 years ago - 6 comments
Labels: needs attention

#7437 - Backend for scipy sparse building csr/csc matrix inefficiently

Issue - State: open - Opened by ag-tcm over 3 years ago - 1 comment
Labels: array, needs attention, enhancement

#7437 - Backend for scipy sparse building csr/csc matrix inefficiently

Issue - State: open - Opened by ag-tcm over 3 years ago - 1 comment
Labels: array, needs attention, enhancement

#7416 - compute_chunk_sizes could be more efficient for arrays with >1 dimension

Issue - State: open - Opened by alimanfoo over 3 years ago - 6 comments
Labels: array, needs attention

#7416 - compute_chunk_sizes could be more efficient for arrays with >1 dimension

Issue - State: open - Opened by alimanfoo over 3 years ago - 6 comments
Labels: array, needs attention

#7400 - bug: TypeError handling in groupby apply

Pull Request - State: open - Opened by brycedrennan over 3 years ago - 7 comments
Labels: needs attention

#7400 - bug: TypeError handling in groupby apply

Pull Request - State: open - Opened by brycedrennan over 3 years ago - 7 comments
Labels: needs attention

#7377 - map_partition performing calculations on metadata

Issue - State: open - Opened by achapkowski over 3 years ago - 9 comments
Labels: array, needs info, needs attention

#7377 - map_partition performing calculations on metadata

Issue - State: open - Opened by achapkowski over 3 years ago - 9 comments
Labels: array, needs info, needs attention

#7375 - DataFrame: handle ExtensionArrays when converting to dask.Array

Pull Request - State: open - Opened by gjoseph92 over 3 years ago
Labels: needs attention

#7375 - DataFrame: handle ExtensionArrays when converting to dask.Array

Pull Request - State: open - Opened by gjoseph92 over 3 years ago
Labels: needs attention

#7354 - Add Dask Contrib docs

Pull Request - State: open - Opened by jacobtomlinson over 3 years ago - 6 comments
Labels: needs attention

#7354 - Add Dask Contrib docs

Pull Request - State: open - Opened by jacobtomlinson over 3 years ago - 6 comments
Labels: needs attention

#7313 - Axis order from "fancy" indexing with dask.array does not match NumPy

Issue - State: open - Opened by shoyer over 3 years ago - 2 comments
Labels: array, needs attention

#7313 - Axis order from "fancy" indexing with dask.array does not match NumPy

Issue - State: open - Opened by shoyer over 3 years ago - 2 comments
Labels: array, needs attention

#7285 - clone / bind materialize the layers

Issue - State: open - Opened by crusaderky over 3 years ago - 3 comments
Labels: highlevelgraph, needs attention

#7285 - clone / bind materialize the layers

Issue - State: open - Opened by crusaderky over 3 years ago - 3 comments
Labels: highlevelgraph, needs attention

#7283 - Harmonize split_every across modules

Issue - State: open - Opened by crusaderky over 3 years ago
Labels: core, discussion, needs attention

#7283 - Harmonize split_every across modules

Issue - State: open - Opened by crusaderky over 3 years ago
Labels: core, discussion, needs attention

#7280 - Allow custom schedulers to inline futures on persist()

Issue - State: open - Opened by clarkzinzow over 3 years ago - 2 comments
Labels: scheduler, needs attention

#7280 - Allow custom schedulers to inline futures on persist()

Issue - State: open - Opened by clarkzinzow over 3 years ago - 2 comments
Labels: scheduler, needs attention

#7266 - Collaboration between Dask and Modin Dataframe

Issue - State: open - Opened by devin-petersohn over 3 years ago - 3 comments
Labels: dataframe, discussion, needs attention

#7266 - Collaboration between Dask and Modin Dataframe

Issue - State: open - Opened by devin-petersohn over 3 years ago - 3 comments
Labels: dataframe, discussion, needs attention

#7257 - passing a list of filenames to dask.array.image.imread

Pull Request - State: open - Opened by patquem over 3 years ago - 4 comments
Labels: needs attention

#7257 - passing a list of filenames to dask.array.image.imread

Pull Request - State: open - Opened by patquem over 3 years ago - 4 comments
Labels: needs attention

#7219 - dask.dataframe cannot handle timestamps-as-objects, even though Pandas can

Issue - State: open - Opened by itamarst almost 4 years ago - 23 comments
Labels: dataframe, needs attention

#7219 - dask.dataframe cannot handle timestamps-as-objects, even though Pandas can

Issue - State: open - Opened by itamarst almost 4 years ago - 23 comments
Labels: dataframe, needs attention

#7218 - Should dask.array.name be a settable property?

Issue - State: open - Opened by jsignell almost 4 years ago - 20 comments
Labels: array, needs attention

#7218 - Should dask.array.name be a settable property?

Issue - State: open - Opened by jsignell almost 4 years ago - 20 comments
Labels: array, needs attention

#6723 - Switch to different, stable hash algorithm in Bag

Issue - State: closed - Opened by itamarst about 4 years ago - 14 comments
Labels: bag

#6691 - Support from_pandas with known divisions

Issue - State: open - Opened by syagev about 4 years ago - 3 comments
Labels: dataframe

#6525 - Make sure dask array are writable via the __array__ protocol.

Pull Request - State: closed - Opened by Carreau about 4 years ago - 11 comments

#6329 - add mode example to custom aggregation

Issue - State: open - Opened by raybellwaves over 4 years ago - 6 comments
Labels: documentation

#6280 - doc: add nunique example to custom aggregation

Issue - State: closed - Opened by raybellwaves over 4 years ago - 7 comments
Labels: dataframe, documentation

#6272 - Expose intermediate rechunking logic da.reshape

Issue - State: closed - Opened by TomAugspurger over 4 years ago - 3 comments
Labels: array

#5794 - Mean implementation for datetime series

Pull Request - State: open - Opened by exemplary-citizen almost 5 years ago - 12 comments

#5679 - da.std and da.var handle complex values incorrectly

Issue - State: closed - Opened by TAdeJong almost 5 years ago - 9 comments
Labels: array

#5633 - Add partition lengths to DataFrame metadata.

Issue - State: open - Opened by hadim almost 5 years ago - 13 comments
Labels: dataframe

#5610 - Clean up warnings in doc build

Issue - State: open - Opened by TomAugspurger almost 5 years ago - 8 comments
Labels: good first issue, documentation

#5544 - chunks get combined in 4d array reshape

Issue - State: closed - Opened by rabernat about 5 years ago - 23 comments
Labels: array

#5432 - Proxying Dask (Bokeh) Web Interface on AWS SageMaker

Issue - State: open - Opened by davidtwomey about 5 years ago - 27 comments

#5051 - Implement mean for Series[datetime64[ns]] and DatetimeIndex

Issue - State: open - Opened by TomAugspurger over 5 years ago - 11 comments
Labels: good first issue, dataframe

#4974 - dd.Series.isin() broken for dict views in distributed

Issue - State: closed - Opened by gsakkis over 5 years ago - 7 comments
Labels: dataframe

#4869 - Groupby NUnique is slow and possibly buggy

Issue - State: open - Opened by bluecoconut over 5 years ago - 17 comments
Labels: dataframe

#4845 - ValueError: cannot handle a non-unique multi-index!

Issue - State: open - Opened by marberi over 5 years ago - 27 comments
Labels: dataframe

#4368 - Sort dask array

Issue - State: open - Opened by Pierre-Bartet almost 6 years ago - 13 comments
Labels: array

#4299 - Bug in array.map_blocks when providing chunks and UDF adjusts shape

Issue - State: closed - Opened by TomAugspurger almost 6 years ago - 15 comments

#4170 - Implement Dask equivalent of pandas.io.json.json_normalize

Issue - State: open - Opened by nsadeh about 6 years ago - 23 comments
Labels: dataframe, io

#3833 - Added index column name after resample (issue #3827)

Pull Request - State: closed - Opened by eric-bonfadini over 6 years ago - 3 comments

#3744 - Change doctests to Python 3 #3690

Pull Request - State: closed - Opened by eric-bonfadini over 6 years ago - 5 comments

#3650 - Performance issue on `compute` with reshaped arrays

Issue - State: closed - Opened by nbren12 over 6 years ago - 10 comments
Labels: array

#3280 - Support norm in dask.array.fft

Issue - State: closed - Opened by jakirkham over 6 years ago
Labels: array

#3258 - Documenting NumPy functions that work fine with Dask Arrays

Issue - State: closed - Opened by jakirkham over 6 years ago - 8 comments
Labels: good first issue, array, documentation

#3245 - Silencing warnings issued by numpy functions within dask.array

Issue - State: open - Opened by shoyer over 6 years ago - 30 comments
Labels: array