Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / rapidsai/cudf issues and pull requests

#16574 - Performance improvement for strings::slice for wide strings

Pull Request - State: open - Opened by davidwendt 3 months ago - 1 comment
Labels: 3 - Ready for Review, libcudf, strings, improvement, non-breaking

#16570 - CI: Test against old versions of key dependencies

Pull Request - State: closed - Opened by seberg 3 months ago - 15 comments
Labels: Python, improvement, non-breaking, cudf.pandas

#16563 - Rework strings::slice benchmark to use nvbench

Pull Request - State: open - Opened by davidwendt 3 months ago
Labels: 2 - In Progress, libcudf, CMake, improvement, non-breaking

#16562 - [FEA] Add an environment variable to fail on fallback in `cudf.pandas`

Pull Request - State: open - Opened by Matt711 3 months ago - 9 comments
Labels: feature request, 2 - In Progress, Python, non-breaking, cudf.pandas

#16561 - Prototype get_json_object

Pull Request - State: open - Opened by karthikeyann 3 months ago
Labels: feature request, 2 - In Progress, libcudf, CMake, cuIO, Java, Spark, non-breaking

#16560 - [BUG] Dask cov operation is broken

Issue - State: open - Opened by rjzamora 3 months ago
Labels: bug, dask

#16559 - Switch python version to `3.10` in `cudf.pandas` pandas test scripts

Pull Request - State: closed - Opened by galipremsagar 3 months ago - 1 comment
Labels: bug, non-breaking

#16558 - [DO NOT MERGE] Allow NumPy 2 + CuPy 13.2

Pull Request - State: open - Opened by seberg 3 months ago - 1 comment
Labels: bug, Python, non-breaking

#16557 - [BUG]cannot pip install on linux (Ubuntu)

Issue - State: open - Opened by SHIMURA0 3 months ago - 1 comment
Labels: bug

#16556 - Reenable arrow tests

Pull Request - State: open - Opened by vyasr 3 months ago - 1 comment
Labels: tests, libcudf, CMake, improvement, non-breaking

#16555 - Implement order preserving groupby in cudf-polars

Pull Request - State: open - Opened by lithomas1 3 months ago
Labels: feature request, Python, non-breaking, cudf.polars

#16554 - [FEA] Add support for `cudf.unique`

Pull Request - State: open - Opened by Matt711 3 months ago - 1 comment
Labels: feature request, Python, non-breaking, cuDF (Python)

#16553 - Clean up reshaping ops

Pull Request - State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, non-breaking

#16552 - Ensure managed memory is supported in cudf.pandas.

Pull Request - State: open - Opened by bdice 3 months ago
Labels: bug, Python, non-breaking, cudf.pandas, cudf.polars, pylibcudf

#16551 - [BUG] Consider disabling managed memory in cudf.pandas on WSL2

Issue - State: open - Opened by vyasr 3 months ago - 4 comments
Labels: bug

#16549 - Disallow cudf.Index accepting column in favor of ._from_column

Pull Request - State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, breaking

#16548 - Rewrite remaining Python Arrow interop conversions using the C Data Interface

Pull Request - State: open - Opened by vyasr 3 months ago - 2 comments
Labels: 3 - Ready for Review, libcudf, Python, CMake, improvement, non-breaking, pylibcudf

#16547 - Test cudf-polars on ARM64 as well

Pull Request - State: closed - Opened by lithomas1 3 months ago - 1 comment
Labels: Python, improvement, non-breaking, cudf.polars

#16546 - Hide all gtest symbols in cudftestutil

Pull Request - State: open - Opened by robertmaynard 3 months ago
Labels: bug, 3 - Ready for Review, libcudf, CMake, non-breaking

#16545 - [REVIEW] JSON host tree algorithms

Pull Request - State: closed - Opened by shrshi 3 months ago - 7 comments
Labels: libcudf, 5 - Ready to Merge, cuIO, Java, improvement, non-breaking

#16544 - [FEA] Add support for manual switching from CPU to GPU in `cudf.pandas`

Issue - State: closed - Opened by Matt711 3 months ago - 1 comment
Labels: feature request, cudf.pandas

#16543 - Update chunked parquet reader benchmarks

Pull Request - State: open - Opened by sdrp713 3 months ago - 2 comments
Labels: libcudf, improvement, non-breaking

#16541 - Refactor dictionary encoding in PQ writer to migrate to the new `cuco::static_map`

Pull Request - State: closed - Opened by mhaseeb123 3 months ago - 8 comments
Labels: libcudf, 5 - Ready to Merge, cuIO, improvement, breaking, cuco

#16540 - Remove hardcoded versions from workflows.

Pull Request - State: closed - Opened by bdice 3 months ago - 1 comment
Labels: improvement, non-breaking

#16539 - Test dropping Python 3.9 from shared workflows.

Pull Request - State: closed - Opened by bdice 3 months ago - 1 comment

#16538 - Parquet reader list microkernel

Pull Request - State: open - Opened by pmattione-nvidia 3 months ago - 4 comments
Labels: libcudf, Performance, improvement, non-breaking

#16536 - Update the java code to properly deal with lists being returned as strings

Pull Request - State: closed - Opened by revans2 3 months ago - 1 comment
Labels: bug, 3 - Ready for Review, 4 - Needs Review, Java, Spark, non-breaking, cuDF (Java)

#16535 - Register `read_parquet` and `read_csv` with dask-expr

Pull Request - State: closed - Opened by rjzamora 3 months ago - 1 comment
Labels: bug, Python, 5 - Ready to Merge, dask, non-breaking

#16534 - Add dictionary support to cudf::row_bit_count

Pull Request - State: open - Opened by davidwendt 3 months ago
Labels: 2 - In Progress, libcudf, improvement, non-breaking

#16533 - [FEA] Support duplicate column labels in cudf.DataFrame

Issue - State: open - Opened by mroeschke 3 months ago
Labels: feature request, cudf.pandas

#16532 - Ensure comparisons with pyints and integer series always succeed

Pull Request - State: closed - Opened by seberg 3 months ago - 4 comments
Labels: Python, improvement, non-breaking

#16531 - Remove unneeded output size parameter from internal count_matches utility

Pull Request - State: closed - Opened by davidwendt 3 months ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#16530 - Remove invalid column_view usage in string-scalar-to-column function

Pull Request - State: open - Opened by davidwendt 3 months ago
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#16529 - Change cudf::empty_like to not include offsets for empty strings columns

Pull Request - State: closed - Opened by davidwendt 3 months ago - 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking

#16528 - [FEA] Support named aggregations in `df.groupby().agg()`

Pull Request - State: open - Opened by Matt711 3 months ago - 5 comments
Labels: feature request, 3 - Ready for Review, Python, non-breaking

#16527 - Fix DataFrame reductions with median returning scalar instead of Series

Pull Request - State: open - Opened by mroeschke 3 months ago
Labels: bug, Python, non-breaking

#16526 - [BUG] Series.value_counts hangs with over 1B rows of input

Issue - State: open - Opened by bdice 3 months ago - 2 comments
Labels: bug, libcudf

#16525 - Raise NotImplementedError for Series.rename that's not a scalar

Pull Request - State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, non-breaking

#16524 - Remove deprecated public APIs from libcudf

Pull Request - State: closed - Opened by davidwendt 3 months ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#16523 - Return Interval object in pandas compat mode for IntervalIndex reductions

Pull Request - State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, non-breaking

#16522 - [FEA] Make the cudf.pandas profiler show time taken by other, non-pandas functions

Pull Request - State: closed - Opened by Matt711 3 months ago - 6 comments
Labels: feature request, Python, non-breaking, cudf.pandas

#16521 - [FEA] Add support for `cudf.DataFrame.aggregate`

Issue - State: open - Opened by Matt711 3 months ago
Labels: feature request, Python

#16520 - Update json normalization to take device_buffer

Pull Request - State: closed - Opened by karthikeyann 3 months ago - 1 comment
Labels: 3 - Ready for Review, Needs Triage, libcudf, cuIO, improvement, non-breaking, Needs build-infra

#16519 - Allow DataFrame.sort_values(by=) to select an index level

Pull Request - State: closed - Opened by mroeschke 3 months ago - 1 comment
Labels: bug, Python, non-breaking

#16518 - Rework cudf::io::text::byte_range_info class member functions

Pull Request - State: closed - Opened by davidwendt 3 months ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#16517 - Use cudf datagen within tpch queries

Pull Request - State: closed - Opened by JayjeetAtGithub 3 months ago
Labels: libcudf, CMake, improvement, non-breaking

#16516 - Fix `date_range(start, end, freq)` when end-start is divisible by freq

Pull Request - State: closed - Opened by mroeschke 3 months ago - 2 comments
Labels: bug, Python, non-breaking

#16515 - Preserve array name in MultiIndex.from_arrays

Pull Request - State: closed - Opened by mroeschke 3 months ago - 1 comment
Labels: bug, Python, non-breaking

#16514 - Disallow indexing by selecting duplicate labels

Pull Request - State: closed - Opened by mroeschke 3 months ago - 3 comments
Labels: bug, Python, non-breaking

#16513 - Fix `.replace(Index, Index)` raising a TypeError

Pull Request - State: open - Opened by mroeschke 3 months ago
Labels: bug, Python, non-breaking

#16512 - [FEA] Adjust libcudf to not load cuFile by default

Issue - State: closed - Opened by GregoryKimball 3 months ago
Labels: feature request, libcudf, cuIO

#16511 - Remove unneeded pair-iterator benchmark

Pull Request - State: closed - Opened by davidwendt 3 months ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#16507 - [BUG] cuDF and Pandas return different results for ...

Issue - State: open - Opened by Matt711 3 months ago - 3 comments
Labels: bug, Python, cudf.pandas

#16502 - Pass batch size to JSON reader using environment variable

Pull Request - State: closed - Opened by shrshi 3 months ago - 5 comments
Labels: libcudf, CMake, 5 - Ready to Merge, cuIO, improvement, non-breaking

#16501 - Remove a deprecated multibyte_split API

Pull Request - State: closed - Opened by davidwendt 3 months ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#16500 - Implement cudf-polars datetime extraction methods

Pull Request - State: open - Opened by lithomas1 3 months ago - 8 comments
Labels: feature request, Python, non-breaking, cudf.polars, pylibcudf

#16499 - AWS S3 IO through KvikIO

Pull Request - State: closed - Opened by madsbk 3 months ago - 3 comments
Labels: libcudf, Python, improvement, non-breaking, pylibcudf

#16498 - Add interop example for `arrow::StringViewArray` to `cudf::column`

Pull Request - State: closed - Opened by JayjeetAtGithub 3 months ago - 3 comments
Labels: libcudf, CMake, improvement, non-breaking

#16497 - Add keep option to distinct nvbench

Pull Request - State: closed - Opened by bdice 3 months ago - 3 comments
Labels: libcudf, CMake, improvement, non-breaking

#16493 - Check index bounds in compact protocol reader.

Pull Request - State: closed - Opened by bdice 3 months ago - 2 comments
Labels: bug, libcudf, non-breaking

#16491 - [PERF] looping through dataframe is 100x slower than when running without cudf

Issue - State: open - Opened by magnus-ekman 4 months ago - 3 comments
Labels: bug

#16485 - Refactor `histogram` reduction using `cuco::static_set::insert_and_find`

Pull Request - State: closed - Opened by srinivasyadav18 4 months ago - 7 comments
Labels: libcudf, CMake, 5 - Ready to Merge, Performance, improvement, non-breaking

#16484 - Refactor `distinct` using `static_map` `insert_or_apply`

Pull Request - State: open - Opened by srinivasyadav18 4 months ago - 7 comments
Labels: libcudf, improvement, non-breaking

#16481 - [FEA] Full coverage of datetime methods in cudf-polars

Issue - State: open - Opened by wence- 4 months ago
Labels: feature request, cudf.polars

#16480 - [FEA] Full coverage of stringfunction methods in cudf polars

Issue - State: open - Opened by wence- 4 months ago
Labels: feature request, cudf.polars

#16479 - [FEA] Support cross-casting to/from strings in cudf-polars

Issue - State: open - Opened by wence- 4 months ago - 1 comment
Labels: feature request, cudf.polars

#16478 - [FEA] Support scan-based aggregations in cudf-polars

Issue - State: closed - Opened by wence- 4 months ago - 1 comment
Labels: feature request, cudf.polars

#16477 - [FEA] Support order-preserving groupby option in cudf-polars

Issue - State: open - Opened by wence- 4 months ago
Labels: feature request, cudf.polars

#16476 - Implement Kleene logic handling for Any/All and bitwise Or/And

Pull Request - State: closed - Opened by wence- 4 months ago - 1 comment
Labels: Python, improvement, non-breaking, cudf.polars

#16474 - Use numba-cuda>=0.0.13

Pull Request - State: closed - Opened by gmarkall 4 months ago - 12 comments
Labels: numba, Python, improvement, non-breaking

#16472 - enable list to be forced as string in JSON reader.

Pull Request - State: closed - Opened by karthikeyann 4 months ago - 5 comments
Labels: feature request, 3 - Ready for Review, libcudf, cuIO, breaking

#16466 - Fix all-empty input column for strings split APIs

Pull Request - State: closed - Opened by davidwendt 4 months ago - 3 comments
Labels: bug, 3 - Ready for Review, libcudf, Python, strings, non-breaking

#16453 - [BUG] `split_record` output empty list for empty input string

Issue - State: closed - Opened by ttnghia 4 months ago - 5 comments
Labels: bug

#16450 - [FEA] Add support for `cudf.Timestamp`

Pull Request - State: closed - Opened by Matt711 4 months ago - 5 comments
Labels: feature request, Python, non-breaking, cudf.pandas

#16448 - Compute whole column variance using numerically stable approach

Pull Request - State: closed - Opened by wence- 4 months ago - 2 comments
Labels: bug, libcudf, Python, Java, non-breaking

#16444 - [BUG] whole-column variance calculation uses numerically unstable algorithm

Issue - State: closed - Opened by wence- 4 months ago - 2 comments
Labels: bug, libcudf

#16443 - [DOC] low-memory reader options not very discoverable

Issue - State: open - Opened by wence- 4 months ago - 3 comments
Labels: doc, Python

#16440 - Fix parquet_field_list read_func lambda capture invalid this pointer

Pull Request - State: closed - Opened by davidwendt 4 months ago - 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking

#16428 - Enable cudf.pandas REPL and -c command support

Pull Request - State: closed - Opened by bdice 4 months ago - 3 comments
Labels: feature request, 3 - Ready for Review, Python, non-breaking, cudf.pandas

#16423 - Update docs of the TPC-H derived examples

Pull Request - State: closed - Opened by JayjeetAtGithub 4 months ago - 1 comment
Labels: libcudf, improvement, non-breaking

#16390 - [FEA] libcudf read json lines mode support nrows

Issue - State: open - Opened by lithomas1 4 months ago - 2 comments
Labels: feature request, libcudf, cuIO, cudf.polars

#16386 - Use `make_host_vector` instead of `make_std_vector` to facilitate pinned memory optimizations

Pull Request - State: open - Opened by vuule 4 months ago - 2 comments
Labels: 3 - Ready for Review, libcudf, Performance, improvement, non-breaking

#16383 - [FEA] Add cudf-polars to test.yaml

Issue - State: open - Opened by lithomas1 4 months ago - 1 comment
Labels: tests, cudf.polars

#16306 - Pylibcudf polars date from str

Pull Request - State: open - Opened by brandon-b-miller 4 months ago - 10 comments
Labels: feature request, Python, CMake, non-breaking, cudf.polars, pylibcudf

#16304 - Avoid decoding long runs in a single thread

Pull Request - State: open - Opened by gerashegalov 4 months ago - 4 comments
Labels: feature request, libcudf, 4 - Needs Review, cuIO, Performance, Spark, non-breaking

#16300 - Rebuild w/NumPy 2 (restrict to 1)

Pull Request - State: open - Opened by jakirkham 4 months ago - 1 comment
Labels: Python, improvement, non-breaking

#16299 - Setup pylibcudf package

Pull Request - State: open - Opened by lithomas1 4 months ago - 6 comments
Labels: feature request, Python, CMake, non-breaking, cudf.pandas, cudf.polars, pylibcudf

#16294 - Add a libcudf/thrust-based TPC-H derived datagen

Pull Request - State: open - Opened by JayjeetAtGithub 4 months ago - 2 comments
Labels: feature request, libcudf, CMake, non-breaking

#16286 - Initial investigation into NumPy proxying in `cudf.pandas`

Pull Request - State: open - Opened by Matt711 4 months ago - 8 comments
Labels: feature request, Python, non-breaking, cudf.pandas

#16282 - [BUG] Integer promotion fixes needed for NumPy 2 for comparison operators

Issue - State: closed - Opened by seberg 4 months ago - 2 comments

#16272 - [FEA] Refactor Column/NamedColumn split in cudf-polars

Issue - State: closed - Opened by wence- 4 months ago
Labels: feature request, 0 - Backlog, cudf.polars

#16271 - Fix issue in horizontal concat implementation in cudf-polars

Pull Request - State: closed - Opened by wence- 4 months ago - 1 comment
Labels: Python, improvement, non-breaking, cudf.polars, pylibcudf

#16266 - Cleanup how args and kwargs are passed in `_fast_slow_function_call`

Pull Request - State: open - Opened by Matt711 4 months ago - 9 comments
Labels: bug, Python, non-breaking, cudf.pandas
