Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / rapidsai/cudf issues and pull requests
#16574 - Performance improvement for strings::slice for wide strings
Pull Request -
State: open - Opened by davidwendt 3 months ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, strings, improvement, non-breaking
#16570 - CI: Test against old versions of key dependencies
Pull Request -
State: closed - Opened by seberg 3 months ago
- 15 comments
Labels: Python, improvement, non-breaking, cudf.pandas
#16563 - Rework strings::slice benchmark to use nvbench
Pull Request -
State: open - Opened by davidwendt 3 months ago
Labels: 2 - In Progress, libcudf, CMake, improvement, non-breaking
#16562 - [FEA] Add an environment variable to fail on fallback in `cudf.pandas`
Pull Request -
State: open - Opened by Matt711 3 months ago
- 9 comments
Labels: feature request, 2 - In Progress, Python, non-breaking, cudf.pandas
#16561 - Prototype get_json_object
Pull Request -
State: open - Opened by karthikeyann 3 months ago
Labels: feature request, 2 - In Progress, libcudf, CMake, cuIO, Java, Spark, non-breaking
#16560 - [BUG] Dask cov operation is broken
Issue -
State: open - Opened by rjzamora 3 months ago
Labels: bug, dask
#16559 - Switch python version to `3.10` in `cudf.pandas` pandas test scripts
Pull Request -
State: closed - Opened by galipremsagar 3 months ago
- 1 comment
Labels: bug, non-breaking
#16558 - [DO NOT MERGE] Allow NumPy 2 + CuPy 13.2
Pull Request -
State: open - Opened by seberg 3 months ago
- 1 comment
Labels: bug, Python, non-breaking
#16557 - [BUG]cannot pip install on linux (Ubuntu)
Issue -
State: open - Opened by SHIMURA0 3 months ago
- 1 comment
Labels: bug
#16556 - Reenable arrow tests
Pull Request -
State: open - Opened by vyasr 3 months ago
- 1 comment
Labels: tests, libcudf, CMake, improvement, non-breaking
#16555 - Implement order preserving groupby in cudf-polars
Pull Request -
State: open - Opened by lithomas1 3 months ago
Labels: feature request, Python, non-breaking, cudf.polars
#16554 - [FEA] Add support for `cudf.unique`
Pull Request -
State: open - Opened by Matt711 3 months ago
- 1 comment
Labels: feature request, Python, non-breaking, cuDF (Python)
#16553 - Clean up reshaping ops
Pull Request -
State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, non-breaking
#16552 - Ensure managed memory is supported in cudf.pandas.
Pull Request -
State: open - Opened by bdice 3 months ago
Labels: bug, Python, non-breaking, cudf.pandas, cudf.polars, pylibcudf
#16551 - [BUG] Consider disabling managed memory in cudf.pandas on WSL2
Issue -
State: open - Opened by vyasr 3 months ago
- 4 comments
Labels: bug
#16550 - [BUG]: `cudf.concat([empty DataFrame, empty DataFrame])` does not resolve axis types
Issue -
State: open - Opened by mroeschke 3 months ago
Labels: bug
#16549 - Disallow cudf.Index accepting column in favor of ._from_column
Pull Request -
State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, breaking
#16548 - Rewrite remaining Python Arrow interop conversions using the C Data Interface
Pull Request -
State: open - Opened by vyasr 3 months ago
- 2 comments
Labels: 3 - Ready for Review, libcudf, Python, CMake, improvement, non-breaking, pylibcudf
#16547 - Test cudf-polars on ARM64 as well
Pull Request -
State: closed - Opened by lithomas1 3 months ago
- 1 comment
Labels: Python, improvement, non-breaking, cudf.polars
#16546 - Hide all gtest symbols in cudftestutil
Pull Request -
State: open - Opened by robertmaynard 3 months ago
Labels: bug, 3 - Ready for Review, libcudf, CMake, non-breaking
#16545 - [REVIEW] JSON host tree algorithms
Pull Request -
State: closed - Opened by shrshi 3 months ago
- 7 comments
Labels: libcudf, 5 - Ready to Merge, cuIO, Java, improvement, non-breaking
#16544 - [FEA] Add support for manual switching from CPU to GPU in `cudf.pandas`
Issue -
State: closed - Opened by Matt711 3 months ago
- 1 comment
Labels: feature request, cudf.pandas
#16543 - Update chunked parquet reader benchmarks
Pull Request -
State: open - Opened by sdrp713 3 months ago
- 2 comments
Labels: libcudf, improvement, non-breaking
#16542 - [BUG] result indices in `group_argmin` was not initialized to -1 as comment says
Issue -
State: open - Opened by thirtiseven 3 months ago
Labels: bug
#16541 - Refactor dictionary encoding in PQ writer to migrate to the new `cuco::static_map`
Pull Request -
State: closed - Opened by mhaseeb123 3 months ago
- 8 comments
Labels: libcudf, 5 - Ready to Merge, cuIO, improvement, breaking, cuco
#16540 - Remove hardcoded versions from workflows.
Pull Request -
State: closed - Opened by bdice 3 months ago
- 1 comment
Labels: improvement, non-breaking
#16539 - Test dropping Python 3.9 from shared workflows.
Pull Request -
State: closed - Opened by bdice 3 months ago
- 1 comment
#16538 - Parquet reader list microkernel
Pull Request -
State: open - Opened by pmattione-nvidia 3 months ago
- 4 comments
Labels: libcudf, Performance, improvement, non-breaking
#16537 - [FEA] Add CI job to validate that `cudf.pandas` can be imported for all supported minor versions of pandas
Issue -
State: open - Opened by mroeschke 3 months ago
Labels: feature request, cudf.pandas
#16536 - Update the java code to properly deal with lists being returned as strings
Pull Request -
State: closed - Opened by revans2 3 months ago
- 1 comment
Labels: bug, 3 - Ready for Review, 4 - Needs Review, Java, Spark, non-breaking, cuDF (Java)
#16535 - Register `read_parquet` and `read_csv` with dask-expr
Pull Request -
State: closed - Opened by rjzamora 3 months ago
- 1 comment
Labels: bug, Python, 5 - Ready to Merge, dask, non-breaking
#16534 - Add dictionary support to cudf::row_bit_count
Pull Request -
State: open - Opened by davidwendt 3 months ago
Labels: 2 - In Progress, libcudf, improvement, non-breaking
#16533 - [FEA] Support duplicate column labels in cudf.DataFrame
Issue -
State: open - Opened by mroeschke 3 months ago
Labels: feature request, cudf.pandas
#16532 - Ensure comparisons with pyints and integer series always succeed
Pull Request -
State: closed - Opened by seberg 3 months ago
- 4 comments
Labels: Python, improvement, non-breaking
#16531 - Remove unneeded output size parameter from internal count_matches utility
Pull Request -
State: closed - Opened by davidwendt 3 months ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#16530 - Remove invalid column_view usage in string-scalar-to-column function
Pull Request -
State: open - Opened by davidwendt 3 months ago
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#16529 - Change cudf::empty_like to not include offsets for empty strings columns
Pull Request -
State: closed - Opened by davidwendt 3 months ago
- 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking
#16528 - [FEA] Support named aggregations in `df.groupby().agg()`
Pull Request -
State: open - Opened by Matt711 3 months ago
- 5 comments
Labels: feature request, 3 - Ready for Review, Python, non-breaking
#16527 - Fix DataFrame reductions with median returning scalar instead of Series
Pull Request -
State: open - Opened by mroeschke 3 months ago
Labels: bug, Python, non-breaking
#16526 - [BUG] Series.value_counts hangs with over 1B rows of input
Issue -
State: open - Opened by bdice 3 months ago
- 2 comments
Labels: bug, libcudf
#16525 - Raise NotImplementedError for Series.rename that's not a scalar
Pull Request -
State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, non-breaking
#16524 - Remove deprecated public APIs from libcudf
Pull Request -
State: closed - Opened by davidwendt 3 months ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#16523 - Return Interval object in pandas compat mode for IntervalIndex reductions
Pull Request -
State: open - Opened by mroeschke 3 months ago
Labels: Python, improvement, non-breaking
#16522 - [FEA] Make the cudf.pandas profiler show time taken by other, non-pandas functions
Pull Request -
State: closed - Opened by Matt711 3 months ago
- 6 comments
Labels: feature request, Python, non-breaking, cudf.pandas
#16521 - [FEA] Add support for `cudf.DataFrame.aggregate`
Issue -
State: open - Opened by Matt711 3 months ago
Labels: feature request, Python
#16520 - Update json normalization to take device_buffer
Pull Request -
State: closed - Opened by karthikeyann 3 months ago
- 1 comment
Labels: 3 - Ready for Review, Needs Triage, libcudf, cuIO, improvement, non-breaking, Needs build-infra
#16519 - Allow DataFrame.sort_values(by=) to select an index level
Pull Request -
State: closed - Opened by mroeschke 3 months ago
- 1 comment
Labels: bug, Python, non-breaking
#16518 - Rework cudf::io::text::byte_range_info class member functions
Pull Request -
State: closed - Opened by davidwendt 3 months ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#16517 - Use cudf datagen within tpch queries
Pull Request -
State: closed - Opened by JayjeetAtGithub 3 months ago
Labels: libcudf, CMake, improvement, non-breaking
#16516 - Fix `date_range(start, end, freq)` when end-start is divisible by freq
Pull Request -
State: closed - Opened by mroeschke 3 months ago
- 2 comments
Labels: bug, Python, non-breaking
#16515 - Preserve array name in MultiIndex.from_arrays
Pull Request -
State: closed - Opened by mroeschke 3 months ago
- 1 comment
Labels: bug, Python, non-breaking
#16514 - Disallow indexing by selecting duplicate labels
Pull Request -
State: closed - Opened by mroeschke 3 months ago
- 3 comments
Labels: bug, Python, non-breaking
#16513 - Fix `.replace(Index, Index)` raising a TypeError
Pull Request -
State: open - Opened by mroeschke 3 months ago
Labels: bug, Python, non-breaking
#16512 - [FEA] Adjust libcudf to not load cuFile by default
Issue -
State: closed - Opened by GregoryKimball 3 months ago
Labels: feature request, libcudf, cuIO
#16511 - Remove unneeded pair-iterator benchmark
Pull Request -
State: closed - Opened by davidwendt 3 months ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#16507 - [BUG] cuDF and Pandas return different results for ...
Issue -
State: open - Opened by Matt711 3 months ago
- 3 comments
Labels: bug, Python, cudf.pandas
#16502 - Pass batch size to JSON reader using environment variable
Pull Request -
State: closed - Opened by shrshi 3 months ago
- 5 comments
Labels: libcudf, CMake, 5 - Ready to Merge, cuIO, improvement, non-breaking
#16501 - Remove a deprecated multibyte_split API
Pull Request -
State: closed - Opened by davidwendt 3 months ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#16500 - Implement cudf-polars datetime extraction methods
Pull Request -
State: open - Opened by lithomas1 3 months ago
- 8 comments
Labels: feature request, Python, non-breaking, cudf.polars, pylibcudf
#16499 - AWS S3 IO through KvikIO
Pull Request -
State: closed - Opened by madsbk 3 months ago
- 3 comments
Labels: libcudf, Python, improvement, non-breaking, pylibcudf
#16498 - Add interop example for `arrow::StringViewArray` to `cudf::column`
Pull Request -
State: closed - Opened by JayjeetAtGithub 3 months ago
- 3 comments
Labels: libcudf, CMake, improvement, non-breaking
#16497 - Add keep option to distinct nvbench
Pull Request -
State: closed - Opened by bdice 3 months ago
- 3 comments
Labels: libcudf, CMake, improvement, non-breaking
#16493 - Check index bounds in compact protocol reader.
Pull Request -
State: closed - Opened by bdice 3 months ago
- 2 comments
Labels: bug, libcudf, non-breaking
#16491 - [PERF] looping through dataframe is 100x slower than when running without cudf
Issue -
State: open - Opened by magnus-ekman 4 months ago
- 3 comments
Labels: bug
#16490 - [FEA] Have `cudf::make_empty_column(cudf::type_id::STRING)` return a column with a child column of empty offsets
Issue -
State: closed - Opened by mroeschke 4 months ago
- 3 comments
Labels: feature request
#16485 - Refactor `histogram` reduction using `cuco::static_set::insert_and_find`
Pull Request -
State: closed - Opened by srinivasyadav18 4 months ago
- 7 comments
Labels: libcudf, CMake, 5 - Ready to Merge, Performance, improvement, non-breaking
#16484 - Refactor `distinct` using `static_map` `insert_or_apply`
Pull Request -
State: open - Opened by srinivasyadav18 4 months ago
- 7 comments
Labels: libcudf, improvement, non-breaking
#16481 - [FEA] Full coverage of datetime methods in cudf-polars
Issue -
State: open - Opened by wence- 4 months ago
Labels: feature request, cudf.polars
#16480 - [FEA] Full coverage of stringfunction methods in cudf polars
Issue -
State: open - Opened by wence- 4 months ago
Labels: feature request, cudf.polars
#16479 - [FEA] Support cross-casting to/from strings in cudf-polars
Issue -
State: open - Opened by wence- 4 months ago
- 1 comment
Labels: feature request, cudf.polars
#16478 - [FEA] Support scan-based aggregations in cudf-polars
Issue -
State: closed - Opened by wence- 4 months ago
- 1 comment
Labels: feature request, cudf.polars
#16478 - [FEA] Support scan-based aggregations in cudf-polars
Issue -
State: closed - Opened by wence- 4 months ago
- 1 comment
Labels: feature request, cudf.polars
#16477 - [FEA] Support order-preserving groupby option in cudf-polars
Issue -
State: open - Opened by wence- 4 months ago
Labels: feature request, cudf.polars
#16476 - Implement Kleene logic handling for Any/All and bitwise Or/And
Pull Request -
State: closed - Opened by wence- 4 months ago
- 1 comment
Labels: Python, improvement, non-breaking, cudf.polars
#16474 - Use numba-cuda>=0.0.13
Pull Request -
State: closed - Opened by gmarkall 4 months ago
- 12 comments
Labels: numba, Python, improvement, non-breaking
#16472 - enable list to be forced as string in JSON reader.
Pull Request -
State: closed - Opened by karthikeyann 4 months ago
- 5 comments
Labels: feature request, 3 - Ready for Review, libcudf, cuIO, breaking
#16466 - Fix all-empty input column for strings split APIs
Pull Request -
State: closed - Opened by davidwendt 4 months ago
- 3 comments
Labels: bug, 3 - Ready for Review, libcudf, Python, strings, non-breaking
#16458 - [BUG]when setting dask.config.set({"dataframe.backend": "cudf"}), ddf.explode("col1") and apply customized function cannot work correctly anymore?
Issue -
State: open - Opened by Huilin-Li 4 months ago
- 3 comments
Labels: bug
#16453 - [BUG] `split_record` output empty list for empty input string
Issue -
State: closed - Opened by ttnghia 4 months ago
- 5 comments
Labels: bug
#16450 - [FEA] Add support for `cudf.Timestamp`
Pull Request -
State: closed - Opened by Matt711 4 months ago
- 5 comments
Labels: feature request, Python, non-breaking, cudf.pandas
#16448 - Compute whole column variance using numerically stable approach
Pull Request -
State: closed - Opened by wence- 4 months ago
- 2 comments
Labels: bug, libcudf, Python, Java, non-breaking
#16444 - [BUG] whole-column variance calculation uses numerically unstable algorithm
Issue -
State: closed - Opened by wence- 4 months ago
- 2 comments
Labels: bug, libcudf
#16443 - [DOC] low-memory reader options not very discoverable
Issue -
State: open - Opened by wence- 4 months ago
- 3 comments
Labels: doc, Python
#16440 - Fix parquet_field_list read_func lambda capture invalid this pointer
Pull Request -
State: closed - Opened by davidwendt 4 months ago
- 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking
#16428 - Enable cudf.pandas REPL and -c command support
Pull Request -
State: closed - Opened by bdice 4 months ago
- 3 comments
Labels: feature request, 3 - Ready for Review, Python, non-breaking, cudf.pandas
#16423 - Update docs of the TPC-H derived examples
Pull Request -
State: closed - Opened by JayjeetAtGithub 4 months ago
- 1 comment
Labels: libcudf, improvement, non-breaking
#16390 - [FEA] libcudf read json lines mode support nrows
Issue -
State: open - Opened by lithomas1 4 months ago
- 2 comments
Labels: feature request, libcudf, cuIO, cudf.polars
#16386 - Use `make_host_vector` instead of `make_std_vector` to facilitate pinned memory optimizations
Pull Request -
State: open - Opened by vuule 4 months ago
- 2 comments
Labels: 3 - Ready for Review, libcudf, Performance, improvement, non-breaking
#16383 - [FEA] Add cudf-polars to test.yaml
Issue -
State: open - Opened by lithomas1 4 months ago
- 1 comment
Labels: tests, cudf.polars
#16306 - Pylibcudf polars date from str
Pull Request -
State: open - Opened by brandon-b-miller 4 months ago
- 10 comments
Labels: feature request, Python, CMake, non-breaking, cudf.polars, pylibcudf
#16304 - Avoid decoding long runs in a single thread
Pull Request -
State: open - Opened by gerashegalov 4 months ago
- 4 comments
Labels: feature request, libcudf, 4 - Needs Review, cuIO, Performance, Spark, non-breaking
#16300 - Rebuild w/NumPy 2 (restrict to 1)
Pull Request -
State: open - Opened by jakirkham 4 months ago
- 1 comment
Labels: Python, improvement, non-breaking
#16299 - Setup pylibcudf package
Pull Request -
State: open - Opened by lithomas1 4 months ago
- 6 comments
Labels: feature request, Python, CMake, non-breaking, cudf.pandas, cudf.polars, pylibcudf
#16294 - Add a libcudf/thrust-based TPC-H derived datagen
Pull Request -
State: open - Opened by JayjeetAtGithub 4 months ago
- 2 comments
Labels: feature request, libcudf, CMake, non-breaking
#16286 - Initial investigation into NumPy proxying in `cudf.pandas`
Pull Request -
State: open - Opened by Matt711 4 months ago
- 8 comments
Labels: feature request, Python, non-breaking, cudf.pandas
#16282 - [BUG] Integer promotion fixes needed for NumPy 2 for comparison operators
Issue -
State: closed - Opened by seberg 4 months ago
- 2 comments
#16272 - [FEA] Refactor Column/NamedColumn split in cudf-polars
Issue -
State: closed - Opened by wence- 4 months ago
Labels: feature request, 0 - Backlog, cudf.polars
#16271 - Fix issue in horizontal concat implementation in cudf-polars
Pull Request -
State: closed - Opened by wence- 4 months ago
- 1 comment
Labels: Python, improvement, non-breaking, cudf.polars, pylibcudf
#16266 - Cleanup how args and kwargs are passed in `_fast_slow_function_call`
Pull Request -
State: open - Opened by Matt711 4 months ago
- 9 comments
Labels: bug, Python, non-breaking, cudf.pandas
#16266 - Cleanup how args and kwargs are passed in `_fast_slow_function_call`
Pull Request -
State: open - Opened by Matt711 4 months ago
- 9 comments
Labels: bug, Python, non-breaking, cudf.pandas