Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / rapidsai/cudf issues and pull requests

#16777 - Fix slice_strings wide strings logic with multi-byte characters

Pull Request - State: closed - Opened by davidwendt 2 months ago - 3 comments
Labels: bug, 3 - Ready for Review, libcudf, strings, non-breaking

#16776 - Implement `extract_datetime_component` in `libcudf`/`pylibcudf`

Pull Request - State: closed - Opened by brandon-b-miller 2 months ago - 6 comments
Labels: feature request, libcudf, Python, CMake, non-breaking, cudf.polars, pylibcudf

#16775 - [FEA] Expose Stream Ordering to the Datetime APIs

Issue - State: closed - Opened by lamarrr 2 months ago
Labels: feature request

#16774 - Exposed stream-ordering to datetime API

Pull Request - State: closed - Opened by lamarrr 2 months ago - 16 comments
Labels: libcudf, CMake, improvement, non-breaking

#16773 - Fix nvbench output for sha512

Pull Request - State: closed - Opened by davidwendt 2 months ago - 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking

#16771 - Add io/timezone APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke 2 months ago - 2 comments
Labels: Python, CMake, 5 - Ready to Merge, improvement, non-breaking, pylibcudf

#16770 - nvCOMP GZIP integration

Pull Request - State: closed - Opened by vuule 2 months ago - 2 comments
Labels: feature request, libcudf, 5 - Ready to Merge, cuIO, non-breaking

#16769 - Allow read_csv(header=None) to return int column labels in `mode.pandas_compatible`

Pull Request - State: closed - Opened by mroeschke 2 months ago - 1 comment
Labels: bug, Python, non-breaking

#16768 - [BUG] slice_strings producing incorrect results for some input strings

Issue - State: closed - Opened by jlowe 2 months ago - 1 comment
Labels: bug, libcudf, Spark, strings

#16767 - Remove `MultiIndex._poplevel` inplace implementation.

Pull Request - State: closed - Opened by mroeschke 2 months ago - 1 comment
Labels: Python, improvement, non-breaking

#16766 - [BUG] cudf.pandas fallback fails with read_csv due to columns being strings

Issue - State: closed - Opened by bdice 2 months ago - 2 comments
Labels: bug

#16765 - Migrate dask-cudf README improvements to dask-cudf sphinx docs

Pull Request - State: closed - Opened by rjzamora 2 months ago - 3 comments
Labels: doc, Python, 5 - Ready to Merge, dask, non-breaking

#16764 - [FEA] Implement Java/JNI function `applyNullMask`

Issue - State: open - Opened by ttnghia 2 months ago - 2 comments
Labels: feature request, good first issue

#16763 - allow pandas patch version to float in cudf-pandas unit tests

Pull Request - State: closed - Opened by jameslamb 2 months ago - 2 comments
Labels: improvement, non-breaking

#16762 - Simplify the nvCOMP adapter

Pull Request - State: closed - Opened by vuule 2 months ago - 1 comment
Labels: libcudf, cuIO, improvement, non-breaking

#16761 - Add labeling APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke 2 months ago - 1 comment
Labels: Python, CMake, improvement, non-breaking

#16760 - Add transform APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke 2 months ago - 2 comments
Labels: Python, improvement, non-breaking

#16759 - Whitespace normalization of nested column coerced as string column in JSONL inputs

Pull Request - State: closed - Opened by shrshi 2 months ago - 11 comments
Labels: bug, libcudf, 5 - Ready to Merge, cuIO, breaking

#16758 - [FEA] Add `io_type` axis with `PINNED_BUFFER` default value to `PARQUET_MULTITHREAD_READER_NVBENCH`

Issue - State: closed - Opened by GregoryKimball 2 months ago - 1 comment
Labels: feature request, libcudf, cuIO

#16758 - [FEA] Add `io_type` axis with `PINNED_BUFFER` default value to `PARQUET_MULTITHREAD_READER_NVBENCH`

Issue - State: closed - Opened by GregoryKimball 2 months ago - 1 comment
Labels: feature request, libcudf, cuIO

#16756 - Add new nvtext minhash_permuted API

Pull Request - State: closed - Opened by davidwendt 2 months ago - 6 comments
Labels: 3 - Ready for Review, libcudf, Python, CMake, improvement, non-breaking, pylibcudf

#16755 - Polars 1.7 will change a minor thing in the IR, adapt to that

Pull Request - State: closed - Opened by wence- 2 months ago
Labels: Python, cudf.polars

#16754 - test

Pull Request - State: closed - Opened by galipremsagar 2 months ago
Labels: Python

#16753 - API Doc for Polars GPU Engine

Pull Request - State: closed - Opened by singhmanas1 2 months ago - 9 comments
Labels: doc, non-breaking

#16752 - [BUG] Linker error: undefined reference to `cudf::detail::initialize_with_identity`

Issue - State: closed - Opened by matal-nvidia 2 months ago - 3 comments
Labels: bug

#16751 - Add a benchmark to study Parquet reader's performance for wide tables

Pull Request - State: closed - Opened by mhaseeb123 2 months ago - 6 comments
Labels: libcudf, 5 - Ready to Merge, improvement, non-breaking

#16750 - Change the Parquet writer's `default_row_group_size_bytes` from 128MB to inf

Pull Request - State: closed - Opened by mhaseeb123 2 months ago - 6 comments
Labels: libcudf, Python, 5 - Ready to Merge, improvement, non-breaking

#16749 - Add transpose API to pylibcudf

Pull Request - State: open - Opened by mroeschke 2 months ago - 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#16748 - [FEA] Make use of cached_property in Cython classes once we update beyond 3.0.11

Issue - State: open - Opened by vyasr 2 months ago
Labels: feature request

#16747 - Avoid public constructors when called with columns to avoid unnecessary validation

Pull Request - State: closed - Opened by mroeschke 2 months ago - 2 comments
Labels: Python, improvement, non-breaking

#16746 - [FEA] Add libcudf data structure and algorithm prefetching to the NDS-H benchmarks

Issue - State: open - Opened by GregoryKimball 2 months ago
Labels: feature request, libcudf

#16745 - Add support for Python 3.12, update Kafka dependencies to 2.5.x

Pull Request - State: closed - Opened by jameslamb 2 months ago - 1 comment
Labels: Python, improvement, non-breaking, cudf.pandas, cudf.polars

#16744 - [FEA] Add support for `cudf.NamedAgg`

Pull Request - State: closed - Opened by Matt711 2 months ago - 3 comments
Labels: feature request, Python, non-breaking

#16743 - Split pandas pytests to prepare for GPU vs CPU metrics reporting

Pull Request - State: closed - Opened by galipremsagar 2 months ago - 2 comments
Labels: Python, improvement, non-breaking, cudf.pandas

#16742 - Add cudf::strings::find_re API

Pull Request - State: closed - Opened by davidwendt 2 months ago - 1 comment
Labels: feature request, 3 - Ready for Review, libcudf, Python, strings, non-breaking, pylibcudf

#16741 - [BUG] JSON string with certain UTF-8 data fails to tokenize fully

Issue - State: open - Opened by karthikeyann 2 months ago - 1 comment
Labels: bug, libcudf, cuIO

#16740 - [FEA] Add support for the cuFile Stream API in cuIO

Issue - State: open - Opened by GregoryKimball 2 months ago
Labels: feature request, libcudf, cuIO

#16739 - Generate GPU vs CPU usage metrics per pytest file in pandas testsuite for `cudf.pandas`

Pull Request - State: closed - Opened by galipremsagar 2 months ago - 3 comments
Labels: Python, improvement, non-breaking, cudf.pandas

#16738 - [FEA] Support GPU-accelerated IO writers in Polars

Issue - State: open - Opened by beckernick 2 months ago
Labels: feature request

#16737 - Spill OOM Protection

Pull Request - State: closed - Opened by madsbk 2 months ago
Labels: Python, improvement, non-breaking

#16735 - [FEA] Consider supporting `extract_datetime_component` in libcudf

Issue - State: closed - Opened by brandon-b-miller 2 months ago - 2 comments
Labels: feature request, libcudf

#16734 - [QST] Regarding erased_key_sentinel in static map

Issue - State: open - Opened by avithemad 2 months ago - 4 comments
Labels: question

#16733 - [FEA] Change PQ writer's row group size default from 128 MB to 1M rows.

Issue - State: closed - Opened by mhaseeb123 2 months ago
Labels: feature request

#16732 - Avoid registering the byte range padding in the memory mapped datasource

Pull Request - State: closed - Opened by vuule 2 months ago - 1 comment
Labels: libcudf, cuIO, Performance, improvement, non-breaking

#16731 - Change mixed type as string to have higher priority over JSON schema

Pull Request - State: closed - Opened by karthikeyann 2 months ago - 2 comments
Labels: bug, libcudf, cuIO, Spark, 5 - DO NOT MERGE, breaking

#16729 - [FEA] Method to return starting index of regex match

Issue - State: closed - Opened by JamesMaki 2 months ago - 1 comment
Labels: feature request

#16728 - Refactor cudf pandas integration tests CI

Pull Request - State: closed - Opened by Matt711 2 months ago - 1 comment
Labels: tests, improvement, non-breaking, ci

#16727 - Strings batch construction

Pull Request - State: closed - Opened by sdrp713 2 months ago - 2 comments
Labels: libcudf, CMake

#16726 - [FEA] Broaden set of supported expressions in groupby-aggregation for cudf-polars

Issue - State: open - Opened by wence- 2 months ago
Labels: feature request, cudf.polars

#16725 - [FEA] Support struct data types in Polars

Issue - State: open - Opened by beckernick 2 months ago
Labels: feature request, cudf.polars

#16724 - [BUG] cudf_backtrace missing for static build with CUDF_BUILD_STACKTRACE_DEBUG=ON

Issue - State: open - Opened by jlowe 2 months ago - 2 comments
Labels: bug, CMake

#16723 - [FEA] Enable Polars GPU execution via global configuration (default options)

Issue - State: open - Opened by beckernick 2 months ago - 1 comment
Labels: feature request, cudf.polars

#16722 - Remove ERROR_TEST gtest from libcudf

Pull Request - State: closed - Opened by davidwendt 2 months ago - 3 comments
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking

#16721 - Small scan-handler fixes

Pull Request - State: closed - Opened by wence- 3 months ago - 1 comment
Labels: Python, improvement, non-breaking, cudf.polars

#16720 - Partially reject dynamic groupby

Pull Request - State: closed - Opened by wence- 3 months ago - 1 comment
Labels: Python, improvement, non-breaking, cudf.polars

#16719 - Some fixes for unary functions

Pull Request - State: closed - Opened by wence- 3 months ago
Labels: Python, improvement, non-breaking, cudf.polars

#16718 - [FEA] Increase the default thread count for kvikIO file reads

Issue - State: open - Opened by GregoryKimball 3 months ago - 4 comments
Labels: feature request, libcudf, cuIO

#16717 - [FEA] Add multi-threaded Parquet read example

Issue - State: closed - Opened by GregoryKimball 3 months ago
Labels: feature request, libcudf, cuIO

#16716 - Use Series._from_column more consistently to avoid validation

Pull Request - State: closed - Opened by mroeschke 3 months ago - 1 comment
Labels: Python, improvement, non-breaking

#16715 - Make `column_empty` mask buffer creation consistent with libcudf

Pull Request - State: open - Opened by mroeschke 3 months ago - 3 comments
Labels: Python, improvement, non-breaking

#16714 - remove some unnecessary libcudf nightly builds

Pull Request - State: closed - Opened by jameslamb 3 months ago - 1 comment
Labels: 3 - Ready for Review, improvement, non-breaking

#16713 - Use `changed-files` shared workflow

Pull Request - State: closed - Opened by KyleFromNVIDIA 3 months ago - 1 comment
Labels: improvement, non-breaking

#16712 - Fix DataFrame.drop(columns=cudf.Series/Index, axis=1)

Pull Request - State: open - Opened by mroeschke 3 months ago
Labels: bug, Python, non-breaking

#16711 - [FEA] Change from TPC-H to NDS-H

Issue - State: closed - Opened by JayjeetAtGithub 3 months ago
Labels: feature request

#16710 - Run polars test suite (defaulting to GPU) in CI

Pull Request - State: closed - Opened by wence- 3 months ago
Labels: Python, improvement, non-breaking, cudf.polars

#16708 - Ensure we pass the has_nulls tparam to mixed_join kernels

Pull Request - State: closed - Opened by abellina 3 months ago - 8 comments
Labels: bug, libcudf, non-breaking

#16707 - Add boost-devel to Java CI Docker image

Pull Request - State: closed - Opened by jlowe 3 months ago - 1 comment
Labels: bug, Java, non-breaking

#16706 - [BUG] illegal access error in mixed_join after ODR cleanup PR

Issue - State: closed - Opened by abellina 3 months ago - 1 comment
Labels: bug, libcudf

#16705 - Remove xfail from torch-cudf.pandas integration test

Pull Request - State: closed - Opened by Matt711 3 months ago - 4 comments
Labels: tests, Python, improvement, non-breaking, cudf.pandas

#16702 - No support for producing dataframes with Empty columns

Pull Request - State: closed - Opened by wence- 3 months ago
Labels: Python, improvement, non-breaking, cudf.polars

#16700 - Fix typo in column_factories.hpp comment from 'depth 1' to 'depth 2'

Pull Request - State: closed - Opened by a-hirota 3 months ago - 4 comments
Labels: bug, 3 - Ready for Review, libcudf, non-breaking

#16697 - [DOC] Remove out of date section from cudf.pandas docs

Pull Request - State: closed - Opened by Matt711 3 months ago - 1 comment
Labels: 3 - Ready for Review, doc, non-breaking

#16696 - Add return type annotations to MultiIndex

Pull Request - State: closed - Opened by mroeschke 3 months ago - 2 comments
Labels: Python, improvement, non-breaking

#16693 - Add performance tips to cudf.pandas FAQ.

Pull Request - State: open - Opened by bdice 3 months ago - 3 comments
Labels: doc, non-breaking

#16690 - [FEA] Report all unsupported operations for a query in cudf-polars

Issue - State: open - Opened by wence- 3 months ago
Labels: feature request, cudf.polars

#16690 - [FEA] Report all unsupported operations for a query in cudf-polars

Issue - State: closed - Opened by wence- 3 months ago - 1 comment
Labels: feature request, cudf.polars

#16689 - Implement unpivot in cudf-polars

Pull Request - State: closed - Opened by wence- 3 months ago - 2 comments
Labels: Python, improvement, non-breaking, cudf.polars

#16686 - [FEA] Support JNI build in rapids dev container

Issue - State: closed - Opened by pxLi 3 months ago - 2 comments
Labels: feature request, Java, ci, cuDF (Java)

#16685 - lint: replace `isort` with Ruff's rule I

Pull Request - State: closed - Opened by Borda 3 months ago - 28 comments
Labels: Python, improvement, non-breaking, cudf.pandas, pylibcudf

#16684 - Add experimental `filesystem="arrow"` support in `dask_cudf.read_parquet`

Pull Request - State: closed - Opened by rjzamora 3 months ago - 3 comments
Labels: feature request, Python, 5 - Ready to Merge, dask, non-breaking

#16680 - Creation of CI artifacts for cudf-polars wheels

Pull Request - State: closed - Opened by wence- 3 months ago - 10 comments
Labels: feature request, libcudf, Python, CMake, non-breaking, cudf.polars, pylibcudf

#16679 - Add libcudf wrappers around current_device_resource functions.

Pull Request - State: closed - Opened by harrism 3 months ago - 6 comments
Labels: feature request, libcudf, Python, Java, breaking

#16678 - [BUG] CUDF JNI build failed FindBoost

Issue - State: closed - Opened by pxLi 3 months ago - 4 comments
Labels: bug, Spark, cuDF (Java)

#16676 - [FEA] Add libcudf cudf::get_current_device_resource() wrapper for rmm::get_current_device_resource()

Issue - State: closed - Opened by harrism 3 months ago - 1 comment
Labels: feature request, libcudf

#16675 - Fix empty cluster handling in tdigest merge

Pull Request - State: closed - Opened by jihoonson 3 months ago - 10 comments
Labels: bug, 3 - Ready for Review, libcudf, breaking

#16668 - Update patches for CCCL 2.6

Pull Request - State: open - Opened by miscco 3 months ago - 4 comments
Labels: libcudf, CMake

#16663 - Move NDS-H examples into benchmarks

Pull Request - State: closed - Opened by JayjeetAtGithub 3 months ago - 9 comments
Labels: feature request, libcudf, CMake, non-breaking

#16657 - Multi-file and Parquet-aware prefetching from remote storage

Pull Request - State: closed - Opened by rjzamora 3 months ago - 1 comment
Labels: Python, 5 - Ready to Merge, Performance, improvement, non-breaking

#16652 - Access Frame attributes instead of ColumnAccessor attributes when available

Pull Request - State: closed - Opened by mroeschke 3 months ago - 1 comment
Labels: Python, improvement, non-breaking

#16641 - strings::contains() for multiple scalar search targets

Pull Request - State: open - Opened by res-life 3 months ago - 29 comments
Labels: feature request, libcudf, Java, non-breaking

#16640 - Remove arrow dependency

Pull Request - State: closed - Opened by vyasr 3 months ago - 5 comments
Labels: libcudf, Python, CMake, Java, improvement, non-breaking

#16639 - Support reading multiple PQ sources with mismatching nullability for columns

Pull Request - State: closed - Opened by mhaseeb123 3 months ago - 6 comments
Labels: libcudf, Python, 5 - Ready to Merge, cuIO, improvement, breaking

#16619 - Improve the performance of low cardinality groupby

Pull Request - State: open - Opened by PointKernel 3 months ago - 8 comments
Labels: 3 - Ready for Review, libcudf, CMake, Performance, improvement, non-breaking

#16601 - Make isinstance check pass for proxy ndarrays

Pull Request - State: open - Opened by Matt711 3 months ago - 12 comments
Labels: feature request, 3 - Ready for Review, Python, non-breaking, cudf.pandas

#16598 - Implement `first_contains_index`

Pull Request - State: closed - Opened by ttnghia 3 months ago - 2 comments
Labels: libcudf

#16591 - Extend the Parquet writer's dictionary encoding benchmark.

Pull Request - State: closed - Opened by mhaseeb123 3 months ago - 9 comments
Labels: libcudf, 5 - Ready to Merge, improvement, non-breaking