Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / rapidsai/cudf issues and pull requests
#16777 - Fix slice_strings wide strings logic with multi-byte characters
Pull Request -
State: closed - Opened by davidwendt 2 months ago
- 3 comments
Labels: bug, 3 - Ready for Review, libcudf, strings, non-breaking
#16776 - Implement `extract_datetime_component` in `libcudf`/`pylibcudf`
Pull Request -
State: closed - Opened by brandon-b-miller 2 months ago
- 6 comments
Labels: feature request, libcudf, Python, CMake, non-breaking, cudf.polars, pylibcudf
#16775 - [FEA] Expose Stream Ordering to the Datetime APIs
Issue -
State: closed - Opened by lamarrr 2 months ago
Labels: feature request
#16774 - Exposed stream-ordering to datetime API
Pull Request -
State: closed - Opened by lamarrr 2 months ago
- 16 comments
Labels: libcudf, CMake, improvement, non-breaking
#16773 - Fix nvbench output for sha512
Pull Request -
State: closed - Opened by davidwendt 2 months ago
- 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking
#16772 - [BUG] fail to download rapids_cpm_generate_pinned_versions nvcomp version 4.0.1.0 from developer site
Issue -
State: open - Opened by pxLi 2 months ago
- 1 comment
Labels: bug
#16771 - Add io/timezone APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke 2 months ago
- 2 comments
Labels: Python, CMake, 5 - Ready to Merge, improvement, non-breaking, pylibcudf
#16770 - nvCOMP GZIP integration
Pull Request -
State: closed - Opened by vuule 2 months ago
- 2 comments
Labels: feature request, libcudf, 5 - Ready to Merge, cuIO, non-breaking
#16769 - Allow read_csv(header=None) to return int column labels in `mode.pandas_compatible`
Pull Request -
State: closed - Opened by mroeschke 2 months ago
- 1 comment
Labels: bug, Python, non-breaking
#16768 - [BUG] slice_strings producing incorrect results for some input strings
Issue -
State: closed - Opened by jlowe 2 months ago
- 1 comment
Labels: bug, libcudf, Spark, strings
#16767 - Remove `MultiIndex._poplevel` inplace implementation.
Pull Request -
State: closed - Opened by mroeschke 2 months ago
- 1 comment
Labels: Python, improvement, non-breaking
#16766 - [BUG] cudf.pandas fallback fails with read_csv due to columns being strings
Issue -
State: closed - Opened by bdice 2 months ago
- 2 comments
Labels: bug
#16765 - Migrate dask-cudf README improvements to dask-cudf sphinx docs
Pull Request -
State: closed - Opened by rjzamora 2 months ago
- 3 comments
Labels: doc, Python, 5 - Ready to Merge, dask, non-breaking
#16764 - [FEA] Implement Java/JNI function `applyNullMask`
Issue -
State: open - Opened by ttnghia 2 months ago
- 2 comments
Labels: feature request, good first issue
#16763 - allow pandas patch version to float in cudf-pandas unit tests
Pull Request -
State: closed - Opened by jameslamb 2 months ago
- 2 comments
Labels: improvement, non-breaking
#16762 - Simplify the nvCOMP adapter
Pull Request -
State: closed - Opened by vuule 2 months ago
- 1 comment
Labels: libcudf, cuIO, improvement, non-breaking
#16761 - Add labeling APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke 2 months ago
- 1 comment
Labels: Python, CMake, improvement, non-breaking
#16760 - Add transform APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke 2 months ago
- 2 comments
Labels: Python, improvement, non-breaking
#16759 - Whitespace normalization of nested column coerced as string column in JSONL inputs
Pull Request -
State: closed - Opened by shrshi 2 months ago
- 11 comments
Labels: bug, libcudf, 5 - Ready to Merge, cuIO, breaking
#16758 - [FEA] Add `io_type` axis with `PINNED_BUFFER` default value to `PARQUET_MULTITHREAD_READER_NVBENCH`
Issue -
State: closed - Opened by GregoryKimball 2 months ago
- 1 comment
Labels: feature request, libcudf, cuIO
#16758 - [FEA] Add `io_type` axis with `PINNED_BUFFER` default value to `PARQUET_MULTITHREAD_READER_NVBENCH`
Issue -
State: closed - Opened by GregoryKimball 2 months ago
- 1 comment
Labels: feature request, libcudf, cuIO
#16756 - Add new nvtext minhash_permuted API
Pull Request -
State: closed - Opened by davidwendt 2 months ago
- 6 comments
Labels: 3 - Ready for Review, libcudf, Python, CMake, improvement, non-breaking, pylibcudf
#16755 - Polars 1.7 will change a minor thing in the IR, adapt to that
Pull Request -
State: closed - Opened by wence- 2 months ago
Labels: Python, cudf.polars
#16754 - test
Pull Request -
State: closed - Opened by galipremsagar 2 months ago
Labels: Python
#16753 - API Doc for Polars GPU Engine
Pull Request -
State: closed - Opened by singhmanas1 2 months ago
- 9 comments
Labels: doc, non-breaking
#16752 - [BUG] Linker error: undefined reference to `cudf::detail::initialize_with_identity`
Issue -
State: closed - Opened by matal-nvidia 2 months ago
- 3 comments
Labels: bug
#16751 - Add a benchmark to study Parquet reader's performance for wide tables
Pull Request -
State: closed - Opened by mhaseeb123 2 months ago
- 6 comments
Labels: libcudf, 5 - Ready to Merge, improvement, non-breaking
#16750 - Change the Parquet writer's `default_row_group_size_bytes` from 128MB to inf
Pull Request -
State: closed - Opened by mhaseeb123 2 months ago
- 6 comments
Labels: libcudf, Python, 5 - Ready to Merge, improvement, non-breaking
#16749 - Add transpose API to pylibcudf
Pull Request -
State: open - Opened by mroeschke 2 months ago
- 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#16748 - [FEA] Make use of cached_property in Cython classes once we update beyond 3.0.11
Issue -
State: open - Opened by vyasr 2 months ago
Labels: feature request
#16747 - Avoid public constructors when called with columns to avoid unnecessary validation
Pull Request -
State: closed - Opened by mroeschke 2 months ago
- 2 comments
Labels: Python, improvement, non-breaking
#16746 - [FEA] Add libcudf data structure and algorithm prefetching to the NDS-H benchmarks
Issue -
State: open - Opened by GregoryKimball 2 months ago
Labels: feature request, libcudf
#16745 - Add support for Python 3.12, update Kafka dependencies to 2.5.x
Pull Request -
State: closed - Opened by jameslamb 2 months ago
- 1 comment
Labels: Python, improvement, non-breaking, cudf.pandas, cudf.polars
#16744 - [FEA] Add support for `cudf.NamedAgg`
Pull Request -
State: closed - Opened by Matt711 2 months ago
- 3 comments
Labels: feature request, Python, non-breaking
#16743 - Split pandas pytests to prepare for GPU vs CPU metrics reporting
Pull Request -
State: closed - Opened by galipremsagar 2 months ago
- 2 comments
Labels: Python, improvement, non-breaking, cudf.pandas
#16742 - Add cudf::strings::find_re API
Pull Request -
State: closed - Opened by davidwendt 2 months ago
- 1 comment
Labels: feature request, 3 - Ready for Review, libcudf, Python, strings, non-breaking, pylibcudf
#16741 - [BUG] JSON string with certain UTF-8 data fails to tokenize fully
Issue -
State: open - Opened by karthikeyann 2 months ago
- 1 comment
Labels: bug, libcudf, cuIO
#16740 - [FEA] Add support for the cuFile Stream API in cuIO
Issue -
State: open - Opened by GregoryKimball 2 months ago
Labels: feature request, libcudf, cuIO
#16739 - Generate GPU vs CPU usage metrics per pytest file in pandas testsuite for `cudf.pandas`
Pull Request -
State: closed - Opened by galipremsagar 2 months ago
- 3 comments
Labels: Python, improvement, non-breaking, cudf.pandas
#16738 - [FEA] Support GPU-accelerated IO writers in Polars
Issue -
State: open - Opened by beckernick 2 months ago
Labels: feature request
#16737 - Spill OOM Protection
Pull Request -
State: closed - Opened by madsbk 2 months ago
Labels: Python, improvement, non-breaking
#16736 - [FEA]: pre-commit reminders when pandas/polars dependencies get out of sync with "version" compat flags
Issue -
State: open - Opened by wence- 2 months ago
#16735 - [FEA] Consider supporting `extract_datetime_component` in libcudf
Issue -
State: closed - Opened by brandon-b-miller 2 months ago
- 2 comments
Labels: feature request, libcudf
#16734 - [QST] Regarding erased_key_sentinel in static map
Issue -
State: open - Opened by avithemad 2 months ago
- 4 comments
Labels: question
#16733 - [FEA] Change PQ writer's row group size default from 128 MB to 1M rows.
Issue -
State: closed - Opened by mhaseeb123 2 months ago
Labels: feature request
#16732 - Avoid registering the byte range padding in the memory mapped datasource
Pull Request -
State: closed - Opened by vuule 2 months ago
- 1 comment
Labels: libcudf, cuIO, Performance, improvement, non-breaking
#16731 - Change mixed type as string to have higher priority over JSON schema
Pull Request -
State: closed - Opened by karthikeyann 2 months ago
- 2 comments
Labels: bug, libcudf, cuIO, Spark, 5 - DO NOT MERGE, breaking
#16730 - [BUG] .str.findall returning incorrect results when using a quantifier with a capturing group
Issue -
State: open - Opened by JamesMaki 2 months ago
- 1 comment
Labels: bug
#16729 - [FEA] Method to return starting index of regex match
Issue -
State: closed - Opened by JamesMaki 2 months ago
- 1 comment
Labels: feature request
#16728 - Refactor cudf pandas integration tests CI
Pull Request -
State: closed - Opened by Matt711 2 months ago
- 1 comment
Labels: tests, improvement, non-breaking, ci
#16727 - Strings batch construction
Pull Request -
State: closed - Opened by sdrp713 2 months ago
- 2 comments
Labels: libcudf, CMake
#16726 - [FEA] Broaden set of supported expressions in groupby-aggregation for cudf-polars
Issue -
State: open - Opened by wence- 2 months ago
Labels: feature request, cudf.polars
#16725 - [FEA] Support struct data types in Polars
Issue -
State: open - Opened by beckernick 2 months ago
Labels: feature request, cudf.polars
#16724 - [BUG] cudf_backtrace missing for static build with CUDF_BUILD_STACKTRACE_DEBUG=ON
Issue -
State: open - Opened by jlowe 2 months ago
- 2 comments
Labels: bug, CMake
#16723 - [FEA] Enable Polars GPU execution via global configuration (default options)
Issue -
State: open - Opened by beckernick 2 months ago
- 1 comment
Labels: feature request, cudf.polars
#16722 - Remove ERROR_TEST gtest from libcudf
Pull Request -
State: closed - Opened by davidwendt 2 months ago
- 3 comments
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking
#16721 - Small scan-handler fixes
Pull Request -
State: closed - Opened by wence- 3 months ago
- 1 comment
Labels: Python, improvement, non-breaking, cudf.polars
#16720 - Partially reject dynamic groupby
Pull Request -
State: closed - Opened by wence- 3 months ago
- 1 comment
Labels: Python, improvement, non-breaking, cudf.polars
#16719 - Some fixes for unary functions
Pull Request -
State: closed - Opened by wence- 3 months ago
Labels: Python, improvement, non-breaking, cudf.polars
#16718 - [FEA] Increase the default thread count for kvikIO file reads
Issue -
State: open - Opened by GregoryKimball 3 months ago
- 4 comments
Labels: feature request, libcudf, cuIO
#16717 - [FEA] Add multi-threaded Parquet read example
Issue -
State: closed - Opened by GregoryKimball 3 months ago
Labels: feature request, libcudf, cuIO
#16716 - Use Series._from_column more consistently to avoid validation
Pull Request -
State: closed - Opened by mroeschke 3 months ago
- 1 comment
Labels: Python, improvement, non-breaking
#16715 - Make `column_empty` mask buffer creation consistent with libcudf
Pull Request -
State: open - Opened by mroeschke 3 months ago
- 3 comments
Labels: Python, improvement, non-breaking
#16714 - remove some unnecessary libcudf nightly builds
Pull Request -
State: closed - Opened by jameslamb 3 months ago
- 1 comment
Labels: 3 - Ready for Review, improvement, non-breaking
#16713 - Use `changed-files` shared workflow
Pull Request -
State: closed - Opened by KyleFromNVIDIA 3 months ago
- 1 comment
Labels: improvement, non-breaking
#16712 - Fix DataFrame.drop(columns=cudf.Series/Index, axis=1)
Pull Request -
State: open - Opened by mroeschke 3 months ago
Labels: bug, Python, non-breaking
#16711 - [FEA] Change from TPC-H to NDS-H
Issue -
State: closed - Opened by JayjeetAtGithub 3 months ago
Labels: feature request
#16710 - Run polars test suite (defaulting to GPU) in CI
Pull Request -
State: closed - Opened by wence- 3 months ago
Labels: Python, improvement, non-breaking, cudf.polars
#16708 - Ensure we pass the has_nulls tparam to mixed_join kernels
Pull Request -
State: closed - Opened by abellina 3 months ago
- 8 comments
Labels: bug, libcudf, non-breaking
#16707 - Add boost-devel to Java CI Docker image
Pull Request -
State: closed - Opened by jlowe 3 months ago
- 1 comment
Labels: bug, Java, non-breaking
#16706 - [BUG] illegal access error in mixed_join after ODR cleanup PR
Issue -
State: closed - Opened by abellina 3 months ago
- 1 comment
Labels: bug, libcudf
#16705 - Remove xfail from torch-cudf.pandas integration test
Pull Request -
State: closed - Opened by Matt711 3 months ago
- 4 comments
Labels: tests, Python, improvement, non-breaking, cudf.pandas
#16702 - No support for producing dataframes with Empty columns
Pull Request -
State: closed - Opened by wence- 3 months ago
Labels: Python, improvement, non-breaking, cudf.polars
#16701 - [FEA] Make TPC-H derived datagen reuse random column generation functions already present in benchmark utils
Issue -
State: open - Opened by JayjeetAtGithub 3 months ago
Labels: feature request
#16700 - Fix typo in column_factories.hpp comment from 'depth 1' to 'depth 2'
Pull Request -
State: closed - Opened by a-hirota 3 months ago
- 4 comments
Labels: bug, 3 - Ready for Review, libcudf, non-breaking
#16697 - [DOC] Remove out of date section from cudf.pandas docs
Pull Request -
State: closed - Opened by Matt711 3 months ago
- 1 comment
Labels: 3 - Ready for Review, doc, non-breaking
#16696 - Add return type annotations to MultiIndex
Pull Request -
State: closed - Opened by mroeschke 3 months ago
- 2 comments
Labels: Python, improvement, non-breaking
#16693 - Add performance tips to cudf.pandas FAQ.
Pull Request -
State: open - Opened by bdice 3 months ago
- 3 comments
Labels: doc, non-breaking
#16690 - [FEA] Report all unsupported operations for a query in cudf-polars
Issue -
State: open - Opened by wence- 3 months ago
Labels: feature request, cudf.polars
#16690 - [FEA] Report all unsupported operations for a query in cudf-polars
Issue -
State: closed - Opened by wence- 3 months ago
- 1 comment
Labels: feature request, cudf.polars
#16689 - Implement unpivot in cudf-polars
Pull Request -
State: closed - Opened by wence- 3 months ago
- 2 comments
Labels: Python, improvement, non-breaking, cudf.polars
#16686 - [FEA] Support JNI build in rapids dev container
Issue -
State: closed - Opened by pxLi 3 months ago
- 2 comments
Labels: feature request, Java, ci, cuDF (Java)
#16685 - lint: replace `isort` with Ruff's rule I
Pull Request -
State: closed - Opened by Borda 3 months ago
- 28 comments
Labels: Python, improvement, non-breaking, cudf.pandas, pylibcudf
#16684 - Add experimental `filesystem="arrow"` support in `dask_cudf.read_parquet`
Pull Request -
State: closed - Opened by rjzamora 3 months ago
- 3 comments
Labels: feature request, Python, 5 - Ready to Merge, dask, non-breaking
#16680 - Creation of CI artifacts for cudf-polars wheels
Pull Request -
State: closed - Opened by wence- 3 months ago
- 10 comments
Labels: feature request, libcudf, Python, CMake, non-breaking, cudf.polars, pylibcudf
#16679 - Add libcudf wrappers around current_device_resource functions.
Pull Request -
State: closed - Opened by harrism 3 months ago
- 6 comments
Labels: feature request, libcudf, Python, Java, breaking
#16678 - [BUG] CUDF JNI build failed FindBoost
Issue -
State: closed - Opened by pxLi 3 months ago
- 4 comments
Labels: bug, Spark, cuDF (Java)
#16676 - [FEA] Add libcudf cudf::get_current_device_resource() wrapper for rmm::get_current_device_resource()
Issue -
State: closed - Opened by harrism 3 months ago
- 1 comment
Labels: feature request, libcudf
#16675 - Fix empty cluster handling in tdigest merge
Pull Request -
State: closed - Opened by jihoonson 3 months ago
- 10 comments
Labels: bug, 3 - Ready for Review, libcudf, breaking
#16668 - Update patches for CCCL 2.6
Pull Request -
State: open - Opened by miscco 3 months ago
- 4 comments
Labels: libcudf, CMake
#16663 - Move NDS-H examples into benchmarks
Pull Request -
State: closed - Opened by JayjeetAtGithub 3 months ago
- 9 comments
Labels: feature request, libcudf, CMake, non-breaking
#16657 - Multi-file and Parquet-aware prefetching from remote storage
Pull Request -
State: closed - Opened by rjzamora 3 months ago
- 1 comment
Labels: Python, 5 - Ready to Merge, Performance, improvement, non-breaking
#16652 - Access Frame attributes instead of ColumnAccessor attributes when available
Pull Request -
State: closed - Opened by mroeschke 3 months ago
- 1 comment
Labels: Python, improvement, non-breaking
#16641 - strings::contains() for multiple scalar search targets
Pull Request -
State: open - Opened by res-life 3 months ago
- 29 comments
Labels: feature request, libcudf, Java, non-breaking
#16640 - Remove arrow dependency
Pull Request -
State: closed - Opened by vyasr 3 months ago
- 5 comments
Labels: libcudf, Python, CMake, Java, improvement, non-breaking
#16639 - Support reading multiple PQ sources with mismatching nullability for columns
Pull Request -
State: closed - Opened by mhaseeb123 3 months ago
- 6 comments
Labels: libcudf, Python, 5 - Ready to Merge, cuIO, improvement, breaking
#16619 - Improve the performance of low cardinality groupby
Pull Request -
State: open - Opened by PointKernel 3 months ago
- 8 comments
Labels: 3 - Ready for Review, libcudf, CMake, Performance, improvement, non-breaking
#16601 - Make isinstance check pass for proxy ndarrays
Pull Request -
State: open - Opened by Matt711 3 months ago
- 12 comments
Labels: feature request, 3 - Ready for Review, Python, non-breaking, cudf.pandas
#16598 - Implement `first_contains_index`
Pull Request -
State: closed - Opened by ttnghia 3 months ago
- 2 comments
Labels: libcudf
#16591 - Extend the Parquet writer's dictionary encoding benchmark.
Pull Request -
State: closed - Opened by mhaseeb123 3 months ago
- 9 comments
Labels: libcudf, 5 - Ready to Merge, improvement, non-breaking