Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / rapidsai/cudf issues and pull requests

#17179 - Make ai.rapids.cudf.HostMemoryBuffer#copyFromStream public.

Pull Request - State: open - Opened by liurenjie1024 23 days ago - 5 comments
Labels: Java

#17178 - [WIP] Nullify non-struct rows in JSONL inputs

Pull Request - State: closed - Opened by shrshi 23 days ago - 2 comments
Labels: bug, libcudf, cuIO, non-breaking

#17177 - [BUG] dask-cudf.to_parquet(write_metadata_file=True, append=True) fails

Issue - State: open - Opened by ayushdg 23 days ago - 1 comment
Labels: bug, Python

#17176 - Support storing `precision` of decimal types in `Schema` class

Pull Request - State: open - Opened by ttnghia 23 days ago - 6 comments
Labels: feature request, 3 - Ready for Review, Java, Spark, non-breaking

#17175 - Separate evaluation logic from `IR` objects in cudf-polars

Pull Request - State: closed - Opened by rjzamora 23 days ago - 4 comments
Labels: Python, 4 - Needs Review, improvement, breaking, cudf.polars

#17174 - [FEA] dt.date (mirror pyarrow-backed pandas behaviour)

Issue - State: open - Opened by MarcoGorelli 23 days ago
Labels: feature request

#17173 - Move nvtext ngrams benchmarks to nvbench

Pull Request - State: closed - Opened by davidwendt 23 days ago - 1 comment
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking

#17172 - [PERF] Remove stream sync in concatenate for better pipelining

Issue - State: open - Opened by tgujar 23 days ago - 1 comment
Labels: libcudf, Performance, improvement

#17171 - [FEA] Remove redundant aggregatiom details in shared memory groupby

Issue - State: open - Opened by PointKernel 23 days ago
Labels: feature request, libcudf

#17170 - Remove includes suggested by include-what-you-use

Pull Request - State: closed - Opened by vyasr 23 days ago - 8 comments
Labels: libcudf, improvement, non-breaking

#17169 - [FEA] Add libcudf example showing `MERGE_LISTS` aggregation kind

Issue - State: open - Opened by GregoryKimball 23 days ago
Labels: feature request, libcudf

#17168 - [BUG] Make `__iter__` a fast-slow attribute of the `pd.Series` proxy type

Pull Request - State: open - Opened by Matt711 24 days ago - 2 comments
Labels: bug, Python, non-breaking, cudf.pandas

#17167 - [FEA] Support column factory function `all_nulls_like`

Issue - State: open - Opened by ttnghia 24 days ago - 5 comments
Labels: feature request

#17166 - [BUG] Series.plot method populates pd.util.version when run under cudf.pandas

Issue - State: closed - Opened by wphicks 24 days ago
Labels: bug, Python, cudf.pandas

#17165 - [BUG] Incorrect dtype when iterating over dtypes in cudf.pandas

Issue - State: open - Opened by wphicks 24 days ago - 2 comments
Labels: bug

#17164 - [FEA] Use bloom filters in Parquet reader to filter row groups with equality predicates

Issue - State: open - Opened by mhaseeb123 24 days ago
Labels: feature request, libcudf, cuIO, improvement, cuco

#17163 - Migrate CSV writer to pylibcudf

Pull Request - State: open - Opened by Matt711 24 days ago
Labels: feature request, libcudf, Python, non-breaking, pylibcudf

#17162 - Add compute_shared_memory_aggs used by shared memory groupby

Pull Request - State: open - Opened by PointKernel 24 days ago
Labels: feature request, 3 - Ready for Review, libcudf, CMake, non-breaking

#17161 - Reading multi-source compressed JSONL files

Pull Request - State: open - Opened by shrshi 24 days ago - 4 comments
Labels: libcudf, CMake, cuIO, improvement, non-breaking

#17158 - [BUG] Chunked parquet reader incorrect results for large string columns

Issue - State: open - Opened by brandon-b-miller 24 days ago
Labels: bug, cuIO, cudf.polars

#17157 - Process parquet bools with microkernels

Pull Request - State: open - Opened by pmattione-nvidia 24 days ago
Labels: libcudf, Performance, improvement, non-breaking

#17156 - Added ast tree to simplify expression lifetime management

Pull Request - State: open - Opened by lamarrr 24 days ago
Labels: feature request, libcudf, CMake, non-breaking

#17155 - [BUG] Misaligned timestamps produced by ORC reader

Issue - State: open - Opened by mlahir1 24 days ago - 3 comments
Labels: bug, libcudf, cuIO

#17154 - Upgrade to polars 1.11 in cudf-polars

Pull Request - State: closed - Opened by wence- 24 days ago - 2 comments
Labels: Python, improvement, breaking, cudf.polars

#17153 - [BUG] INTEROP_TEST fails on a libcudf debug build at ToArrowTest.NestedList

Issue - State: open - Opened by davidwendt 24 days ago - 2 comments
Labels: bug, libcudf

#17152 - Deprecate current libcudf nvtext minhash functions

Pull Request - State: closed - Opened by davidwendt 24 days ago - 1 comment
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking

#17151 - Remove unused variable in internal merge_tdigests utility

Pull Request - State: closed - Opened by davidwendt 24 days ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#17149 - Fix host-to-device copy missing sync in strings/duration convert

Pull Request - State: closed - Opened by davidwendt 24 days ago - 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking

#17148 - Move `segmented_gather` function from the copying module to the lists module

Pull Request - State: closed - Opened by Matt711 25 days ago - 1 comment
Labels: Python, improvement, non-breaking

#17147 - Add compute_mapping_indices used by shared memory groupby

Pull Request - State: closed - Opened by PointKernel 25 days ago - 3 comments
Labels: feature request, 3 - Ready for Review, libcudf, CMake, non-breaking

#17145 - [test] compression benchmarks

Pull Request - State: open - Opened by vuule 25 days ago
Labels: libcudf

#17144 - [FEA] Support for dynamic groupby in cudf-polars

Issue - State: open - Opened by wence- 25 days ago - 2 comments
Labels: feature request, cudf.polars

#17143 - Add remaining datetime APIs to pylibcudf

Pull Request - State: open - Opened by Matt711 25 days ago
Labels: feature request, Python, non-breaking, pylibcudf

#17142 - [FEA] Parquet reader filter improvements

Issue - State: open - Opened by wence- 25 days ago - 4 comments
Labels: feature request, libcudf, cuIO

#17141 - Add conversion from cudf-polars expressions to libcudf ast for parquet filters

Pull Request - State: open - Opened by wence- 25 days ago
Labels: Python, improvement, non-breaking, cudf.polars, pylibcudf

#17140 - [BUG] Slow Performance of cuDF Pandas on L4

Issue - State: closed - Opened by ericphan-nv 26 days ago - 2 comments
Labels: bug

#17138 - JSON spark reader plan for 24.12

Issue - State: open - Opened by karthikeyann 26 days ago
Labels: cuIO, Spark, improvement

#17134 - devcontainer: replace `VAULT_HOST` with `AWS_ROLE_ARN`

Pull Request - State: closed - Opened by jjacobelli 26 days ago - 1 comment
Labels: improvement, non-breaking

#17133 - Support hyper log log plus plus(HLL++)

Pull Request - State: open - Opened by res-life 26 days ago - 2 comments
Labels: libcudf, CMake, Java

#17131 - use rapids-generate-pip-constraints to pin to oldest dependencies in CI

Pull Request - State: closed - Opened by jameslamb 29 days ago - 1 comment
Labels: improvement, non-breaking

#17128 - Added strings AST vs BINARY_OP benchmarks

Pull Request - State: closed - Opened by lamarrr 29 days ago - 5 comments
Labels: feature request, libcudf, non-breaking

#17127 - [FEA] [Proposal] Separate IR evaluation logic from the IR object in cudf-polars

Issue - State: closed - Opened by rjzamora 29 days ago - 1 comment
Labels: feature request, proposal, improvement, cudf.polars

#17119 - [Story] Enable multithreading in cuIO and libcudf

Issue - State: open - Opened by mhaseeb123 30 days ago
Labels: proposal

#17116 - [DO NOT MERGE/REVIEW] GDS debugging

Pull Request - State: open - Opened by kingcrimsontianyu 30 days ago - 1 comment
Labels: libcudf

#17115 - Improve cudf::io::datasource::create().

Pull Request - State: open - Opened by tpn about 1 month ago - 5 comments
Labels: libcudf

#17114 - [FEA] Faster path for calculating total output symbols in FST

Issue - State: open - Opened by karthikeyann about 1 month ago - 4 comments
Labels: feature request

#17109 - Use `libcudf_exception_handler` throughout `pylibcudf.libcudf`

Pull Request - State: open - Opened by brandon-b-miller about 1 month ago - 1 comment
Labels: feature request, Python, non-breaking, pylibcudf

#17106 - [FEA] Improve WSL testing

Issue - State: open - Opened by vyasr about 1 month ago - 1 comment
Labels: feature request

#17105 - Minor I/O code quality improvements

Pull Request - State: closed - Opened by kingcrimsontianyu about 1 month ago - 1 comment
Labels: libcudf, improvement, non-breaking

#17101 - Migrate NVText Byte Pair Encoding APIs to pylibcudf

Pull Request - State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, CMake, non-breaking, pylibcudf

#17100 - Migrate NVText Tokenizing APIs to pylibcudf

Pull Request - State: open - Opened by Matt711 about 1 month ago - 1 comment
Labels: feature request, libcudf, Python, CMake, non-breaking, pylibcudf

#17098 - Fix bug in recovering invalid lines in JSONL inputs

Pull Request - State: open - Opened by shrshi about 1 month ago - 3 comments
Labels: bug, libcudf, cuIO, 5 - DO NOT MERGE, non-breaking

#17096 - Migrate NVtext subword tokenizing APIs to pylibcudf

Pull Request - State: open - Opened by Matt711 about 1 month ago - 1 comment
Labels: feature request, Python, CMake, non-breaking, pylibcudf

#17088 - build wheels without build isolation

Pull Request - State: closed - Opened by jameslamb about 1 month ago - 1 comment
Labels: improvement, non-breaking

#17087 - [BUG] datetime comparison is sensitive to ordering with different precision

Issue - State: closed - Opened by hoxbro about 1 month ago - 1 comment
Labels: bug, Python

#17080 - [FEA] Do not convert decimal32/64 cols to decimal128 in `to_arrow` API and PQ writer when arrow schema is in use

Issue - State: open - Opened by mhaseeb123 about 1 month ago - 1 comment
Labels: feature request, 0 - Blocked

#17078 - Add IWYU to CI

Pull Request - State: closed - Opened by vyasr about 1 month ago - 3 comments
Labels: feature request, libcudf, CMake, non-breaking

#17076 - `cudf-polars` string/numeric casting

Pull Request - State: open - Opened by brandon-b-miller about 1 month ago - 4 comments
Labels: feature request, libcudf, Python, non-breaking, cudf.polars, pylibcudf

#17073 - [BUG] Off-by-1 error in `__floordiv__` binary op

Issue - State: open - Opened by galipremsagar about 1 month ago
Labels: bug

#17072 - [WIP] Migrate NVText Normalizing APIs to Pylibcudf

Pull Request - State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, CMake, non-breaking, pylibcudf

#17071 - [Do not review] Test stream 2

Pull Request - State: open - Opened by ttnghia about 1 month ago
Labels: libcudf, CMake

#17070 - Migrate remaining nvtext NGrams APIs to pylibcudf

Pull Request - State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, CMake, non-breaking, pylibcudf

#17069 - [Do not review] Test stream 1

Pull Request - State: open - Opened by ttnghia about 1 month ago
Labels: libcudf, CMake

#17068 - [BUG] cuDF.read_json fails with cudaErrorInvalidValue invalid argument

Issue - State: open - Opened by ayushdg about 1 month ago - 1 comment
Labels: bug

#17067 - Fix regex handling of fixed quantifier with 0 range

Pull Request - State: open - Opened by davidwendt about 1 month ago
Labels: bug, 3 - Ready for Review, libcudf, strings, non-breaking

#17066 - Add profilers to CUDA 12 conda devcontainers

Pull Request - State: open - Opened by vyasr about 1 month ago
Labels: feature request, non-breaking

#17065 - [BUG] 0 range regexp appear to be broken

Issue - State: open - Opened by revans2 about 1 month ago
Labels: bug, libcudf

#17064 - Use the "Force CI run" feature of changed-files workflow

Pull Request - State: open - Opened by KyleFromNVIDIA about 1 month ago
Labels: 5 - DO NOT MERGE

#17063 - [DO NOT REVIEW/MERGE] GDS test

Pull Request - State: open - Opened by kingcrimsontianyu about 1 month ago
Labels: libcudf

#17062 - Polars: DataFrame Serialization

Pull Request - State: open - Opened by madsbk about 1 month ago - 6 comments
Labels: Python, improvement, non-breaking, cudf.polars

#17061 - [BUG] Without providing metadata cannot round-trip struct columns via interop/from/to_arrow

Issue - State: open - Opened by wence- about 1 month ago
Labels: bug, pylibcudf

#17060 - [FEA] Support missing casting operations in `cudf-polars`

Issue - State: open - Opened by brandon-b-miller about 1 month ago
Labels: feature request, cudf.polars

#17059 - :construction: Limit the number of keys to calculate column sizes and page starts in PQ reader to 1B

Pull Request - State: open - Opened by mhaseeb123 about 1 month ago
Labels: bug, 2 - In Progress, libcudf, cuIO, non-breaking

#17057 - Adding assertion to check for regular JSON inputs of size greater than `INT_MAX` bytes

Pull Request - State: open - Opened by shrshi about 1 month ago - 1 comment
Labels: bug, libcudf, cuIO, non-breaking

#17056 - Remove unused hash helper functions

Pull Request - State: closed - Opened by PointKernel about 1 month ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#17055 - Add to_dlpack/from_dlpack APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 1 comment
Labels: Python, improvement, non-breaking, pylibcudf

#17054 - [FEA] Support direct construction of pylibcudf Scalars

Issue - State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python, pylibcudf

#17053 - Move `flatten_single_pass_aggs` to its own TU

Pull Request - State: closed - Opened by PointKernel about 1 month ago - 1 comment
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking

#17052 - Replace deprecated cuco APIs with updated versions

Pull Request - State: closed - Opened by PointKernel about 1 month ago - 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking, cuco

#17051 - [FEA] Display operations unsupported by the GPU in a user-friendly way

Issue - State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, cudf.polars

#17050 - [FEA] Standardize approach for various TPC-H-adjacent benchmarks

Issue - State: open - Opened by vyasr about 1 month ago
Labels: feature request, proposal, Performance

#17049 - Refactor ORC dictionary encoding to migrate to the new `cuco::static_map`

Pull Request - State: closed - Opened by mhaseeb123 about 1 month ago - 2 comments
Labels: 3 - Ready for Review, libcudf, cuIO, improvement, non-breaking, cuco

#17048 - Move pylibcudf/libcudf/wrappers/decimals to pylibcudf/libcudf/fixed_point

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 1 comment
Labels: Python, improvement, non-breaking, pylibcudf

#17047 - [FEA] Support GPUs in the Polars eager API

Issue - State: closed - Opened by vyasr about 1 month ago - 1 comment
Labels: feature request, Python, cudf.polars

#17046 - [FEA] Make pyarrow an optional dependency of pylibcudf

Issue - State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python, pylibcudf

#17045 - [FEA] Make cudf tests deterministic

Issue - State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python

#17044 - fix: use `self.ck_consumer` in `poll` method of kafka.py to align with `__init__`

Pull Request - State: open - Opened by a-hirota about 1 month ago - 2 comments
Labels: Python, Java, cudf.pandas, cudf.polars, pylibcudf

#17043 - [FEA] Provide more information on reasons for CPU fallback in the cudf.pandas profiler

Issue - State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python, cudf.pandas

#17042 - make conda installs in CI stricter (part 2)

Pull Request - State: closed - Opened by jameslamb about 1 month ago - 1 comment
Labels: improvement, non-breaking

#17041 - docs: change 'CSV' to 'csv' in python/custreamz/README.md to match kafka.py

Pull Request - State: closed - Opened by a-hirota about 1 month ago - 3 comments
Labels: doc, Python, non-breaking

#17040 - [FEA] Create `pylibcudf.Table` from a `table_view` and an arbitrary owning object

Issue - State: open - Opened by madsbk about 1 month ago - 3 comments
Labels: feature request, pylibcudf

#17040 - [FEA] Create `pylibcudf.Table` from a `table_view` and an arbitrary owning object

Issue - State: open - Opened by madsbk about 1 month ago - 3 comments
Labels: feature request, pylibcudf

#17039 - Use managed memory for NDSH benchmarks

Pull Request - State: open - Opened by karthikeyann about 1 month ago - 1 comment
Labels: 2 - In Progress, tests, libcudf, CMake, Performance, improvement, non-breaking

#17038 - Special treatment of Grace Hopper on the choice of default env var

Pull Request - State: open - Opened by kingcrimsontianyu about 1 month ago
Labels: feature request, libcudf, non-breaking

#17037 - Add conda recipe for cudf-polars

Pull Request - State: open - Opened by bdice about 1 month ago
Labels: feature request, non-breaking

#17037 - Add conda recipe for cudf-polars

Pull Request - State: open - Opened by bdice about 1 month ago
Labels: feature request, non-breaking

#17036 - [FEA] Use `except +libcudf_exception_handler` for all Cython function declarations in `pylibcudf/libcudf`

Issue - State: open - Opened by mroeschke about 1 month ago
Labels: feature request, pylibcudf

#17035 - Implement batch construction for strings columns

Pull Request - State: open - Opened by ttnghia about 1 month ago
Labels: feature request, 2 - In Progress, libcudf, CMake, Performance, Spark, strings, non-breaking