Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / rapidsai/cudf issues and pull requests
#17179 - Make ai.rapids.cudf.HostMemoryBuffer#copyFromStream public.
Pull Request -
State: open - Opened by liurenjie1024 23 days ago
- 5 comments
Labels: Java
#17178 - [WIP] Nullify non-struct rows in JSONL inputs
Pull Request -
State: closed - Opened by shrshi 23 days ago
- 2 comments
Labels: bug, libcudf, cuIO, non-breaking
#17177 - [BUG] dask-cudf.to_parquet(write_metadata_file=True, append=True) fails
Issue -
State: open - Opened by ayushdg 23 days ago
- 1 comment
Labels: bug, Python
#17176 - Support storing `precision` of decimal types in `Schema` class
Pull Request -
State: open - Opened by ttnghia 23 days ago
- 6 comments
Labels: feature request, 3 - Ready for Review, Java, Spark, non-breaking
#17175 - Separate evaluation logic from `IR` objects in cudf-polars
Pull Request -
State: closed - Opened by rjzamora 23 days ago
- 4 comments
Labels: Python, 4 - Needs Review, improvement, breaking, cudf.polars
#17174 - [FEA] dt.date (mirror pyarrow-backed pandas behaviour)
Issue -
State: open - Opened by MarcoGorelli 23 days ago
Labels: feature request
#17173 - Move nvtext ngrams benchmarks to nvbench
Pull Request -
State: closed - Opened by davidwendt 23 days ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking
#17172 - [PERF] Remove stream sync in concatenate for better pipelining
Issue -
State: open - Opened by tgujar 23 days ago
- 1 comment
Labels: libcudf, Performance, improvement
#17171 - [FEA] Remove redundant aggregatiom details in shared memory groupby
Issue -
State: open - Opened by PointKernel 23 days ago
Labels: feature request, libcudf
#17170 - Remove includes suggested by include-what-you-use
Pull Request -
State: closed - Opened by vyasr 23 days ago
- 8 comments
Labels: libcudf, improvement, non-breaking
#17169 - [FEA] Add libcudf example showing `MERGE_LISTS` aggregation kind
Issue -
State: open - Opened by GregoryKimball 23 days ago
Labels: feature request, libcudf
#17168 - [BUG] Make `__iter__` a fast-slow attribute of the `pd.Series` proxy type
Pull Request -
State: open - Opened by Matt711 24 days ago
- 2 comments
Labels: bug, Python, non-breaking, cudf.pandas
#17167 - [FEA] Support column factory function `all_nulls_like`
Issue -
State: open - Opened by ttnghia 24 days ago
- 5 comments
Labels: feature request
#17166 - [BUG] Series.plot method populates pd.util.version when run under cudf.pandas
Issue -
State: closed - Opened by wphicks 24 days ago
Labels: bug, Python, cudf.pandas
#17165 - [BUG] Incorrect dtype when iterating over dtypes in cudf.pandas
Issue -
State: open - Opened by wphicks 24 days ago
- 2 comments
Labels: bug
#17164 - [FEA] Use bloom filters in Parquet reader to filter row groups with equality predicates
Issue -
State: open - Opened by mhaseeb123 24 days ago
Labels: feature request, libcudf, cuIO, improvement, cuco
#17163 - Migrate CSV writer to pylibcudf
Pull Request -
State: open - Opened by Matt711 24 days ago
Labels: feature request, libcudf, Python, non-breaking, pylibcudf
#17162 - Add compute_shared_memory_aggs used by shared memory groupby
Pull Request -
State: open - Opened by PointKernel 24 days ago
Labels: feature request, 3 - Ready for Review, libcudf, CMake, non-breaking
#17161 - Reading multi-source compressed JSONL files
Pull Request -
State: open - Opened by shrshi 24 days ago
- 4 comments
Labels: libcudf, CMake, cuIO, improvement, non-breaking
#17158 - [BUG] Chunked parquet reader incorrect results for large string columns
Issue -
State: open - Opened by brandon-b-miller 24 days ago
Labels: bug, cuIO, cudf.polars
#17157 - Process parquet bools with microkernels
Pull Request -
State: open - Opened by pmattione-nvidia 24 days ago
Labels: libcudf, Performance, improvement, non-breaking
#17156 - Added ast tree to simplify expression lifetime management
Pull Request -
State: open - Opened by lamarrr 24 days ago
Labels: feature request, libcudf, CMake, non-breaking
#17155 - [BUG] Misaligned timestamps produced by ORC reader
Issue -
State: open - Opened by mlahir1 24 days ago
- 3 comments
Labels: bug, libcudf, cuIO
#17154 - Upgrade to polars 1.11 in cudf-polars
Pull Request -
State: closed - Opened by wence- 24 days ago
- 2 comments
Labels: Python, improvement, breaking, cudf.polars
#17153 - [BUG] INTEROP_TEST fails on a libcudf debug build at ToArrowTest.NestedList
Issue -
State: open - Opened by davidwendt 24 days ago
- 2 comments
Labels: bug, libcudf
#17152 - Deprecate current libcudf nvtext minhash functions
Pull Request -
State: closed - Opened by davidwendt 24 days ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking
#17151 - Remove unused variable in internal merge_tdigests utility
Pull Request -
State: closed - Opened by davidwendt 24 days ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#17149 - Fix host-to-device copy missing sync in strings/duration convert
Pull Request -
State: closed - Opened by davidwendt 24 days ago
- 1 comment
Labels: bug, 3 - Ready for Review, libcudf, non-breaking
#17148 - Move `segmented_gather` function from the copying module to the lists module
Pull Request -
State: closed - Opened by Matt711 25 days ago
- 1 comment
Labels: Python, improvement, non-breaking
#17147 - Add compute_mapping_indices used by shared memory groupby
Pull Request -
State: closed - Opened by PointKernel 25 days ago
- 3 comments
Labels: feature request, 3 - Ready for Review, libcudf, CMake, non-breaking
#17145 - [test] compression benchmarks
Pull Request -
State: open - Opened by vuule 25 days ago
Labels: libcudf
#17144 - [FEA] Support for dynamic groupby in cudf-polars
Issue -
State: open - Opened by wence- 25 days ago
- 2 comments
Labels: feature request, cudf.polars
#17143 - Add remaining datetime APIs to pylibcudf
Pull Request -
State: open - Opened by Matt711 25 days ago
Labels: feature request, Python, non-breaking, pylibcudf
#17142 - [FEA] Parquet reader filter improvements
Issue -
State: open - Opened by wence- 25 days ago
- 4 comments
Labels: feature request, libcudf, cuIO
#17141 - Add conversion from cudf-polars expressions to libcudf ast for parquet filters
Pull Request -
State: open - Opened by wence- 25 days ago
Labels: Python, improvement, non-breaking, cudf.polars, pylibcudf
#17140 - [BUG] Slow Performance of cuDF Pandas on L4
Issue -
State: closed - Opened by ericphan-nv 26 days ago
- 2 comments
Labels: bug
#17138 - JSON spark reader plan for 24.12
Issue -
State: open - Opened by karthikeyann 26 days ago
Labels: cuIO, Spark, improvement
#17134 - devcontainer: replace `VAULT_HOST` with `AWS_ROLE_ARN`
Pull Request -
State: closed - Opened by jjacobelli 26 days ago
- 1 comment
Labels: improvement, non-breaking
#17133 - Support hyper log log plus plus(HLL++)
Pull Request -
State: open - Opened by res-life 26 days ago
- 2 comments
Labels: libcudf, CMake, Java
#17131 - use rapids-generate-pip-constraints to pin to oldest dependencies in CI
Pull Request -
State: closed - Opened by jameslamb 29 days ago
- 1 comment
Labels: improvement, non-breaking
#17128 - Added strings AST vs BINARY_OP benchmarks
Pull Request -
State: closed - Opened by lamarrr 29 days ago
- 5 comments
Labels: feature request, libcudf, non-breaking
#17127 - [FEA] [Proposal] Separate IR evaluation logic from the IR object in cudf-polars
Issue -
State: closed - Opened by rjzamora 29 days ago
- 1 comment
Labels: feature request, proposal, improvement, cudf.polars
#17119 - [Story] Enable multithreading in cuIO and libcudf
Issue -
State: open - Opened by mhaseeb123 30 days ago
Labels: proposal
#17116 - [DO NOT MERGE/REVIEW] GDS debugging
Pull Request -
State: open - Opened by kingcrimsontianyu 30 days ago
- 1 comment
Labels: libcudf
#17115 - Improve cudf::io::datasource::create().
Pull Request -
State: open - Opened by tpn about 1 month ago
- 5 comments
Labels: libcudf
#17114 - [FEA] Faster path for calculating total output symbols in FST
Issue -
State: open - Opened by karthikeyann about 1 month ago
- 4 comments
Labels: feature request
#17109 - Use `libcudf_exception_handler` throughout `pylibcudf.libcudf`
Pull Request -
State: open - Opened by brandon-b-miller about 1 month ago
- 1 comment
Labels: feature request, Python, non-breaking, pylibcudf
#17108 - [BUG] JSON reader metadata contains in an extra child for string column inside (deeply) nested structs/lists
Issue -
State: closed - Opened by mhaseeb123 about 1 month ago
- 2 comments
Labels: bug
#17106 - [FEA] Improve WSL testing
Issue -
State: open - Opened by vyasr about 1 month ago
- 1 comment
Labels: feature request
#17105 - Minor I/O code quality improvements
Pull Request -
State: closed - Opened by kingcrimsontianyu about 1 month ago
- 1 comment
Labels: libcudf, improvement, non-breaking
#17101 - Migrate NVText Byte Pair Encoding APIs to pylibcudf
Pull Request -
State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, CMake, non-breaking, pylibcudf
#17100 - Migrate NVText Tokenizing APIs to pylibcudf
Pull Request -
State: open - Opened by Matt711 about 1 month ago
- 1 comment
Labels: feature request, libcudf, Python, CMake, non-breaking, pylibcudf
#17098 - Fix bug in recovering invalid lines in JSONL inputs
Pull Request -
State: open - Opened by shrshi about 1 month ago
- 3 comments
Labels: bug, libcudf, cuIO, 5 - DO NOT MERGE, non-breaking
#17096 - Migrate NVtext subword tokenizing APIs to pylibcudf
Pull Request -
State: open - Opened by Matt711 about 1 month ago
- 1 comment
Labels: feature request, Python, CMake, non-breaking, pylibcudf
#17088 - build wheels without build isolation
Pull Request -
State: closed - Opened by jameslamb about 1 month ago
- 1 comment
Labels: improvement, non-breaking
#17087 - [BUG] datetime comparison is sensitive to ordering with different precision
Issue -
State: closed - Opened by hoxbro about 1 month ago
- 1 comment
Labels: bug, Python
#17080 - [FEA] Do not convert decimal32/64 cols to decimal128 in `to_arrow` API and PQ writer when arrow schema is in use
Issue -
State: open - Opened by mhaseeb123 about 1 month ago
- 1 comment
Labels: feature request, 0 - Blocked
#17078 - Add IWYU to CI
Pull Request -
State: closed - Opened by vyasr about 1 month ago
- 3 comments
Labels: feature request, libcudf, CMake, non-breaking
#17076 - `cudf-polars` string/numeric casting
Pull Request -
State: open - Opened by brandon-b-miller about 1 month ago
- 4 comments
Labels: feature request, libcudf, Python, non-breaking, cudf.polars, pylibcudf
#17073 - [BUG] Off-by-1 error in `__floordiv__` binary op
Issue -
State: open - Opened by galipremsagar about 1 month ago
Labels: bug
#17072 - [WIP] Migrate NVText Normalizing APIs to Pylibcudf
Pull Request -
State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, CMake, non-breaking, pylibcudf
#17071 - [Do not review] Test stream 2
Pull Request -
State: open - Opened by ttnghia about 1 month ago
Labels: libcudf, CMake
#17070 - Migrate remaining nvtext NGrams APIs to pylibcudf
Pull Request -
State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, CMake, non-breaking, pylibcudf
#17069 - [Do not review] Test stream 1
Pull Request -
State: open - Opened by ttnghia about 1 month ago
Labels: libcudf, CMake
#17068 - [BUG] cuDF.read_json fails with cudaErrorInvalidValue invalid argument
Issue -
State: open - Opened by ayushdg about 1 month ago
- 1 comment
Labels: bug
#17067 - Fix regex handling of fixed quantifier with 0 range
Pull Request -
State: open - Opened by davidwendt about 1 month ago
Labels: bug, 3 - Ready for Review, libcudf, strings, non-breaking
#17066 - Add profilers to CUDA 12 conda devcontainers
Pull Request -
State: open - Opened by vyasr about 1 month ago
Labels: feature request, non-breaking
#17065 - [BUG] 0 range regexp appear to be broken
Issue -
State: open - Opened by revans2 about 1 month ago
Labels: bug, libcudf
#17064 - Use the "Force CI run" feature of changed-files workflow
Pull Request -
State: open - Opened by KyleFromNVIDIA about 1 month ago
Labels: 5 - DO NOT MERGE
#17063 - [DO NOT REVIEW/MERGE] GDS test
Pull Request -
State: open - Opened by kingcrimsontianyu about 1 month ago
Labels: libcudf
#17062 - Polars: DataFrame Serialization
Pull Request -
State: open - Opened by madsbk about 1 month ago
- 6 comments
Labels: Python, improvement, non-breaking, cudf.polars
#17061 - [BUG] Without providing metadata cannot round-trip struct columns via interop/from/to_arrow
Issue -
State: open - Opened by wence- about 1 month ago
Labels: bug, pylibcudf
#17060 - [FEA] Support missing casting operations in `cudf-polars`
Issue -
State: open - Opened by brandon-b-miller about 1 month ago
Labels: feature request, cudf.polars
#17059 - :construction: Limit the number of keys to calculate column sizes and page starts in PQ reader to 1B
Pull Request -
State: open - Opened by mhaseeb123 about 1 month ago
Labels: bug, 2 - In Progress, libcudf, cuIO, non-breaking
#17058 - [BUG] Limit size of buffer read by batched multi-source JSON lines reader to be at most `INT_MAX` bytes
Issue -
State: open - Opened by shrshi about 1 month ago
Labels: bug
#17057 - Adding assertion to check for regular JSON inputs of size greater than `INT_MAX` bytes
Pull Request -
State: open - Opened by shrshi about 1 month ago
- 1 comment
Labels: bug, libcudf, cuIO, non-breaking
#17056 - Remove unused hash helper functions
Pull Request -
State: closed - Opened by PointKernel about 1 month ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#17055 - Add to_dlpack/from_dlpack APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 1 comment
Labels: Python, improvement, non-breaking, pylibcudf
#17054 - [FEA] Support direct construction of pylibcudf Scalars
Issue -
State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python, pylibcudf
#17053 - Move `flatten_single_pass_aggs` to its own TU
Pull Request -
State: closed - Opened by PointKernel about 1 month ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, CMake, improvement, non-breaking
#17052 - Replace deprecated cuco APIs with updated versions
Pull Request -
State: closed - Opened by PointKernel about 1 month ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking, cuco
#17051 - [FEA] Display operations unsupported by the GPU in a user-friendly way
Issue -
State: open - Opened by Matt711 about 1 month ago
Labels: feature request, Python, cudf.polars
#17050 - [FEA] Standardize approach for various TPC-H-adjacent benchmarks
Issue -
State: open - Opened by vyasr about 1 month ago
Labels: feature request, proposal, Performance
#17049 - Refactor ORC dictionary encoding to migrate to the new `cuco::static_map`
Pull Request -
State: closed - Opened by mhaseeb123 about 1 month ago
- 2 comments
Labels: 3 - Ready for Review, libcudf, cuIO, improvement, non-breaking, cuco
#17048 - Move pylibcudf/libcudf/wrappers/decimals to pylibcudf/libcudf/fixed_point
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 1 comment
Labels: Python, improvement, non-breaking, pylibcudf
#17047 - [FEA] Support GPUs in the Polars eager API
Issue -
State: closed - Opened by vyasr about 1 month ago
- 1 comment
Labels: feature request, Python, cudf.polars
#17046 - [FEA] Make pyarrow an optional dependency of pylibcudf
Issue -
State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python, pylibcudf
#17045 - [FEA] Make cudf tests deterministic
Issue -
State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python
#17044 - fix: use `self.ck_consumer` in `poll` method of kafka.py to align with `__init__`
Pull Request -
State: open - Opened by a-hirota about 1 month ago
- 2 comments
Labels: Python, Java, cudf.pandas, cudf.polars, pylibcudf
#17043 - [FEA] Provide more information on reasons for CPU fallback in the cudf.pandas profiler
Issue -
State: open - Opened by vyasr about 1 month ago
Labels: feature request, Python, cudf.pandas
#17042 - make conda installs in CI stricter (part 2)
Pull Request -
State: closed - Opened by jameslamb about 1 month ago
- 1 comment
Labels: improvement, non-breaking
#17041 - docs: change 'CSV' to 'csv' in python/custreamz/README.md to match kafka.py
Pull Request -
State: closed - Opened by a-hirota about 1 month ago
- 3 comments
Labels: doc, Python, non-breaking
#17040 - [FEA] Create `pylibcudf.Table` from a `table_view` and an arbitrary owning object
Issue -
State: open - Opened by madsbk about 1 month ago
- 3 comments
Labels: feature request, pylibcudf
#17040 - [FEA] Create `pylibcudf.Table` from a `table_view` and an arbitrary owning object
Issue -
State: open - Opened by madsbk about 1 month ago
- 3 comments
Labels: feature request, pylibcudf
#17039 - Use managed memory for NDSH benchmarks
Pull Request -
State: open - Opened by karthikeyann about 1 month ago
- 1 comment
Labels: 2 - In Progress, tests, libcudf, CMake, Performance, improvement, non-breaking
#17038 - Special treatment of Grace Hopper on the choice of default env var
Pull Request -
State: open - Opened by kingcrimsontianyu about 1 month ago
Labels: feature request, libcudf, non-breaking
#17037 - Add conda recipe for cudf-polars
Pull Request -
State: open - Opened by bdice about 1 month ago
Labels: feature request, non-breaking
#17037 - Add conda recipe for cudf-polars
Pull Request -
State: open - Opened by bdice about 1 month ago
Labels: feature request, non-breaking
#17036 - [FEA] Use `except +libcudf_exception_handler` for all Cython function declarations in `pylibcudf/libcudf`
Issue -
State: open - Opened by mroeschke about 1 month ago
Labels: feature request, pylibcudf
#17035 - Implement batch construction for strings columns
Pull Request -
State: open - Opened by ttnghia about 1 month ago
Labels: feature request, 2 - In Progress, libcudf, CMake, Performance, Spark, strings, non-breaking