Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / rapidsai/cudf issues and pull requests
#17035 - Implement batch construction for strings columns
Pull Request -
State: open - Opened by ttnghia about 1 month ago
Labels: feature request, 2 - In Progress, libcudf, CMake, Performance, Spark, strings, non-breaking
#17034 - Clean up hash-groupby `var_hash_functor`
Pull Request -
State: open - Opened by PointKernel about 1 month ago
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking
#17033 - [FEA] Use NVTX extended payloads to add row counts to nvtx ranges for libcudf APIs
Issue -
State: open - Opened by GregoryKimball about 1 month ago
Labels: feature request, libcudf
#17032 - [FEA] Refactor to eliminate redundant device aggregation logic
Issue -
State: open - Opened by PointKernel about 1 month ago
Labels: libcudf, improvement
#17031 - Add device aggregators used by shared memory groupby
Pull Request -
State: open - Opened by PointKernel about 1 month ago
Labels: feature request, 3 - Ready for Review, libcudf, non-breaking
#17030 - disable array of arrays for recovery with null
Pull Request -
State: open - Opened by karthikeyann about 1 month ago
Labels: bug, libcudf, non-breaking
#17030 - disable array of arrays for recovery with null
Pull Request -
State: closed - Opened by karthikeyann about 1 month ago
- 1 comment
Labels: bug, libcudf, non-breaking
#17029 - Add optional column_order in JSON reader
Pull Request -
State: open - Opened by karthikeyann about 1 month ago
- 5 comments
Labels: libcudf, 5 - Ready to Merge, improvement, non-breaking
#17028 - add option to nullify empty lines
Pull Request -
State: closed - Opened by karthikeyann about 1 month ago
- 2 comments
Labels: feature request, 2 - In Progress, libcudf, cuIO, non-breaking
#17027 - Forward-merge branch-24.10 into branch-24.12
Pull Request -
State: closed - Opened by rapids-bot[bot] about 1 month ago
- 1 comment
#17026 - Disable kvikio remote I/O to avoid openssl dependencies in JNI build
Pull Request -
State: closed - Opened by pxLi about 1 month ago
- 3 comments
Labels: bug, Java, non-breaking, cuDF (Java)
#17025 - Add json APIs to pylibcudf
Pull Request -
State: open - Opened by mroeschke about 1 month ago
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#17025 - Add json APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#17024 - Test
Pull Request -
State: closed - Opened by vyasr about 1 month ago
- 1 comment
#17023 - Add string.replace_re APIs to pylibcudf
Pull Request -
State: open - Opened by mroeschke about 1 month ago
- 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#17022 - [DO NOT MERGE] Testing arrow seg fault
Pull Request -
State: closed - Opened by vyasr about 1 month ago
- 1 comment
Labels: Python, CMake, cudf.pandas, cudf.polars
#17021 - Migrate Min Hashing APIs to pylibcudf
Pull Request -
State: open - Opened by Matt711 about 1 month ago
- 1 comment
Labels: feature request, Python, CMake, non-breaking, pylibcudf
#17021 - Migrate Min Hashing APIs to pylibcudf
Pull Request -
State: closed - Opened by Matt711 about 1 month ago
- 2 comments
Labels: feature request, Python, CMake, non-breaking, pylibcudf
#17020 - Fix `host_span` constructor to correctly copy `is_device_accessible`
Pull Request -
State: closed - Opened by vuule about 1 month ago
- 1 comment
Labels: bug, libcudf, non-breaking
#17019 - Replace old host tree algorithm with new algorithm in JSON reader
Pull Request -
State: open - Opened by karthikeyann about 1 month ago
Labels: 3 - Ready for Review, libcudf, cuIO, improvement, non-breaking
#17019 - Replace old host tree algorithm with new algorithm in JSON reader
Pull Request -
State: open - Opened by karthikeyann about 1 month ago
- 1 comment
Labels: 3 - Ready for Review, libcudf, cuIO, improvement, non-breaking
#17018 - Add pinning for pyarrow in wheels
Pull Request -
State: closed - Opened by vyasr about 1 month ago
- 1 comment
Labels: bug, Python, non-breaking, cudf.polars, pylibcudf
#17017 - [BUG] Limit JSON reader input to 2GiB
Issue -
State: open - Opened by karthikeyann about 1 month ago
- 3 comments
Labels: bug, cuIO
#17016 - Unify treatment of `Expr` and `IR` nodes in cudf-polars DSL
Pull Request -
State: open - Opened by wence- about 1 month ago
- 5 comments
Labels: Python, improvement, breaking, cudf.polars
#17015 - Use std::optional for host types
Pull Request -
State: closed - Opened by robertmaynard about 1 month ago
- 3 comments
Labels: bug, libcudf, non-breaking
#17014 - Reorganize `cudf_polars` expression code
Pull Request -
State: closed - Opened by brandon-b-miller about 1 month ago
- 2 comments
Labels: feature request, Python, non-breaking, cudf.polars
#17013 - make conda installs in CI stricter
Pull Request -
State: closed - Opened by jameslamb about 1 month ago
- 1 comment
Labels: 3 - Ready for Review, improvement, non-breaking
#17012 - Pylibcudf: pack and unpack
Pull Request -
State: closed - Opened by madsbk about 1 month ago
- 3 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#17011 - Use block per string for super long strings in cudf::strings::find()
Pull Request -
State: open - Opened by sarda-devesh about 1 month ago
- 8 comments
Labels: libcudf, improvement, non-breaking
#17011 - Use block per string for super long strings in cudf::strings::find()
Pull Request -
State: open - Opened by sarda-devesh about 1 month ago
- 9 comments
Labels: libcudf, improvement, non-breaking
#17010 - Remove unneeded pylibcudf.libcudf.wrappers.duration usage in cudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 5 comments
Labels: Python, improvement, non-breaking
#17009 - Add custom "fused" groupby aggregation to Dask cuDF
Pull Request -
State: open - Opened by rjzamora about 1 month ago
Labels: 3 - Ready for Review, Python, dask, improvement, non-breaking
#17008 - Make tests more deterministic
Pull Request -
State: open - Opened by galipremsagar about 1 month ago
- 2 comments
Labels: Python, improvement, non-breaking, cudf.pandas
#17007 - Migrate nvtext jaccard API to pylibcudf
Pull Request -
State: closed - Opened by Matt711 about 1 month ago
- 1 comment
Labels: feature request, 3 - Ready for Review, Python, CMake, non-breaking, pylibcudf
#17006 - Migrate nvtext generate_ngrams APIs to pylibcudf
Pull Request -
State: closed - Opened by Matt711 about 1 month ago
- 5 comments
Labels: feature request, Python, CMake, non-breaking, pylibcudf
#17005 - Remove unused import
Pull Request -
State: closed - Opened by Matt711 about 1 month ago
- 2 comments
Labels: Python, improvement, non-breaking
#17004 - Environment variables to configure file data source
Pull Request -
State: open - Opened by vuule about 1 month ago
Labels: feature request, libcudf, cuIO, non-breaking
#17003 - Add string.convert.convert_urls APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 1 comment
Labels: libcudf, Python, CMake, improvement, non-breaking, pylibcudf
#17002 - [FEA] Implement a better JNI function to assemble the output columns from `cudf::read_json`
Issue -
State: open - Opened by ttnghia about 1 month ago
Labels: feature request, Spark
#17001 - Add release tracking to project automation scripts
Pull Request -
State: closed - Opened by jarmak-nv about 1 month ago
- 1 comment
Labels: improvement, non-breaking
#17000 - Implement inequality joins by translation to conditional joins
Pull Request -
State: open - Opened by wence- about 1 month ago
- 3 comments
Labels: Python, improvement, non-breaking, cudf.polars
#16999 - [BUG] `cudf::read_json` incorrectly parses invalid JSON string
Issue -
State: open - Opened by ttnghia about 1 month ago
Labels: bug, cuIO
#16998 - Test Ubuntu 24.04 in CI workflows.
Pull Request -
State: open - Opened by bdice about 1 month ago
- 1 comment
Labels: 5 - DO NOT MERGE, improvement, non-breaking
#16997 - Add string.convert.convert_lists APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 3 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#16996 - Performance optimization of JSON validation
Pull Request -
State: closed - Opened by karthikeyann about 1 month ago
- 3 comments
Labels: 3 - Ready for Review, libcudf, cuIO, Performance, Spark, improvement, non-breaking
#16995 - Fix write_json to handle empty string column
Pull Request -
State: closed - Opened by karthikeyann about 1 month ago
- 3 comments
Labels: bug, 3 - Ready for Review, libcudf, cuIO, non-breaking
#16994 - Add string.convert.convert_ipv4 APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#16993 - [FEA] make cudf to_json engine=cudf as default
Issue -
State: open - Opened by karthikeyann about 1 month ago
Labels: feature request, Needs Triage, cudf.pandas
#16992 - [BUG] Error in merge of 8million obs dataset in databricks
Issue -
State: closed - Opened by matt7salomon about 1 month ago
- 2 comments
Labels: bug
#16991 - Add string.convert.convert_integers APIs to pylibcudf
Pull Request -
State: open - Opened by mroeschke about 1 month ago
- 3 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#16990 - Add string.convert_floats APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 2 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#16989 - [FEA] Dynamic filtering during joins
Issue -
State: open - Opened by wence- about 1 month ago
Labels: feature request
#16988 - Restore export of nvcomp outside of wheel builds
Pull Request -
State: closed - Opened by KyleFromNVIDIA about 1 month ago
- 1 comment
Labels: bug, libcudf, Python, CMake, non-breaking
#16987 - [FEA] Improve scaling of data generation in NDS-H-cpp benchmarks
Issue -
State: open - Opened by GregoryKimball about 1 month ago
- 2 comments
Labels: feature request, libcudf
#16986 - [BUG] libcudf install does not install nvcomp dependency
Issue -
State: closed - Opened by jlowe about 1 month ago
- 4 comments
Labels: bug, libcudf, CMake, Spark
#16985 - [BUG] Avoid allocating and using `size_input` vector while computing output col sizes when lists are present.
Issue -
State: open - Opened by mhaseeb123 about 1 month ago
Labels: bug, 1 - On Deck, libcudf, cuIO
#16984 - Add string.convert.convert_fixed_type APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf
#16983 - Remove unnecessary `std::move`'s in pylibcudf
Pull Request -
State: open - Opened by Matt711 about 1 month ago
Labels: 3 - Ready for Review, Python, improvement, non-breaking, pylibcudf
#16982 - Add docstrings and test for strings.convert_durations APIs for pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 1 month ago
- 1 comment
Labels: Python, improvement, non-breaking, pylibcudf
#16981 - Allow melt(var_name=) to be a falsy label
Pull Request -
State: closed - Opened by mroeschke about 2 months ago
- 1 comment
Labels: bug, Python, non-breaking
#16980 - Fix astype from tz-aware type to tz-aware type
Pull Request -
State: closed - Opened by mroeschke about 2 months ago
- 3 comments
Labels: bug, Python, non-breaking
#16979 - Forward-merge branch-24.10 into branch-24.12
Pull Request -
State: closed - Opened by rapids-bot[bot] about 2 months ago
- 1 comment
Labels: Python, pylibcudf
#16978 - JSON tokenizer memory optimizations
Pull Request -
State: open - Opened by shrshi about 2 months ago
- 4 comments
Labels: libcudf, cuIO, Performance, improvement, non-breaking
#16977 - Turn on `xfail_strict = true` for all python packages
Pull Request -
State: closed - Opened by wence- about 2 months ago
- 2 comments
Labels: Python, improvement, non-breaking, cudf.pandas, cudf.polars, pylibcudf
#16976 - Add license to the pylibcudf wheel
Pull Request -
State: closed - Opened by raydouglass about 2 months ago
- 1 comment
Labels: bug, non-breaking
#16975 - Use `libcudf` wheel from PR rather than nightly for `polars-polars` CI test job
Pull Request -
State: closed - Opened by brandon-b-miller about 2 months ago
- 1 comment
Labels: bug, non-breaking
#16974 - [BUG] some pytest configurations not used
Issue -
State: closed - Opened by wence- about 2 months ago
- 1 comment
Labels: bug
#16973 - [BUG] astype ignores time zone
Issue -
State: closed - Opened by MarcoGorelli about 2 months ago
Labels: bug, Python
#16972 - [BUG] `melt` doesn't respect `var_name=''`
Issue -
State: closed - Opened by MarcoGorelli about 2 months ago
Labels: bug, Python
#16971 - Add string.convert.convert_datetime/convert_booleans APIs to pylibcudf
Pull Request -
State: closed - Opened by mroeschke about 2 months ago
- 1 comment
Labels: Python, CMake, improvement, non-breaking, cudf.polars, pylibcudf
#16970 - [DONT REVIEW] Revert upgrade to nvcomp 4.0.1
Pull Request -
State: open - Opened by gerashegalov about 2 months ago
Labels: libcudf, Python, CMake, Java
#16969 - Auto assign PR to author
Pull Request -
State: open - Opened by Matt711 about 2 months ago
- 1 comment
Labels: 3 - Ready for Review, improvement, non-breaking, ci
#16968 - [BUG] Incorrect read_parquet on spark distributed parquet files.
Issue -
State: open - Opened by matt7salomon about 2 months ago
- 2 comments
Labels: bug
#16967 - [TEST-ONLY] Test cuco hyperloglog version bump
Pull Request -
State: closed - Opened by PointKernel about 2 months ago
- 1 comment
Labels: CMake, 5 - DO NOT MERGE
#16966 - Branch 24.12 merge branch 24.10
Pull Request -
State: closed - Opened by vyasr about 2 months ago
Labels: libcudf
#16965 - [FEA] Implement merged 'mega' kernel to parse leaf-level columns in JSON reader
Issue -
State: open - Opened by shrshi about 2 months ago
Labels: feature request
#16964 - Deprecate support for directly accessing logger
Pull Request -
State: closed - Opened by vyasr about 2 months ago
- 1 comment
Labels: libcudf, improvement, breaking
#16963 - Switched BINARY_OP Benchmarks from GoogleBench to NVBench
Pull Request -
State: closed - Opened by lamarrr about 2 months ago
- 1 comment
Labels: feature request, libcudf, CMake, non-breaking
#16962 - Expunge NamedColumn
Pull Request -
State: closed - Opened by wence- about 2 months ago
- 4 comments
Labels: Python, improvement, non-breaking, cudf.polars
#16961 - OSError: libcudart.so: cannot open shared object file: No such file or directory (Nvidia Studio Driver 561.09)
Issue -
State: closed - Opened by blademoon about 2 months ago
- 11 comments
Labels: bug
#16960 - [FEA] Report all unsupported operations for a query in cudf.polars
Pull Request -
State: closed - Opened by Matt711 about 2 months ago
- 18 comments
Labels: feature request, Python, non-breaking, cudf.polars
#16959 - [FEA] Add JSON reader column projection example
Issue -
State: closed - Opened by GregoryKimball about 2 months ago
- 2 comments
Labels: feature request, libcudf, cuIO
#16958 - Add clang-tidy to CI
Pull Request -
State: open - Opened by vyasr about 2 months ago
- 7 comments
Labels: libcudf, CMake, improvement, non-breaking
#16957 - [FEA] Migrate nvtext/edit_distance APIs to pylibcudf
Pull Request -
State: closed - Opened by Matt711 about 2 months ago
- 1 comment
Labels: feature request, libcudf, Python, CMake, non-breaking, pylibcudf
#16956 - Address all remaining clang-tidy errors
Pull Request -
State: closed - Opened by vyasr about 2 months ago
- 12 comments
Labels: libcudf, CMake, improvement, non-breaking
#16955 - [DOC] Document limitation using `cudf.pandas` proxy arrays
Pull Request -
State: closed - Opened by Matt711 about 2 months ago
- 1 comment
Labels: 3 - Ready for Review, doc, non-breaking
#16954 - Forward-merge branch-24.10 into branch-24.12
Pull Request -
State: closed - Opened by rapids-bot[bot] about 2 months ago
- 1 comment
Labels: libcudf
#16953 - [WIP] Implement contiguous_split in pylibcudf
Pull Request -
State: closed - Opened by Matt711 about 2 months ago
- 1 comment
Labels: feature request, Python, non-breaking, pylibcudf
#16952 - Switched AST benchmarks from GoogleBench to NVBench
Pull Request -
State: closed - Opened by lamarrr about 2 months ago
- 4 comments
Labels: feature request, libcudf, CMake, non-breaking
#16951 - [Feature Request] Make fmt and spdlog Optional with a -D_USE_EXTERNAL_SPDLOG_FMT Flag
Issue -
State: open - Opened by ava6969 about 2 months ago
- 1 comment
Labels: feature request
#16950 - Parse newline as whitespace character while tokenizing JSONL inputs with non-newline delimiter
Pull Request -
State: open - Opened by shrshi about 2 months ago
- 3 comments
Labels: libcudf, ! - Hotfix, cuIO
#16949 - Apply clang-tidy autofixes
Pull Request -
State: closed - Opened by vyasr about 2 months ago
- 1 comment
Labels: libcudf, improvement, non-breaking
#16948 - [FEA] Lift the 2.1B character limit in `STRINGS_NVBENCH`
Issue -
State: open - Opened by GregoryKimball about 2 months ago
Labels: feature request, libcudf, strings