Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / rapidsai/cudf issues and pull requests

#17035 - Implement batch construction for strings columns

Pull Request - State: open - Opened by ttnghia about 1 month ago
Labels: feature request, 2 - In Progress, libcudf, CMake, Performance, Spark, strings, non-breaking

#17034 - Clean up hash-groupby `var_hash_functor`

Pull Request - State: open - Opened by PointKernel about 1 month ago
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#17034 - Clean up hash-groupby `var_hash_functor`

Pull Request - State: open - Opened by PointKernel about 1 month ago
Labels: 3 - Ready for Review, libcudf, improvement, non-breaking

#17033 - [FEA] Use NVTX extended payloads to add row counts to nvtx ranges for libcudf APIs

Issue - State: open - Opened by GregoryKimball about 1 month ago
Labels: feature request, libcudf

#17033 - [FEA] Use NVTX extended payloads to add row counts to nvtx ranges for libcudf APIs

Issue - State: open - Opened by GregoryKimball about 1 month ago
Labels: feature request, libcudf

#17032 - [FEA] Refactor to eliminate redundant device aggregation logic

Issue - State: open - Opened by PointKernel about 1 month ago
Labels: libcudf, improvement

#17032 - [FEA] Refactor to eliminate redundant device aggregation logic

Issue - State: open - Opened by PointKernel about 1 month ago
Labels: libcudf, improvement

#17031 - Add device aggregators used by shared memory groupby

Pull Request - State: open - Opened by PointKernel about 1 month ago
Labels: feature request, 3 - Ready for Review, libcudf, non-breaking

#17031 - Add device aggregators used by shared memory groupby

Pull Request - State: open - Opened by PointKernel about 1 month ago
Labels: feature request, 3 - Ready for Review, libcudf, non-breaking

#17030 - disable array of arrays for recovery with null

Pull Request - State: open - Opened by karthikeyann about 1 month ago
Labels: bug, libcudf, non-breaking

#17030 - disable array of arrays for recovery with null

Pull Request - State: closed - Opened by karthikeyann about 1 month ago - 1 comment
Labels: bug, libcudf, non-breaking

#17029 - Add optional column_order in JSON reader

Pull Request - State: open - Opened by karthikeyann about 1 month ago - 5 comments
Labels: libcudf, 5 - Ready to Merge, improvement, non-breaking

#17028 - add option to nullify empty lines

Pull Request - State: closed - Opened by karthikeyann about 1 month ago - 2 comments
Labels: feature request, 2 - In Progress, libcudf, cuIO, non-breaking

#17027 - Forward-merge branch-24.10 into branch-24.12

Pull Request - State: closed - Opened by rapids-bot[bot] about 1 month ago - 1 comment

#17026 - Disable kvikio remote I/O to avoid openssl dependencies in JNI build

Pull Request - State: closed - Opened by pxLi about 1 month ago - 3 comments
Labels: bug, Java, non-breaking, cuDF (Java)

#17025 - Add json APIs to pylibcudf

Pull Request - State: open - Opened by mroeschke about 1 month ago
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#17025 - Add json APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#17024 - Test

Pull Request - State: closed - Opened by vyasr about 1 month ago - 1 comment

#17024 - Test

Pull Request - State: closed - Opened by vyasr about 1 month ago - 1 comment

#17023 - Add string.replace_re APIs to pylibcudf

Pull Request - State: open - Opened by mroeschke about 1 month ago - 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#17022 - [DO NOT MERGE] Testing arrow seg fault

Pull Request - State: closed - Opened by vyasr about 1 month ago - 1 comment
Labels: Python, CMake, cudf.pandas, cudf.polars

#17022 - [DO NOT MERGE] Testing arrow seg fault

Pull Request - State: closed - Opened by vyasr about 1 month ago - 1 comment
Labels: Python, CMake, cudf.pandas, cudf.polars

#17021 - Migrate Min Hashing APIs to pylibcudf

Pull Request - State: open - Opened by Matt711 about 1 month ago - 1 comment
Labels: feature request, Python, CMake, non-breaking, pylibcudf

#17021 - Migrate Min Hashing APIs to pylibcudf

Pull Request - State: closed - Opened by Matt711 about 1 month ago - 2 comments
Labels: feature request, Python, CMake, non-breaking, pylibcudf

#17020 - Fix `host_span` constructor to correctly copy `is_device_accessible`

Pull Request - State: closed - Opened by vuule about 1 month ago - 1 comment
Labels: bug, libcudf, non-breaking

#17019 - Replace old host tree algorithm with new algorithm in JSON reader

Pull Request - State: open - Opened by karthikeyann about 1 month ago
Labels: 3 - Ready for Review, libcudf, cuIO, improvement, non-breaking

#17019 - Replace old host tree algorithm with new algorithm in JSON reader

Pull Request - State: open - Opened by karthikeyann about 1 month ago - 1 comment
Labels: 3 - Ready for Review, libcudf, cuIO, improvement, non-breaking

#17018 - Add pinning for pyarrow in wheels

Pull Request - State: closed - Opened by vyasr about 1 month ago - 1 comment
Labels: bug, Python, non-breaking, cudf.polars, pylibcudf

#17017 - [BUG] Limit JSON reader input to 2GiB

Issue - State: open - Opened by karthikeyann about 1 month ago - 3 comments
Labels: bug, cuIO

#17016 - Unify treatment of `Expr` and `IR` nodes in cudf-polars DSL

Pull Request - State: open - Opened by wence- about 1 month ago - 5 comments
Labels: Python, improvement, breaking, cudf.polars

#17015 - Use std::optional for host types

Pull Request - State: closed - Opened by robertmaynard about 1 month ago - 3 comments
Labels: bug, libcudf, non-breaking

#17014 - Reorganize `cudf_polars` expression code

Pull Request - State: closed - Opened by brandon-b-miller about 1 month ago - 2 comments
Labels: feature request, Python, non-breaking, cudf.polars

#17013 - make conda installs in CI stricter

Pull Request - State: closed - Opened by jameslamb about 1 month ago - 1 comment
Labels: 3 - Ready for Review, improvement, non-breaking

#17012 - Pylibcudf: pack and unpack

Pull Request - State: closed - Opened by madsbk about 1 month ago - 3 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#17011 - Use block per string for super long strings in cudf::strings::find()

Pull Request - State: open - Opened by sarda-devesh about 1 month ago - 8 comments
Labels: libcudf, improvement, non-breaking

#17011 - Use block per string for super long strings in cudf::strings::find()

Pull Request - State: open - Opened by sarda-devesh about 1 month ago - 9 comments
Labels: libcudf, improvement, non-breaking

#17010 - Remove unneeded pylibcudf.libcudf.wrappers.duration usage in cudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 5 comments
Labels: Python, improvement, non-breaking

#17009 - Add custom "fused" groupby aggregation to Dask cuDF

Pull Request - State: open - Opened by rjzamora about 1 month ago
Labels: 3 - Ready for Review, Python, dask, improvement, non-breaking

#17008 - Make tests more deterministic

Pull Request - State: open - Opened by galipremsagar about 1 month ago - 2 comments
Labels: Python, improvement, non-breaking, cudf.pandas

#17007 - Migrate nvtext jaccard API to pylibcudf

Pull Request - State: closed - Opened by Matt711 about 1 month ago - 1 comment
Labels: feature request, 3 - Ready for Review, Python, CMake, non-breaking, pylibcudf

#17006 - Migrate nvtext generate_ngrams APIs to pylibcudf

Pull Request - State: closed - Opened by Matt711 about 1 month ago - 5 comments
Labels: feature request, Python, CMake, non-breaking, pylibcudf

#17005 - Remove unused import

Pull Request - State: closed - Opened by Matt711 about 1 month ago - 2 comments
Labels: Python, improvement, non-breaking

#17005 - Remove unused import

Pull Request - State: closed - Opened by Matt711 about 1 month ago - 2 comments
Labels: Python, improvement, non-breaking

#17004 - Environment variables to configure file data source

Pull Request - State: open - Opened by vuule about 1 month ago
Labels: feature request, libcudf, cuIO, non-breaking

#17003 - Add string.convert.convert_urls APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 1 comment
Labels: libcudf, Python, CMake, improvement, non-breaking, pylibcudf

#17002 - [FEA] Implement a better JNI function to assemble the output columns from `cudf::read_json`

Issue - State: open - Opened by ttnghia about 1 month ago
Labels: feature request, Spark

#17001 - Add release tracking to project automation scripts

Pull Request - State: closed - Opened by jarmak-nv about 1 month ago - 1 comment
Labels: improvement, non-breaking

#17000 - Implement inequality joins by translation to conditional joins

Pull Request - State: open - Opened by wence- about 1 month ago - 3 comments
Labels: Python, improvement, non-breaking, cudf.polars

#16999 - [BUG] `cudf::read_json` incorrectly parses invalid JSON string

Issue - State: open - Opened by ttnghia about 1 month ago
Labels: bug, cuIO

#16998 - Test Ubuntu 24.04 in CI workflows.

Pull Request - State: open - Opened by bdice about 1 month ago - 1 comment
Labels: 5 - DO NOT MERGE, improvement, non-breaking

#16997 - Add string.convert.convert_lists APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 3 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#16996 - Performance optimization of JSON validation

Pull Request - State: closed - Opened by karthikeyann about 1 month ago - 3 comments
Labels: 3 - Ready for Review, libcudf, cuIO, Performance, Spark, improvement, non-breaking

#16995 - Fix write_json to handle empty string column

Pull Request - State: closed - Opened by karthikeyann about 1 month ago - 3 comments
Labels: bug, 3 - Ready for Review, libcudf, cuIO, non-breaking

#16994 - Add string.convert.convert_ipv4 APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#16993 - [FEA] make cudf to_json engine=cudf as default

Issue - State: open - Opened by karthikeyann about 1 month ago
Labels: feature request, Needs Triage, cudf.pandas

#16992 - [BUG] Error in merge of 8million obs dataset in databricks

Issue - State: closed - Opened by matt7salomon about 1 month ago - 2 comments
Labels: bug

#16991 - Add string.convert.convert_integers APIs to pylibcudf

Pull Request - State: open - Opened by mroeschke about 1 month ago - 3 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#16990 - Add string.convert_floats APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 2 comments
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#16989 - [FEA] Dynamic filtering during joins

Issue - State: open - Opened by wence- about 1 month ago
Labels: feature request

#16988 - Restore export of nvcomp outside of wheel builds

Pull Request - State: closed - Opened by KyleFromNVIDIA about 1 month ago - 1 comment
Labels: bug, libcudf, Python, CMake, non-breaking

#16987 - [FEA] Improve scaling of data generation in NDS-H-cpp benchmarks

Issue - State: open - Opened by GregoryKimball about 1 month ago - 2 comments
Labels: feature request, libcudf

#16986 - [BUG] libcudf install does not install nvcomp dependency

Issue - State: closed - Opened by jlowe about 1 month ago - 4 comments
Labels: bug, libcudf, CMake, Spark

#16985 - [BUG] Avoid allocating and using `size_input` vector while computing output col sizes when lists are present.

Issue - State: open - Opened by mhaseeb123 about 1 month ago
Labels: bug, 1 - On Deck, libcudf, cuIO

#16984 - Add string.convert.convert_fixed_type APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 1 comment
Labels: Python, CMake, improvement, non-breaking, pylibcudf

#16983 - Remove unnecessary `std::move`'s in pylibcudf

Pull Request - State: open - Opened by Matt711 about 1 month ago
Labels: 3 - Ready for Review, Python, improvement, non-breaking, pylibcudf

#16982 - Add docstrings and test for strings.convert_durations APIs for pylibcudf

Pull Request - State: closed - Opened by mroeschke about 1 month ago - 1 comment
Labels: Python, improvement, non-breaking, pylibcudf

#16981 - Allow melt(var_name=) to be a falsy label

Pull Request - State: closed - Opened by mroeschke about 2 months ago - 1 comment
Labels: bug, Python, non-breaking

#16980 - Fix astype from tz-aware type to tz-aware type

Pull Request - State: closed - Opened by mroeschke about 2 months ago - 3 comments
Labels: bug, Python, non-breaking

#16979 - Forward-merge branch-24.10 into branch-24.12

Pull Request - State: closed - Opened by rapids-bot[bot] about 2 months ago - 1 comment
Labels: Python, pylibcudf

#16978 - JSON tokenizer memory optimizations

Pull Request - State: open - Opened by shrshi about 2 months ago - 4 comments
Labels: libcudf, cuIO, Performance, improvement, non-breaking

#16977 - Turn on `xfail_strict = true` for all python packages

Pull Request - State: closed - Opened by wence- about 2 months ago - 2 comments
Labels: Python, improvement, non-breaking, cudf.pandas, cudf.polars, pylibcudf

#16976 - Add license to the pylibcudf wheel

Pull Request - State: closed - Opened by raydouglass about 2 months ago - 1 comment
Labels: bug, non-breaking

#16975 - Use `libcudf` wheel from PR rather than nightly for `polars-polars` CI test job

Pull Request - State: closed - Opened by brandon-b-miller about 2 months ago - 1 comment
Labels: bug, non-breaking

#16974 - [BUG] some pytest configurations not used

Issue - State: closed - Opened by wence- about 2 months ago - 1 comment
Labels: bug

#16973 - [BUG] astype ignores time zone

Issue - State: closed - Opened by MarcoGorelli about 2 months ago
Labels: bug, Python

#16972 - [BUG] `melt` doesn't respect `var_name=''`

Issue - State: closed - Opened by MarcoGorelli about 2 months ago
Labels: bug, Python

#16971 - Add string.convert.convert_datetime/convert_booleans APIs to pylibcudf

Pull Request - State: closed - Opened by mroeschke about 2 months ago - 1 comment
Labels: Python, CMake, improvement, non-breaking, cudf.polars, pylibcudf

#16970 - [DONT REVIEW] Revert upgrade to nvcomp 4.0.1

Pull Request - State: open - Opened by gerashegalov about 2 months ago
Labels: libcudf, Python, CMake, Java

#16969 - Auto assign PR to author

Pull Request - State: open - Opened by Matt711 about 2 months ago - 1 comment
Labels: 3 - Ready for Review, improvement, non-breaking, ci

#16968 - [BUG] Incorrect read_parquet on spark distributed parquet files.

Issue - State: open - Opened by matt7salomon about 2 months ago - 2 comments
Labels: bug

#16967 - [TEST-ONLY] Test cuco hyperloglog version bump

Pull Request - State: closed - Opened by PointKernel about 2 months ago - 1 comment
Labels: CMake, 5 - DO NOT MERGE

#16966 - Branch 24.12 merge branch 24.10

Pull Request - State: closed - Opened by vyasr about 2 months ago
Labels: libcudf

#16965 - [FEA] Implement merged 'mega' kernel to parse leaf-level columns in JSON reader

Issue - State: open - Opened by shrshi about 2 months ago
Labels: feature request

#16964 - Deprecate support for directly accessing logger

Pull Request - State: closed - Opened by vyasr about 2 months ago - 1 comment
Labels: libcudf, improvement, breaking

#16963 - Switched BINARY_OP Benchmarks from GoogleBench to NVBench

Pull Request - State: closed - Opened by lamarrr about 2 months ago - 1 comment
Labels: feature request, libcudf, CMake, non-breaking

#16962 - Expunge NamedColumn

Pull Request - State: closed - Opened by wence- about 2 months ago - 4 comments
Labels: Python, improvement, non-breaking, cudf.polars

#16960 - [FEA] Report all unsupported operations for a query in cudf.polars

Pull Request - State: closed - Opened by Matt711 about 2 months ago - 18 comments
Labels: feature request, Python, non-breaking, cudf.polars

#16959 - [FEA] Add JSON reader column projection example

Issue - State: closed - Opened by GregoryKimball about 2 months ago - 2 comments
Labels: feature request, libcudf, cuIO

#16958 - Add clang-tidy to CI

Pull Request - State: open - Opened by vyasr about 2 months ago - 7 comments
Labels: libcudf, CMake, improvement, non-breaking

#16957 - [FEA] Migrate nvtext/edit_distance APIs to pylibcudf

Pull Request - State: closed - Opened by Matt711 about 2 months ago - 1 comment
Labels: feature request, libcudf, Python, CMake, non-breaking, pylibcudf

#16956 - Address all remaining clang-tidy errors

Pull Request - State: closed - Opened by vyasr about 2 months ago - 12 comments
Labels: libcudf, CMake, improvement, non-breaking

#16955 - [DOC] Document limitation using `cudf.pandas` proxy arrays

Pull Request - State: closed - Opened by Matt711 about 2 months ago - 1 comment
Labels: 3 - Ready for Review, doc, non-breaking

#16954 - Forward-merge branch-24.10 into branch-24.12

Pull Request - State: closed - Opened by rapids-bot[bot] about 2 months ago - 1 comment
Labels: libcudf

#16953 - [WIP] Implement contiguous_split in pylibcudf

Pull Request - State: closed - Opened by Matt711 about 2 months ago - 1 comment
Labels: feature request, Python, non-breaking, pylibcudf

#16952 - Switched AST benchmarks from GoogleBench to NVBench

Pull Request - State: closed - Opened by lamarrr about 2 months ago - 4 comments
Labels: feature request, libcudf, CMake, non-breaking

#16951 - [Feature Request] Make fmt and spdlog Optional with a -D_USE_EXTERNAL_SPDLOG_FMT Flag

Issue - State: open - Opened by ava6969 about 2 months ago - 1 comment
Labels: feature request

#16950 - Parse newline as whitespace character while tokenizing JSONL inputs with non-newline delimiter

Pull Request - State: open - Opened by shrshi about 2 months ago - 3 comments
Labels: libcudf, ! - Hotfix, cuIO

#16949 - Apply clang-tidy autofixes

Pull Request - State: closed - Opened by vyasr about 2 months ago - 1 comment
Labels: libcudf, improvement, non-breaking

#16948 - [FEA] Lift the 2.1B character limit in `STRINGS_NVBENCH`

Issue - State: open - Opened by GregoryKimball about 2 months ago
Labels: feature request, libcudf, strings