Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / NVIDIA/spark-rapids issues and pull requests

#11443 - Implement IGNORE NULLS for LEAD/LAG window functions

Issue - State: open - Opened by mythrocks 27 days ago - 1 comment

#11442 - [FEA] Add in support for setting row group sizes for parquet

Issue - State: closed - Opened by revans2 27 days ago - 1 comment
Labels: feature request

#11440 - [FEA] [SPARK-48947][SQL] Use lowercased charset name to decrease cache missing in Charset.forName

Issue - State: open - Opened by revans2 27 days ago
Labels: feature request, performance, audit_4.0.0

#11440 - [FEA] [SPARK-48947][SQL] Use lowercased charset name to decrease cache missing in Charset.forName

Issue - State: open - Opened by revans2 27 days ago
Labels: feature request, ? - Needs Triage, performance, audit_4.0.0

#11439 - [BUG] [SPARK-49205][SQL] KeyGroupedPartitioning should inherit HashPartitioningLike

Issue - State: open - Opened by revans2 27 days ago
Labels: bug, audit_4.0.0

#11437 - [BUG] array and map casts to string tests failed

Issue - State: closed - Opened by jlowe 27 days ago - 3 comments
Labels: bug, cudf_dependency

#11436 - [BUG] Mortgage unit tests fail with RAPIDS shuffle manager

Issue - State: closed - Opened by jlowe 27 days ago - 1 comment
Labels: bug

#11434 - schema mismatch failure error message for parquet reader

Issue - State: open - Opened by Feng-Jiang28 28 days ago - 1 comment
Labels: bug

#11433 - SPARK-34212 Parquet should read decimals correctly

Issue - State: open - Opened by Feng-Jiang28 28 days ago
Labels: bug, good first issue

#11432 - [BUG] SPARK UT Framework showing tests passed, but Exception printed without failing UT.

Issue - State: open - Opened by Feng-Jiang28 28 days ago
Labels: bug, ? - Needs Triage, test

#11430 - [BUG] Issues found by Spark UT Framework of RapidsParquetPartitionDiscoverySuite

Issue - State: open - Opened by Feng-Jiang28 28 days ago
Labels: bug, ? - Needs Triage

#11429 - Fixed some of the failing parquet_tests [databricks]

Pull Request - State: closed - Opened by razajafri 28 days ago - 6 comments
Labels: bug, Spark 4.0+

#11428 - [BUG] fast dist assembly no longer possible due to artifacts missing

Issue - State: open - Opened by gerashegalov 28 days ago - 1 comment
Labels: bug, build

#11427 - Update CI scripts to work with the "Dynamic Shim Detection" change [skip ci]

Pull Request - State: closed - Opened by NvTimLiu 29 days ago - 3 comments
Labels: build

#11426 - [BUG] Allow memory allocation from pinned memory pool to avoid task fail

Issue - State: open - Opened by winningsix 29 days ago - 1 comment
Labels: bug, cudf_dependency, reliability

#11425 - Update signoff usage [skip ci]

Pull Request - State: closed - Opened by pxLi 29 days ago - 1 comment
Labels: build

#11422 - [FEA] Remove workaround for array_join

Issue - State: open - Opened by revans2 30 days ago
Labels: feature request

#11421 - [DOC] remove the redundant archive link [skip ci]

Pull Request - State: closed - Opened by nvliyuan about 1 month ago - 3 comments
Labels: documentation

#11420 - Add in array_join support

Pull Request - State: closed - Opened by revans2 about 1 month ago - 1 comment
Labels: feature request

#11419 - [FEA] Support Spark 3.5.3 release

Issue - State: open - Opened by tgravescs about 1 month ago
Labels: feature request

#11418 - stop using copyWithBooleanColumnAsValidity [databricks]

Pull Request - State: closed - Opened by res-life about 1 month ago - 3 comments
Labels: bug

#11417 - [DOC]In archived release page, one redundant archived link exists.

Issue - State: closed - Opened by GaryShen2008 about 1 month ago
Labels: documentation

#11416 - [BUG] Create parquet table with compression

Issue - State: open - Opened by Feng-Jiang28 about 1 month ago - 4 comments
Labels: bug

#11415 - Create parquet table with compression

Issue - State: closed - Opened by Feng-Jiang28 about 1 month ago
Labels: invalid

#11414 - Fix `collection_ops_tests` for Spark 4.0

Pull Request - State: open - Opened by mythrocks about 1 month ago - 1 comment
Labels: Spark 4.0+

#11414 - Fix `collection_ops_tests` for Spark 4.0

Pull Request - State: open - Opened by mythrocks about 1 month ago - 3 comments
Labels: Spark 4.0+

#11413 - Support multi string contains [databricks]

Pull Request - State: open - Opened by res-life about 1 month ago - 1 comment
Labels: performance

#11412 - [FEA][Follow on] Improve performance of min_by and max_by

Issue - State: open - Opened by thirtiseven about 1 month ago
Labels: feature request, performance

#11412 - [FEA][Follow on] Improve performance of min_by and max_by

Issue - State: open - Opened by thirtiseven about 1 month ago
Labels: feature request, ? - Needs Triage, performance

#11411 - Fix asymmetric join crash when stream side is empty

Pull Request - State: closed - Opened by jlowe about 1 month ago - 2 comments
Labels: bug

#11410 - [DOC] updates download page in ghpage [skip ci]

Pull Request - State: closed - Opened by nvliyuan about 1 month ago - 1 comment
Labels: documentation

#11409 - Merge branch-24.08 into main [skip ci]

Pull Request - State: closed - Opened by nvauto about 1 month ago - 1 comment
Labels: build

#11408 - GDS Usage Discrepancy: cuDF Succeeds, Spark Write Fails

Issue - State: closed - Opened by kun429973 about 1 month ago - 4 comments

#11407 - To mitigate github degraded perf [databricks]

Pull Request - State: closed - Opened by pxLi about 1 month ago - 2 comments
Labels: build

#11406 - [auto-merge] branch-24.08 to branch-24.10 [skip ci] [bot]

Pull Request - State: closed - Opened by nvauto about 1 month ago - 1 comment

#11404 - [BUG] Issues found by Spark UT Framework of RapidsParquetRebaseDatetimeSuite

Issue - State: open - Opened by Feng-Jiang28 about 1 month ago
Labels: bug, ? - Needs Triage

#11403 - [BUG] Issues found by Spark UT Framework of RapidsParquetQuerySuite

Issue - State: open - Opened by Feng-Jiang28 about 1 month ago
Labels: bug, ? - Needs Triage

#11398 - [FEA] Support json_array_length

Issue - State: open - Opened by viadea about 1 month ago
Labels: feature request

#11395 - Fix a Pandas UDF slowness issue

Pull Request - State: closed - Opened by firestarman about 1 month ago - 3 comments
Labels: bug

#11394 - [TASK] cudf dropped python 3.9 support

Issue - State: closed - Opened by pxLi about 1 month ago - 1 comment
Labels: build, P0, cudf_dependency

#11392 - [AUDIT] Handle IgnoreNulls Expressions for Window Expressions

Issue - State: closed - Opened by razajafri about 1 month ago - 6 comments
Labels: bug, audit_4.0.0

#11386 - [FEA] move to multi-get_json_object for json_tuple

Issue - State: open - Opened by revans2 about 1 month ago
Labels: feature request, performance

#11380 - [BUG] Issues found by Spark UT Framework of RapidsParquetSchemaSuite

Issue - State: open - Opened by Feng-Jiang28 about 1 month ago
Labels: bug, ? - Needs Triage

#11379 - [BUG] Issues found by Spark UT Framework of RapidsParquetProtobufCompatibilitySuite

Issue - State: open - Opened by Feng-Jiang28 about 1 month ago
Labels: bug, ? - Needs Triage

#11378 - [BUG] Issues found by Spark UT Framework of RapidsParquetInteroperabilitySuite

Issue - State: open - Opened by Feng-Jiang28 about 1 month ago
Labels: bug, ? - Needs Triage

#11377 - [BUG] Issues found by Spark UT Framework on RapidsParquetCompressionCodecPrecedenceSuite

Issue - State: open - Opened by Feng-Jiang28 about 1 month ago
Labels: bug, ? - Needs Triage

#11376 - Create a PrioritySemaphore to back the GpuSemaphore

Pull Request - State: closed - Opened by zpuller about 1 month ago - 6 comments

#11371 - Support MinBy and MaxBy for non-float ordering

Pull Request - State: closed - Opened by thirtiseven about 1 month ago - 7 comments
Labels: feature request

#11366 - Enable parquet suites from Spark UT

Pull Request - State: closed - Opened by Feng-Jiang28 about 1 month ago - 15 comments
Labels: test

#11360 - [DOC] updates gh-pages docs for 24.08.0 release [skip ci]

Pull Request - State: closed - Opened by nvliyuan about 2 months ago
Labels: documentation

#11348 - Update `GpuJsonToStructs` to use the new JNI kernel when the input schema is `StructType`

Pull Request - State: open - Opened by ttnghia about 2 months ago
Labels: SQL, P0

#11345 - [FEA] file reads in a background thread with flow control.

Issue - State: open - Opened by revans2 about 2 months ago
Labels: feature request, performance

#11344 - [FEA] shuffle reads with flow control in a background thread

Issue - State: open - Opened by revans2 about 2 months ago
Labels: feature request, performance

#11343 - [FEA] triple buffering/pipelineing for SQL

Issue - State: open - Opened by revans2 about 2 months ago
Labels: feature request, performance, epic

#11341 - [FEA] Write shuffle data in a background thread with flow control

Issue - State: open - Opened by revans2 about 2 months ago
Labels: feature request, performance

#11331 - Add companion metrics for all nsTiming metrics without semaphore

Pull Request - State: closed - Opened by binmahone about 2 months ago - 4 comments
Labels: task

#11326 - [BUG] Dynamic partitions metric for insert into hive appears to be off

Issue - State: open - Opened by revans2 about 2 months ago - 4 comments
Labels: bug

#11308 - Dynamic Shim Detection for `build` Process [databricks]

Pull Request - State: closed - Opened by razajafri about 2 months ago - 16 comments
Labels: build

#11304 - Update changelog for v24.08.0 release [skip ci]

Pull Request - State: closed - Opened by NvTimLiu about 2 months ago - 7 comments
Labels: documentation

#11297 - [FEA] Enable Parquet DataSource tests in Spark UT on Spark 3.3.0

Issue - State: closed - Opened by GaryShen2008 about 2 months ago - 1 comment
Labels: feature request

#11281 - Consider releasing the GPU semaphore earlier during shuffle partitioning

Issue - State: open - Opened by jlowe 2 months ago - 2 comments
Labels: performance

#11249 - Release Checklist v24.10

Issue - State: open - Opened by caryr35 2 months ago
Labels: documentation