Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / apache/datafusion issues and pull requests

#10156 - September 2024 ASF Board Report

Issue - State: closed - Opened by alamb 5 months ago - 6 comments
Labels: enhancement

#8932 - Change Expr `PartialOrd` to not rely on comparing hash values

Issue - State: open - Opened by alamb 8 months ago - 6 comments
Labels: bug, help wanted

#8051 - Specialized / Pre-compiled / Prepared ScalarUDFs

Issue - State: open - Opened by alamb 11 months ago - 17 comments
Labels: enhancement

#7845 - Any plan to support JSON or JSONB?

Issue - State: open - Opened by dojiong 12 months ago - 35 comments
Labels: enhancement

#7690 - avro_to_arrow: Support in memory apache_avro Value's

Issue - State: open - Opened by Samrose-Ahmed almost 1 year ago - 3 comments
Labels: enhancement

#7688 - Eliminate filter when `pushdown_filters` is enabled

Issue - State: closed - Opened by Dandandan almost 1 year ago - 1 comment
Labels: enhancement, performance

#7400 - feat: Support spilling for hash aggregation

Pull Request - State: closed - Opened by kazuyukitanimura about 1 year ago - 19 comments
Labels: enhancement, physical-expr, core

#7317 - Allowing setting sort order of parquet files without specifying the schema

Issue - State: open - Opened by alamb about 1 year ago - 13 comments
Labels: enhancement, good first issue

#6906 - Implement fast min/max accumulator for binary / strings (now it uses the slower path)

Issue - State: open - Opened by alamb about 1 year ago - 31 comments
Labels: enhancement

#5725 - Depend on Arrow Subcrates

Issue - State: closed - Opened by tustvold over 1 year ago - 3 comments
Labels: enhancement, good first issue, help wanted

#5647 - Support Grouping functions with Group By CUBE/ROLLUP/GROUPING SETS

Issue - State: open - Opened by mingmwang over 1 year ago - 6 comments
Labels: enhancement

#5034 - Error during physical planning when joining to subquery with count distinct aggregate

Issue - State: closed - Opened by jonmmease over 1 year ago - 6 comments
Labels: bug

#5004 - `LogicalPlan.schema()` returns incorrect schema for `CreateMemoryTable` and `CreateView`

Issue - State: closed - Opened by charlesbluca over 1 year ago - 2 comments
Labels: bug, good first issue

#4850 - Support `select .. from 'data.parquet'` files in SQL from any `SessionContext` (optionally)

Issue - State: closed - Opened by alamb over 1 year ago - 25 comments
Labels: enhancement, good first issue

#4414 - Add support for physical operators serde

Issue - State: closed - Opened by Kikkon almost 2 years ago - 1 comment

#4028 - Return TableProviderFilterPushDown::Exact when Parquet Pushdown Enabled

Issue - State: closed - Opened by tustvold almost 2 years ago - 14 comments
Labels: enhancement, performance

#3463 - Enable parquet filter pushdown by default

Issue - State: open - Opened by alamb about 2 years ago - 14 comments

#3174 - Bug with csv type inference

Issue - State: open - Opened by andygrove about 2 years ago - 4 comments
Labels: bug, good first issue

#825 - Add documentation for support for skipping Parquet row groups

Issue - State: open - Opened by andygrove about 3 years ago - 5 comments
Labels: documentation, enhancement, good first issue, devrel

#208 - Add provider for user defined function

Issue - State: closed - Opened by alamb over 3 years ago - 1 comment

#207 - Enable all clippy lints

Issue - State: closed - Opened by alamb over 3 years ago - 4 comments
Labels: good first issue, help wanted

#134 - Implement better tests for ParquetExec

Issue - State: closed - Opened by alamb over 3 years ago - 5 comments
Labels: good first issue, help wanted, datafusion

#100 - [Rust] Verify that projection push down does not remove aliases columns

Issue - State: open - Opened by alamb over 3 years ago - 1 comment
Labels: datafusion

#99 - Implement modulus expression

Issue - State: closed - Opened by alamb over 3 years ago - 1 comment
Labels: datafusion

#98 - [Rust] Add constant folding to expressions during logically planning

Issue - State: closed - Opened by alamb over 3 years ago - 3 comments
Labels: datafusion

#97 - [Rust] DataFrame.collect should return RecordBatchReader

Issue - State: open - Opened by alamb over 3 years ago - 3 comments
Labels: datafusion

#96 - Add FORMAT to explain plan and an easy to visualize format

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: datafusion

#95 - [Rust] Implement metrics framework

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: datafusion

#94 - [Rust] Implement micro benchmarks for each operator

Issue - State: open - Opened by alamb over 3 years ago - 8 comments
Labels: good first issue, help wanted, datafusion

#93 - [Rust] Implement pretty print for physical query plan

Issue - State: closed - Opened by alamb over 3 years ago - 3 comments
Labels: datafusion

#92 - Physical plan refactor to support optimization rules and more efficient use of threads

Issue - State: open - Opened by alamb over 3 years ago - 2 comments
Labels: help wanted, datafusion

#91 - Can not group by boolean columns (add boolean to valid keys of groupBy)

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: datafusion

#90 - improve performance of building literal arrays

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: datafusion

#89 - [rust][datafusion] optimize count(*) queries on parquet sources

Issue - State: closed - Opened by alamb over 3 years ago - 3 comments
Labels: datafusion

#88 - Improve like/nlike performance

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: datafusion

#87 - Add better and faster support for dictionary types

Issue - State: open - Opened by alamb over 3 years ago - 1 comment
Labels: datafusion

#86 - [Rust] Implement optimizer rule to remove redundant projections

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: datafusion

#85 - [Rust][Datafusion] Add datafusion-cli to the docker-compose setup

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: good first issue, help wanted, datafusion

#84 - Add Squirtle to list of projects using DataFusion

Issue - State: closed - Opened by andygrove over 3 years ago - 1 comment
Labels: documentation, good first issue

#83 - [Rust] Parquet data source does not support complex types

Issue - State: closed - Opened by alamb over 3 years ago - 12 comments
Labels: datafusion

#82 - [DataFusion] Add validation for unreferenced table in query

Issue - State: closed - Opened by alamb over 3 years ago - 2 comments
Labels: enhancement, good first issue, help wanted

#81 - Add support for explicit casts between signed and unsigned ints

Issue - State: closed - Opened by alamb over 3 years ago - 1 comment
Labels: datafusion

#80 - Use released version of arrow

Issue - State: closed - Opened by tustvold over 3 years ago - 7 comments
Labels: enhancement

#79 - Deduplicate README.md

Pull Request - State: closed - Opened by msathis over 3 years ago - 2 comments
Labels: documentation, datafusion

#78 - Address performance/execution plan of TPCH query 19

Issue - State: closed - Opened by Dandandan over 3 years ago - 6 comments
Labels: bug

#77 - Address performance/execution plan of TPCH query 9

Issue - State: closed - Opened by Dandandan over 3 years ago - 2 comments
Labels: bug

#76 - Make external hostname in executor optional

Issue - State: closed - Opened by edrevo over 3 years ago
Labels: enhancement

#75 - Remove namespace from executors

Pull Request - State: closed - Opened by edrevo over 3 years ago - 3 comments
Labels: bug, api change

#74 - Fix tpch-gen

Pull Request - State: closed - Opened by edrevo over 3 years ago - 1 comment
Labels: development-process

#73 - tpch-gen is broken

Issue - State: closed - Opened by edrevo over 3 years ago
Labels: bug

#72 - Update link in Ballista donation blog post

Issue - State: closed - Opened by andygrove over 3 years ago - 2 comments
Labels: bug

#71 - Deduplicate `README.md` files in root and datafusion directory

Issue - State: closed - Opened by Dandandan over 3 years ago - 2 comments
Labels: bug, good first issue

#70 - wasm target

Issue - State: closed - Opened by alippai over 3 years ago
Labels: enhancement

#69 - Add datafusion-python

Pull Request - State: closed - Opened by jorgecarleitao over 3 years ago - 14 comments

#68 - Experimenting with arrow2

Pull Request - State: closed - Opened by jorgecarleitao over 3 years ago - 70 comments
Labels: documentation, datafusion, sql

#67 - DataFusion logo needs a white background

Issue - State: closed - Opened by andygrove over 3 years ago - 1 comment
Labels: bug, documentation, good first issue

#66 - Remove unnecessary references to namespace in executor

Issue - State: closed - Opened by andygrove over 3 years ago - 1 comment
Labels: enhancement

#64 - Integrate Ballista scheduler with DataFusion

Issue - State: closed - Opened by andygrove over 3 years ago - 1 comment
Labels: enhancement

#63 - Implement scalable distributed joins

Issue - State: closed - Opened by andygrove over 3 years ago - 3 comments
Labels: enhancement

#62 - DataFrame.collect() should be extensible

Issue - State: closed - Opened by andygrove over 3 years ago - 2 comments
Labels: enhancement

#61 - MINOR: Remove empty rust dir

Pull Request - State: closed - Opened by andygrove over 3 years ago - 1 comment
Labels: datafusion

#60 - Add example of distributed query execution with DataFusion/Ballista

Issue - State: closed - Opened by andygrove over 3 years ago - 1 comment
Labels: enhancement

#59 - Add query 19 to TPC-H regression tests

Pull Request - State: closed - Opened by Dandandan over 3 years ago - 1 comment
Labels: enhancement

#58 - Add TPC-H query 19 to regression test

Issue - State: closed - Opened by Dandandan over 3 years ago
Labels: enhancement

#57 - Support JOIN table alias

Issue - State: closed - Opened by houqp over 3 years ago - 1 comment
Labels: enhancement

#56 - Support column qualifer in queries

Issue - State: closed - Opened by houqp over 3 years ago
Labels: enhancement

#55 - Support qualified columns in queries

Pull Request - State: closed - Opened by houqp over 3 years ago - 39 comments
Labels: enhancement, datafusion, api change

#54 - Read CSV format text from stdin or memory

Pull Request - State: closed - Opened by heymind over 3 years ago - 2 comments
Labels: enhancement, datafusion, api change

#53 - Read CSV format text from stdin or memory

Pull Request - State: closed - Opened by heymind over 3 years ago

#52 - Use arrow eq kernels in CaseWhen expression evaluation

Pull Request - State: closed - Opened by Dandandan over 3 years ago - 1 comment
Labels: enhancement, datafusion

#51 - Use arrow eq kernels in CaseWhen expression evaluation

Issue - State: closed - Opened by Dandandan over 3 years ago
Labels: enhancement

#50 - Vectorize hash join collision check

Issue - State: closed - Opened by Dandandan over 3 years ago - 4 comments
Labels: enhancement, performance

#49 - [Ballista] Remove hard coded ballista versions in scripts

Pull Request - State: closed - Opened by msathis over 3 years ago - 1 comment
Labels: development-process

#48 - Remove Ballista DataFrame

Pull Request - State: closed - Opened by andygrove over 3 years ago - 2 comments
Labels: api change

#47 - DataFrame.collect() should return async stream rather than a Vec<RecordBatch>

Issue - State: closed - Opened by andygrove over 3 years ago - 2 comments
Labels: enhancement, good first issue

#45 - Add Dockerfile for Ballista scheduler UI

Issue - State: closed - Opened by andygrove over 3 years ago
Labels: enhancement

#44 - Optimize hash join inner workings

Issue - State: closed - Opened by Dandandan over 3 years ago
Labels: enhancement

#43 - Define data store path for `standalone` mode

Issue - State: closed - Opened by kination over 3 years ago
Labels: enhancement

#42 - Add option param for standalone mode

Pull Request - State: closed - Opened by kination over 3 years ago - 3 comments
Labels: enhancement

#41 - Support hash repartion elimination

Issue - State: closed - Opened by Dandandan over 3 years ago
Labels: enhancement

#40 - Support join pruning optimization

Issue - State: closed - Opened by Dandandan over 3 years ago
Labels: enhancement

#39 - Re-export Arrow and Parquet crates from DataFusion

Pull Request - State: closed - Opened by returnString over 3 years ago - 3 comments
Labels: enhancement, datafusion