GitHub / apache/datafusion-python issues and pull requests
#1287 - Preserve PyArrow extension metadata when chaining Python scalar UDFs
Pull Request -
State: open - Opened by kosiew 4 months ago
#1264 - feat: expose DataFrame.write_table
Pull Request -
State: open - Opened by timsaucer 4 months ago
#1263 - build(deps): bump datafusion-substrait from 50.0.0 to 50.1.0
Pull Request -
State: open - Opened by dependabot[bot] 4 months ago
Labels: dependencies, rust
#1262 - build(deps): bump datafusion from 50.0.0 to 50.1.0
Pull Request -
State: open - Opened by dependabot[bot] 4 months ago
Labels: dependencies, rust
#1261 - build(deps): bump datafusion-ffi from 50.0.0 to 50.1.0
Pull Request -
State: open - Opened by dependabot[bot] 4 months ago
Labels: dependencies, rust
#1260 - build(deps): bump datafusion-proto from 50.0.0 to 50.1.0
Pull Request -
State: open - Opened by dependabot[bot] 4 months ago
Labels: dependencies, rust
#1259 - macos-13 is deprecated
Pull Request -
State: closed - Opened by kevinjqliu 4 months ago
#1257 - Investigate creating progress indicator
Issue -
State: open - Opened by timsaucer 4 months ago
Labels: enhancement
#1256 - Unify Table representations
Pull Request -
State: open - Opened by timsaucer 4 months ago
#1254 - build(deps): bump object_store from 0.12.3 to 0.12.4
Pull Request -
State: open - Opened by dependabot[bot] 5 months ago
Labels: dependencies, rust
#1253 - Freeze PyO3 wrappers & introduce interior mutability to avoid PyO3 borrow errors
Pull Request -
State: closed - Opened by kosiew 5 months ago
- 1 comment
#1252 - Make most `pyclasses` frozen
Pull Request -
State: closed - Opened by ntjohnson1 5 months ago
- 4 comments
#1250 - Mark `pyclass`es frozen if possible.
Issue -
State: open - Opened by ntjohnson1 5 months ago
Labels: enhancement
#1249 - Update version number, add changelog
Pull Request -
State: closed - Opened by timsaucer 5 months ago
#1249 - Update version number, add changelog
Pull Request -
State: open - Opened by timsaucer 5 months ago
#1248 - Allow Python Threading Around Session Context
Pull Request -
State: open - Opened by ntjohnson1 5 months ago
#1248 - Make Session Context `pyclass` frozen so interior mutability is only managed by rust
Pull Request -
State: open - Opened by ntjohnson1 5 months ago
- 4 comments
#1247 - SessionContext: automatically register Python (Arrow/Pandas/Polars) objects referenced in SQL
Pull Request -
State: open - Opened by kosiew 5 months ago
#1246 - Allow SessionContext.read_table to accept objects exposing `__datafusion_table_provider__` (PyCapsule)
Pull Request -
State: open - Opened by kosiew 5 months ago
- 1 comment
#1246 - Allow SessionContext.read_table to accept objects exposing `__datafusion_table_provider__` (PyCapsule)
Pull Request -
State: open - Opened by kosiew 5 months ago
#1245 - SessionContext.read_table should take PyCapsule objects
Issue -
State: open - Opened by timsaucer 5 months ago
Labels: enhancement
#1244 - Support for automatic replacement scans
Issue -
State: open - Opened by kylebarron 5 months ago
Labels: enhancement
#1243 - Introduce Table wrapper, unify table registration via register_table; deprecate legacy APIs
Pull Request -
State: open - Opened by kosiew 5 months ago
- 1 comment
#1242 - Fix drop() method to handle quoted column names consistently
Pull Request -
State: closed - Opened by H0TB0X420 5 months ago
- 3 comments
#1242 - Fix drop() method to handle quoted column names consistently
Pull Request -
State: open - Opened by H0TB0X420 5 months ago
- 3 comments
#1241 - Add to_batches() and interpolate() methods to DataFrame
Pull Request -
State: open - Opened by H0TB0X420 5 months ago
#1240 - docs: add apache iceberg as datafusion data source
Pull Request -
State: closed - Opened by kevinjqliu 5 months ago
- 1 comment
#1239 - A single common PyTableProvider that can be created either via a pycapsule or into_view
Issue -
State: open - Opened by kosiew 5 months ago
#1235 - API Suggestion match casing for isnan and is_null
Issue -
State: open - Opened by ntjohnson1 5 months ago
Labels: enhancement
#1233 - Incorrect NaN Comparison
Issue -
State: open - Opened by ntjohnson1 5 months ago
Labels: bug
#1230 - build(deps): bump actions/setup-python from 5 to 6
Pull Request -
State: open - Opened by dependabot[bot] 5 months ago
Labels: dependencies, github_actions
#1229 - build(deps): bump log from 0.4.27 to 0.4.28
Pull Request -
State: open - Opened by dependabot[bot] 5 months ago
Labels: dependencies, rust
#1228 - build(deps): bump uuid from 1.18.0 to 1.18.1
Pull Request -
State: open - Opened by dependabot[bot] 5 months ago
Labels: dependencies, rust
#1227 - Remove pyarrow as required dependency, relying on Arrow PyCapsule Interface
Issue -
State: open - Opened by kylebarron 5 months ago
- 8 comments
Labels: enhancement
#1226 - `CatalogProvider` errors are badly mangled
Issue -
State: open - Opened by colinmarc 5 months ago
- 4 comments
Labels: bug
#1225 - docs: fix CaseBuilder documentation example
Pull Request -
State: closed - Opened by IndexSeek 5 months ago
#1224 - docs: update link to user example for custom table provider
Pull Request -
State: closed - Opened by IndexSeek 5 months ago
- 2 comments
#1223 - Invalid url example reference in documentation
Issue -
State: closed - Opened by IndexSeek 5 months ago
Labels: bug
#1222 - DRAFT Expose Arrow C stream and DataFrame iterator (zero‑copy streaming to PyArrow)
Pull Request -
State: open - Opened by kosiew 5 months ago
#1221 - Support string column identifiers for sort/aggregate/window and stricter Expr validation
Pull Request -
State: open - Opened by kosiew 5 months ago
#1220 - Fixing a few Typos
Pull Request -
State: open - Opened by ntjohnson1 5 months ago
#1219 - build(deps): bump mimalloc from 0.1.47 to 0.1.48
Pull Request -
State: open - Opened by dependabot[bot] 6 months ago
Labels: dependencies, rust
#1218 - Set fail on warning for documentation generation
Pull Request -
State: open - Opened by timsaucer 6 months ago
#1217 - "Fatal Python error: Bus error" when using datafusion 49
Issue -
State: open - Opened by kevinjqliu 6 months ago
Labels: bug
#1215 - feat: allow passing a slice to and expression with the [] indexing
Pull Request -
State: closed - Opened by timsaucer 6 months ago
#1215 - feat: allow passing a slice to and expression with the [] indexing
Pull Request -
State: open - Opened by timsaucer 6 months ago
#1214 - Potential bug(?): Inconsistent usage of column() / col() and literal() / lit()
Issue -
State: open - Opened by HeWhoHeWho 6 months ago
Labels: bug
#1213 - Update development guide in README.md
Pull Request -
State: open - Opened by YKoustubhRao 6 months ago
#1212 - Drop column syntax inconsistency
Issue -
State: open - Opened by HeWhoHeWho 6 months ago
Labels: bug
#1211 - 49.0.0 release
Pull Request -
State: closed - Opened by timsaucer 6 months ago
#1209 - datafusion-python pypi release behind datafusion core/rust crate
Issue -
State: open - Opened by kalona 6 months ago
#1208 - Improve `show()` output for empty DataFrames
Pull Request -
State: open - Opened by kosiew 6 months ago
#1206 - export to arrow generate OOM
Issue -
State: open - Opened by djouallah 6 months ago
Labels: bug
#1205 - build(deps): bump slab from 0.4.10 to 0.4.11
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
Labels: dependencies, rust
#1203 - build(deps): bump async-trait from 0.1.88 to 0.1.89
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
Labels: dependencies, rust
#1202 - build(deps): bump uuid from 1.17.0 to 1.18.0
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
Labels: dependencies, rust
#1201 - build(deps): bump actions/download-artifact from 4 to 5
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
- 1 comment
Labels: dependencies, github_actions
#1200 - chore: Upgrade datafusion to version 49
Pull Request -
State: open - Opened by nuno-faria 6 months ago
#1198 - build(deps): bump datafusion-ffi from 48.0.1 to 49.0.0
Pull Request -
State: open - Opened by dependabot[bot] 6 months ago
Labels: dependencies, rust
#1197 - build(deps): bump datafusion from 48.0.1 to 49.0.0
Pull Request -
State: open - Opened by dependabot[bot] 6 months ago
Labels: dependencies, rust
#1196 - build(deps): bump datafusion-proto from 48.0.1 to 49.0.0
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
- 1 comment
Labels: dependencies, rust
#1195 - build(deps): bump datafusion-substrait from 48.0.1 to 49.0.0
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
- 1 comment
Labels: dependencies, rust
#1193 - Separable Python and Rust components
Issue -
State: open - Opened by awhyte 7 months ago
Labels: enhancement
#1191 - build(deps): bump tokio from 1.46.1 to 1.47.0
Pull Request -
State: open - Opened by dependabot[bot] 7 months ago
Labels: dependencies, rust
#1190 - the python udaf example cannot print the result
Issue -
State: open - Opened by l1t1 7 months ago
Labels: bug
#1188 - Update CI rules
Pull Request -
State: open - Opened by timsaucer 7 months ago
#1187 - Improved window and aggregate function signature
Pull Request -
State: open - Opened by timsaucer 7 months ago
#1186 - Some group by query is 6~7x slower than DuckDB
Issue -
State: open - Opened by wegamekinglc 7 months ago
- 15 comments
#1185 - Add support for automatic join column deduplication in DataFrame joins
Pull Request -
State: closed - Opened by kosiew 7 months ago
- 2 comments
#1184 - feat: reduce duplicate fields on join
Pull Request -
State: open - Opened by timsaucer 7 months ago
Labels: api change
#1183 - Optional improvements in verification instructions
Pull Request -
State: open - Opened by paleolimbot 7 months ago
#1182 - Python test dependencies in release verification instructions are out of date
Issue -
State: open - Opened by paleolimbot 7 months ago
Labels: bug
#1181 - Serialize user defined functions and table providers via protobuf
Issue -
State: open - Opened by timsaucer 7 months ago
Labels: enhancement
#1180 - build(deps): bump tokio from 1.45.0 to 1.46.1
Pull Request -
State: closed - Opened by dependabot[bot] 7 months ago
- 1 comment
Labels: dependencies, rust
#1179 - Update how to dev instructions
Pull Request -
State: closed - Opened by ntjohnson1 7 months ago
#1179 - Update how to dev instructions
Pull Request -
State: open - Opened by ntjohnson1 7 months ago
#1178 - How To Develop Misses Edge cases
Issue -
State: open - Opened by ntjohnson1 7 months ago
Labels: bug
#1177 - Simplify HTML Formatter Style Handling Using Script Injection
Pull Request -
State: closed - Opened by kosiew 7 months ago
- 1 comment
#1176 - Typo in the manual
Issue -
State: closed - Opened by albertoRamon 7 months ago
Labels: bug
#1175 - 48.0.0 Release
Pull Request -
State: closed - Opened by timsaucer 7 months ago
- 1 comment
#1174 - build(deps): bump arrow from 55.1.0 to 55.2.0
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
- 1 comment
Labels: dependencies, rust
#1173 - Simplify Joins on Shared Column Name
Issue -
State: open - Opened by ntjohnson1 8 months ago
Labels: enhancement
#1172 - ScalarUDFs created using `datafusion.udf()` do not propagate extension type metadata
Issue -
State: open - Opened by paleolimbot 8 months ago
Labels: bug
#1171 - Add DOM-guarded CSS/JS injection to DataFrameHtmlFormatter to prevent duplicate style/script inserts
Issue -
State: closed - Opened by kosiew 8 months ago
#1170 - Feat/dataframe str formatter
Pull Request -
State: open - Opened by timsaucer 8 months ago
#1170 - feat: dataframe string formatter
Pull Request -
State: closed - Opened by timsaucer 8 months ago
#1169 - Add compression_level support to ParquetWriterOptions and enhance write_parquet to accept full options object
Pull Request -
State: open - Opened by kosiew 8 months ago
#1169 - Add compression_level support to ParquetWriterOptions and enhance write_parquet to accept full options object
Pull Request -
State: closed - Opened by kosiew 8 months ago
#1168 - Fix signature of `__arrow_c_stream__`
Pull Request -
State: open - Opened by kylebarron 8 months ago
#1168 - Fix signature of `__arrow_c_stream__`
Pull Request -
State: closed - Opened by kylebarron 8 months ago
#1167 - feat: collect once during display() in jupyter notebooks
Pull Request -
State: closed - Opened by timsaucer 8 months ago
- 4 comments
#1166 - Dataframe doesn't properly implement ArrowStream export interface
Issue -
State: open - Opened by johnnyg 8 months ago
Labels: bug
#1165 - build(deps): bump mimalloc from 0.1.46 to 0.1.47
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
- 1 comment
Labels: dependencies, rust
#1164 - build(deps): bump prost from 0.13.5 to 0.14.1
Pull Request -
State: open - Opened by dependabot[bot] 8 months ago
Labels: dependencies, rust