An open API service for providing issue and pull request metadata for open source projects.

GitHub / pandas-dev/pandas issues and pull requests

Labelled with: Arrow

#62657 - FIX: itemsize wrong for date32[day][pyarrow] dtype #57948

Pull Request - State: open - Opened by mraabhijit about 1 month ago - 3 comments
Labels: Arrow

#62656 - BUG: fix regex numeric group replacement in PyArrow string arrays

Pull Request - State: open - Opened by zishan044 about 1 month ago - 1 comment
Labels: Bug, Arrow

#62611 - BUG: Drop duplicates bug when using pyarrow (version 21.0.0) backend

Issue - State: open - Opened by flori-ko about 2 months ago - 8 comments
Labels: Bug, duplicated, Closing Candidate, Arrow

#62520 - BUG: Regression in Series.pow with all-NA `double[pyarrow]` values in pandas 3.x

Issue - State: closed - Opened by TomAugspurger about 2 months ago - 7 comments
Labels: Bug, Regression, Numeric Operations, good first issue, Arrow

#62462 - BUG: cannot use index to set values in `.iloc` for `Int64[pyarrow]` Series

Issue - State: closed - Opened by MarcoGorelli about 2 months ago - 2 comments
Labels: Bug, Indexing, Regression, Arrow

#62410 - BUG: fix bug in str.match for Arrow backend with optional groups

Pull Request - State: closed - Opened by jorisvandenbossche 2 months ago
Labels: Bug, Strings, Arrow

#62328 - Backport PR #62283 on branch 2.3.x (BUG: fix pyarrow string regex replacement)

Pull Request - State: closed - Opened by meeseeksmachine 2 months ago
Labels: Bug, Strings, Arrow

#62290 - BUG: Fix incorrect FutureWarning for logical ops on pyarrow bool Series (#62260)

Pull Request - State: closed - Opened by Tarun2605 3 months ago - 14 comments
Labels: Bug, Numeric Operations, Arrow

#61964 - BUG: fix Series.str.fullmatch() and Series.str.match() with a compiled regex failing with arrow strings

Pull Request - State: closed - Opened by khemkaran10 4 months ago - 2 comments
Labels: Strings, Arrow

#61948 - BUG: Replacement fails after NA value with PyArrow-backed strings

Issue - State: closed - Opened by rhshadrach 4 months ago - 1 comment
Labels: Bug, replace, Arrow

#61946 - BUG: Fix Series.str.contains with compiled regex on Arrow string dtype

Pull Request - State: closed - Opened by Aniketsy 4 months ago - 8 comments
Labels: Bug, Strings, Arrow

#61942 - BUG: Using `Series.str.contains()` with a compiled regex and arrow strings fails

Issue - State: closed - Opened by Dr-Irv 4 months ago
Labels: Bug, Strings, Arrow

#61926 - BUG: Merge fails on pyarrow datetime columns

Issue - State: closed - Opened by Meet1995 4 months ago - 14 comments
Labels: Bug, Datetime, Reshaping, good first issue, Arrow

#61816 - BUG: DataFrame.aggregate to preserve extension dtypes with callable functions

Pull Request - State: closed - Opened by AdrianoCLeao 5 months ago - 5 comments
Labels: Bug, Stale, Arrow, pyarrow dtype retention

#61799 - TST: update expected dtype for sum of decimals with pyarrow 21+

Pull Request - State: closed - Opened by jorisvandenbossche 5 months ago - 1 comment
Labels: Testing, Arrow

#61799 - TST: update expected dtype for sum of decimals with pyarrow 21+

Pull Request - State: closed - Opened by jorisvandenbossche 5 months ago - 3 comments
Labels: Testing, Arrow

#61786 - PERF: avoid object-dtype path in ArrowEA._explode

Pull Request - State: closed - Opened by jbrockmendel 5 months ago
Labels: Arrow

#61786 - PERF: avoid object-dtype path in ArrowEA._explode

Pull Request - State: closed - Opened by jbrockmendel 5 months ago - 1 comment
Labels: Arrow

#61782 - BUG: Errors using pyarrow datetime types on windows

Issue - State: open - Opened by Liam3851 5 months ago - 1 comment
Labels: Bug, Docs, Datetime, Windows, Upstream issue, Arrow

#61780 - BUG: tz_localize(None) with Arrow timestamp

Issue - State: open - Opened by jbrockmendel 5 months ago
Labels: Bug, Datetime, Timezones, Needs Triage, Arrow

#61773 - BUG: Decimal(NaN) incorrectly allowed in ArrowEA constructor with tim…

Pull Request - State: closed - Opened by jbrockmendel 5 months ago - 2 comments
Labels: Arrow

#61747 - PERF: Arrow dtypes are much slower than Numpy for DataFrame.apply

Issue - State: open - Opened by ehsantn 5 months ago - 8 comments
Labels: Performance, Apply, Arrow

#61723 - DEPS: bump pyarrow minimum version from 10.0 to 12.0

Pull Request - State: closed - Opened by jorisvandenbossche 5 months ago - 3 comments
Labels: Dependencies, Arrow

#61723 - DEPS: bump pyarrow minimum version from 10.0 to 12.0

Pull Request - State: open - Opened by jorisvandenbossche 5 months ago
Labels: Dependencies, Arrow

#61659 - BUG: to_numeric fails to convert a Pyarrow Decimal series containing NA values

Pull Request - State: open - Opened by chilin0525 5 months ago
Labels: Bug, Dtype Conversions, Arrow

#61659 - BUG: to_numeric fails to convert a Pyarrow Decimal series containing NA values

Pull Request - State: closed - Opened by chilin0525 5 months ago - 2 comments
Labels: Bug, Dtype Conversions, Stale, Arrow

#61644 - BUG: Add PyArrow datelike type support for `map()`

Pull Request - State: closed - Opened by KevsterAmp 5 months ago - 8 comments
Labels: Bug, Apply, Arrow

#61644 - BUG: Add PyArrow datelike type support for `map()`

Pull Request - State: open - Opened by KevsterAmp 5 months ago - 6 comments
Labels: Bug, Apply, Arrow

#61640 - BUG: Fix GroupBy aggregate coersion of outputs inconsistency for pyarrow dtypes

Pull Request - State: closed - Opened by heoh 6 months ago - 4 comments
Labels: Bug, Groupby, Stale, Arrow

#61640 - BUG: Fix GroupBy aggregate coersion of outputs inconsistency for pyarrow dtypes

Pull Request - State: open - Opened by heoh 6 months ago
Labels: Bug, Groupby, Arrow

#61636 - BUG: Groupby aggregate coersion of outputs inconsistency for pyarrow dtypes

Issue - State: closed - Opened by AndrejIring 6 months ago - 2 comments
Labels: Bug, Groupby, Arrow

#61496 - BUG: Passing string[pyarrow] to the dtype parameter of e.g. csv_read() does produce a string type Series

Issue - State: open - Opened by ClauPet 6 months ago - 3 comments
Labels: Dtype Conversions, IO CSV, Strings, Needs Discussion, Arrow

#61485 - BUG: zfill with pyarrow string

Issue - State: closed - Opened by williambdean 6 months ago - 3 comments
Labels: Bug, Strings, Arrow

#61466 - BUG: Series.str.isdigit with pyarrow dtype doesn't honor unicode superscripts

Issue - State: open - Opened by GarrettWu 6 months ago - 6 comments
Labels: Bug, Strings, Needs Discussion, Upstream issue, Arrow

#61376 - BUG: Series.dot for arrow and nullable dtypes returns object-dtyped series

Pull Request - State: closed - Opened by theavey 7 months ago
Labels: Numeric Operations, Arrow

#61376 - BUG: Series.dot for arrow and nullable dtypes returns object-dtyped series

Pull Request - State: closed - Opened by theavey 7 months ago - 1 comment
Labels: Numeric Operations, Arrow

#61322 - BUG: memory issues with `string[pyarrow]` after sorted `pd.merge`

Issue - State: open - Opened by noahblakesmith 7 months ago - 5 comments
Labels: Bug, Closing Candidate, Upstream issue, Arrow

#61316 - BUG: `.to_parquet` fails with `schema` for `string[pyarrow]` but not `string[python]`

Issue - State: open - Opened by noahblakesmith 7 months ago - 1 comment
Labels: Bug, Closing Candidate, Upstream issue, Arrow

#61236 - BUG: Pyarrow timestamp support for map() function

Pull Request - State: closed - Opened by arthurlw 8 months ago - 2 comments
Labels: Stale, Arrow

#61231 - BUG: PyArrow timestamp type does not work with map() function

Issue - State: open - Opened by dbalabka 8 months ago - 3 comments
Labels: Bug, Apply, Arrow

#61229 - BUG: Fix #61222: Keep index name when resampling with pyarrow dtype

Pull Request - State: closed - Opened by mthiboust 8 months ago - 2 comments
Labels: Resample, Arrow

#61222 - BUG: Index name lost when using "resample" with pyarrow dtypes

Issue - State: closed - Opened by mthiboust 8 months ago - 2 comments
Labels: Bug, Resample, Arrow

#61193 - BUG: Fix pyarrow categoricals not working for pivot and multiindex

Pull Request - State: closed - Opened by robin-mader-bis 8 months ago - 2 comments
Labels: Reshaping, Arrow

#61193 - BUG: Fix pyarrow categoricals not working for pivot and multiindex

Pull Request - State: closed - Opened by robin-mader-bis 8 months ago - 3 comments
Labels: Reshaping, Arrow

#61144 - BUG(?): rolling sum with pyarrow types results in float64 instead of preserving integer type

Issue - State: open - Opened by MarcoGorelli 8 months ago - 4 comments
Labels: Bug, Dtype Conversions, Window, Arrow

#61105 - BUG: DataFrame.explode doesn't work for pyarrow.large_list type

Pull Request - State: closed - Opened by snitish 9 months ago - 1 comment
Labels: Arrow

#61105 - BUG: DataFrame.explode doesn't work for pyarrow.large_list type

Pull Request - State: closed - Opened by snitish 9 months ago
Labels: Arrow

#61103 - Enh arrow json extension

Pull Request - State: closed - Opened by asharmalik19 9 months ago - 3 comments
Labels: Arrow

#61103 - Enh arrow json extension

Pull Request - State: closed - Opened by asharmalik19 9 months ago
Labels: Arrow

#61102 - Backport PR #61098: CI/TST: Address TestArrowArray::test_reduce_series_numeric supporting skew

Pull Request - State: closed - Opened by mroeschke 9 months ago
Labels: Testing, Arrow

#61098 - CI/TST: Address TestArrowArray::test_reduce_series_numeric supporting skew

Pull Request - State: closed - Opened by mroeschke 9 months ago - 1 comment
Labels: Testing, Arrow

#61098 - CI/TST: Address TestArrowArray::test_reduce_series_numeric supporting skew

Pull Request - State: closed - Opened by mroeschke 9 months ago
Labels: Testing, Arrow

#61091 - BUG: DataFrame.explode doesn't explode when using pyarrow large list type

Issue - State: closed - Opened by MarcoGorelli 9 months ago - 1 comment
Labels: Bug, Reshaping, Arrow

#61073 - BUG: fix bug in str.fullmatch for Arrow backend with optional groups

Pull Request - State: closed - Opened by ptth222 9 months ago - 7 comments
Labels: Strings, Arrow

#61072 - BUG: str.fullmatch behavior is not the same for object dtype and string[pyarrow] dtype

Issue - State: closed - Opened by ptth222 9 months ago - 2 comments
Labels: Bug, Strings, Arrow

#60997 - BUG: fix read_json ignoring the dtype with the pyarrow engine

Pull Request - State: closed - Opened by will-larkin 9 months ago - 4 comments
Labels: IO JSON, Arrow

#60997 - BUG: fix read_json ignoring the dtype with the pyarrow engine

Pull Request - State: closed - Opened by will-larkin 9 months ago - 3 comments
Labels: IO JSON, Arrow

#60958 - ENH: Support `pa.json_` in arrow extension type

Issue - State: closed - Opened by chelsea-lin 9 months ago - 5 comments
Labels: Enhancement, Arrow

#60949 - Fix error `value_counts` result with pyarrow categorical columns

Pull Request - State: closed - Opened by chilin0525 9 months ago - 1 comment
Labels: Categorical, Arrow

#60838 - BUG: Unknown Error - Getting from Databricks SQL Python - From PyArrow module (pyarrow.lib.ArrowException)

Issue - State: open - Opened by Gobi2511 10 months ago - 1 comment
Labels: Bug, IO SQL, Needs Info, Arrow

#60778 - PERF: Avoid a numpy array copy in ArrowExtensionArray._to_datetimearray

Pull Request - State: open - Opened by mroeschke 10 months ago
Labels: Performance, Arrow

#60778 - PERF: Avoid a numpy array copy in ArrowExtensionArray._to_datetimearray

Pull Request - State: closed - Opened by mroeschke 10 months ago - 1 comment
Labels: Performance, Arrow

#60777 - Fix arrow groupby na

Pull Request - State: closed - Opened by asharmalik19 10 months ago - 1 comment
Labels: Arrow

#60777 - Fix arrow groupby na

Pull Request - State: closed - Opened by asharmalik19 10 months ago - 2 comments
Labels: Arrow

#60729 - BUG: Series.mask incorrectly replaces positions of pd.NA in the cond argument

Issue - State: open - Opened by kartoria 10 months ago - 7 comments
Labels: Bug, Missing-data, NA - MaskedArrays, Arrow

#60716 - Update PyArrow conversion and arrow/parquet tests for pyarrow 19.0

Pull Request - State: closed - Opened by jorisvandenbossche 11 months ago - 2 comments
Labels: Compat, IO Parquet, Arrow

#60716 - Update PyArrow conversion and arrow/parquet tests for pyarrow 19.0

Pull Request - State: closed - Opened by jorisvandenbossche 11 months ago - 4 comments
Labels: Compat, IO Parquet, Arrow

#60633 - ENH: Implement cum* methods for PyArrow strings

Pull Request - State: open - Opened by rhshadrach 11 months ago
Labels: Enhancement, Strings, Arrow, Transformations

#60633 - ENH: Implement cum* methods for PyArrow strings

Pull Request - State: closed - Opened by rhshadrach 11 months ago - 5 comments
Labels: Enhancement, Strings, Arrow, Transformations

#60569 - BUG: value_counts() check Index with PyArrow categorical columns

Pull Request - State: closed - Opened by NOBODIDI 12 months ago - 2 comments
Labels: Bug, Algos, Categorical, Stale, Arrow

#60567 - BUG: groupby with dropna=False and pa.dictionary drops NA values

Issue - State: open - Opened by rhshadrach 12 months ago
Labels: Groupby, Missing-data, Arrow

#60544 - CI/TST: Use tm.external_error_raised for test_from_arrow_respecting_given_dtype_unsafe

Pull Request - State: closed - Opened by mroeschke 12 months ago - 1 comment
Labels: Testing, Arrow

#60527 - BUG: Fix `ListAccessor` methods to preserve original name

Pull Request - State: closed - Opened by FBruzzesi 12 months ago - 1 comment
Labels: Series, Arrow

#60527 - BUG: Fix `ListAccessor` methods to preserve original name

Pull Request - State: closed - Opened by FBruzzesi 12 months ago - 2 comments
Labels: Series, Arrow

#60522 - BUG: `ListAccessor` does not preserve series name

Issue - State: closed - Opened by FBruzzesi 12 months ago - 2 comments
Labels: Bug, Series, Arrow

#60324 - REF: centralize pyarrow Table to pandas conversions and types_mapper handling

Pull Request - State: closed - Opened by jorisvandenbossche about 1 year ago
Labels: Arrow

#60286 - TST: Add test for `pd.read_csv` date parsing not working with `dtype_backend="pyarrow"` and missing values

Pull Request - State: closed - Opened by KevsterAmp about 1 year ago - 1 comment
Labels: Testing, Missing-data, IO CSV, Arrow

#60286 - TST: Add test for `pd.read_csv` date parsing not working with `dtype_backend="pyarrow"` and missing values

Pull Request - State: closed - Opened by KevsterAmp about 1 year ago - 2 comments
Labels: Testing, Missing-data, IO CSV, Arrow

#60248 - Backport PR #60241 on branch 2.3.x (TST (string dtype): resolve xfail in arrow interface tests)

Pull Request - State: closed - Opened by meeseeksmachine about 1 year ago
Labels: Strings, Arrow

#60241 - TST (string dtype): resolve xfail in arrow interface tests

Pull Request - State: closed - Opened by jorisvandenbossche about 1 year ago
Labels: Strings, Arrow

#60235 - ENH (string dtype): convert string_view columns to future string dtype instead of object dtype in Parquet/Feather IO

Pull Request - State: closed - Opened by jorisvandenbossche about 1 year ago - 3 comments
Labels: Strings, IO Parquet, Arrow

#60222 - ENH (string dtype): accept string_view in addition to string/large_string for ArrowStringArray input

Pull Request - State: closed - Opened by jorisvandenbossche about 1 year ago - 1 comment
Labels: Strings, Arrow

#60105 - Fix BUG: read_sql tries to convert blob/varbinary to string with pyarrow backend

Pull Request - State: closed - Opened by kastkeepitjumpinlikekangaroos about 1 year ago - 6 comments
Labels: IO SQL, Stale, Arrow

#59904 - BUG: pd.read_csv date parsing not working with dtype_backend="pyarrow" and missing values

Issue - State: closed - Opened by mhabets about 1 year ago - 12 comments
Labels: Bug, Missing-data, IO CSV, good first issue, Needs Tests, Arrow

#59780 - Consider moving pyarrow's pandas compatibility and conversion code to the pandas project?

Issue - State: open - Opened by jorisvandenbossche about 1 year ago
Labels: Needs Discussion, Arrow

#59771 - ENH: ArrowTemporalProperties' object has no attribute 'to_period'

Issue - State: open - Opened by dbalabka about 1 year ago - 4 comments
Labels: Enhancement, Datetime, Needs Discussion, Arrow

#59768 - BUG/API (string dtype): return float dtype for series[str].rank()

Pull Request - State: closed - Opened by jorisvandenbossche about 1 year ago - 2 comments
Labels: Strings, Arrow, Transformations, backported