GitHub / pandas-dev/pandas issues and pull requests
Labelled with: Arrow
#62657 - FIX: itemsize wrong for date32[day][pyarrow] dtype #57948
Pull Request -
State: open - Opened by mraabhijit about 1 month ago
- 3 comments
Labels: Arrow
#62656 - BUG: fix regex numeric group replacement in PyArrow string arrays
Pull Request -
State: open - Opened by zishan044 about 1 month ago
- 1 comment
Labels: Bug, Arrow
#62611 - BUG: Drop duplicates bug when using pyarrow (version 21.0.0) backend
Issue -
State: open - Opened by flori-ko about 2 months ago
- 8 comments
Labels: Bug, duplicated, Closing Candidate, Arrow
#62520 - BUG: Regression in Series.pow with all-NA `double[pyarrow]` values in pandas 3.x
Issue -
State: closed - Opened by TomAugspurger about 2 months ago
- 7 comments
Labels: Bug, Regression, Numeric Operations, good first issue, Arrow
#62462 - BUG: cannot use index to set values in `.iloc` for `Int64[pyarrow]` Series
Issue -
State: closed - Opened by MarcoGorelli about 2 months ago
- 2 comments
Labels: Bug, Indexing, Regression, Arrow
#62412 - Backport PR #62410 on branch 2.3.x (BUG: fix bug in str.match for Arrow backend with optional groups)
Pull Request -
State: open - Opened by meeseeksmachine 2 months ago
Labels: Bug, Strings, Arrow
#62410 - BUG: fix bug in str.match for Arrow backend with optional groups
Pull Request -
State: closed - Opened by jorisvandenbossche 2 months ago
Labels: Bug, Strings, Arrow
#62401 - Backport PR #61073 on branch 2.3.x (BUG: fix bug in str.fullmatch for Arrow backend with optional groups)
Pull Request -
State: closed - Opened by meeseeksmachine 2 months ago
Labels: Strings, Arrow
#62328 - Backport PR #62283 on branch 2.3.x (BUG: fix pyarrow string regex replacement)
Pull Request -
State: closed - Opened by meeseeksmachine 2 months ago
Labels: Bug, Strings, Arrow
#62290 - BUG: Fix incorrect FutureWarning for logical ops on pyarrow bool Series (#62260)
Pull Request -
State: closed - Opened by Tarun2605 3 months ago
- 14 comments
Labels: Bug, Numeric Operations, Arrow
#61964 - BUG: fix Series.str.fullmatch() and Series.str.match() with a compiled regex failing with arrow strings
Pull Request -
State: closed - Opened by khemkaran10 4 months ago
- 2 comments
Labels: Strings, Arrow
#61948 - BUG: Replacement fails after NA value with PyArrow-backed strings
Issue -
State: closed - Opened by rhshadrach 4 months ago
- 1 comment
Labels: Bug, replace, Arrow
#61946 - BUG: Fix Series.str.contains with compiled regex on Arrow string dtype
Pull Request -
State: closed - Opened by Aniketsy 4 months ago
- 8 comments
Labels: Bug, Strings, Arrow
#61942 - BUG: Using `Series.str.contains()` with a compiled regex and arrow strings fails
Issue -
State: closed - Opened by Dr-Irv 4 months ago
Labels: Bug, Strings, Arrow
#61926 - BUG: Merge fails on pyarrow datetime columns
Issue -
State: closed - Opened by Meet1995 4 months ago
- 14 comments
Labels: Bug, Datetime, Reshaping, good first issue, Arrow
#61816 - BUG: DataFrame.aggregate to preserve extension dtypes with callable functions
Pull Request -
State: closed - Opened by AdrianoCLeao 5 months ago
- 5 comments
Labels: Bug, Stale, Arrow, pyarrow dtype retention
#61799 - TST: update expected dtype for sum of decimals with pyarrow 21+
Pull Request -
State: closed - Opened by jorisvandenbossche 5 months ago
- 1 comment
Labels: Testing, Arrow
#61799 - TST: update expected dtype for sum of decimals with pyarrow 21+
Pull Request -
State: closed - Opened by jorisvandenbossche 5 months ago
- 3 comments
Labels: Testing, Arrow
#61786 - PERF: avoid object-dtype path in ArrowEA._explode
Pull Request -
State: closed - Opened by jbrockmendel 5 months ago
Labels: Arrow
#61786 - PERF: avoid object-dtype path in ArrowEA._explode
Pull Request -
State: closed - Opened by jbrockmendel 5 months ago
- 1 comment
Labels: Arrow
#61782 - BUG: Errors using pyarrow datetime types on windows
Issue -
State: open - Opened by Liam3851 5 months ago
- 1 comment
Labels: Bug, Docs, Datetime, Windows, Upstream issue, Arrow
#61780 - BUG: tz_localize(None) with Arrow timestamp
Issue -
State: open - Opened by jbrockmendel 5 months ago
Labels: Bug, Datetime, Timezones, Needs Triage, Arrow
#61773 - BUG: Decimal(NaN) incorrectly allowed in ArrowEA constructor with tim…
Pull Request -
State: closed - Opened by jbrockmendel 5 months ago
- 2 comments
Labels: Arrow
#61747 - PERF: Arrow dtypes are much slower than Numpy for DataFrame.apply
Issue -
State: open - Opened by ehsantn 5 months ago
- 8 comments
Labels: Performance, Apply, Arrow
#61723 - DEPS: bump pyarrow minimum version from 10.0 to 12.0
Pull Request -
State: closed - Opened by jorisvandenbossche 5 months ago
- 3 comments
Labels: Dependencies, Arrow
#61723 - DEPS: bump pyarrow minimum version from 10.0 to 12.0
Pull Request -
State: open - Opened by jorisvandenbossche 5 months ago
Labels: Dependencies, Arrow
#61659 - BUG: to_numeric fails to convert a Pyarrow Decimal series containing NA values
Pull Request -
State: open - Opened by chilin0525 5 months ago
Labels: Bug, Dtype Conversions, Arrow
#61659 - BUG: to_numeric fails to convert a Pyarrow Decimal series containing NA values
Pull Request -
State: closed - Opened by chilin0525 5 months ago
- 2 comments
Labels: Bug, Dtype Conversions, Stale, Arrow
#61644 - BUG: Add PyArrow datelike type support for `map()`
Pull Request -
State: closed - Opened by KevsterAmp 5 months ago
- 8 comments
Labels: Bug, Apply, Arrow
#61644 - BUG: Add PyArrow datelike type support for `map()`
Pull Request -
State: open - Opened by KevsterAmp 5 months ago
- 6 comments
Labels: Bug, Apply, Arrow
#61640 - BUG: Fix GroupBy aggregate coersion of outputs inconsistency for pyarrow dtypes
Pull Request -
State: closed - Opened by heoh 6 months ago
- 4 comments
Labels: Bug, Groupby, Stale, Arrow
#61640 - BUG: Fix GroupBy aggregate coersion of outputs inconsistency for pyarrow dtypes
Pull Request -
State: open - Opened by heoh 6 months ago
Labels: Bug, Groupby, Arrow
#61636 - BUG: Groupby aggregate coersion of outputs inconsistency for pyarrow dtypes
Issue -
State: closed - Opened by AndrejIring 6 months ago
- 2 comments
Labels: Bug, Groupby, Arrow
#61496 - BUG: Passing string[pyarrow] to the dtype parameter of e.g. csv_read() does produce a string type Series
Issue -
State: open - Opened by ClauPet 6 months ago
- 3 comments
Labels: Dtype Conversions, IO CSV, Strings, Needs Discussion, Arrow
#61485 - BUG: zfill with pyarrow string
Issue -
State: closed - Opened by williambdean 6 months ago
- 3 comments
Labels: Bug, Strings, Arrow
#61466 - BUG: Series.str.isdigit with pyarrow dtype doesn't honor unicode superscripts
Issue -
State: open - Opened by GarrettWu 6 months ago
- 6 comments
Labels: Bug, Strings, Needs Discussion, Upstream issue, Arrow
#61376 - BUG: Series.dot for arrow and nullable dtypes returns object-dtyped series
Pull Request -
State: closed - Opened by theavey 7 months ago
Labels: Numeric Operations, Arrow
#61376 - BUG: Series.dot for arrow and nullable dtypes returns object-dtyped series
Pull Request -
State: closed - Opened by theavey 7 months ago
- 1 comment
Labels: Numeric Operations, Arrow
#61322 - BUG: memory issues with `string[pyarrow]` after sorted `pd.merge`
Issue -
State: open - Opened by noahblakesmith 7 months ago
- 5 comments
Labels: Bug, Closing Candidate, Upstream issue, Arrow
#61316 - BUG: `.to_parquet` fails with `schema` for `string[pyarrow]` but not `string[python]`
Issue -
State: open - Opened by noahblakesmith 7 months ago
- 1 comment
Labels: Bug, Closing Candidate, Upstream issue, Arrow
#61236 - BUG: Pyarrow timestamp support for map() function
Pull Request -
State: closed - Opened by arthurlw 8 months ago
- 2 comments
Labels: Stale, Arrow
#61236 - BUG: Pyarrow timestamp support for map() function
Pull Request -
State: closed - Opened by arthurlw 8 months ago
- 2 comments
Labels: Stale, Arrow
#61231 - BUG: PyArrow timestamp type does not work with map() function
Issue -
State: open - Opened by dbalabka 8 months ago
- 3 comments
Labels: Bug, Apply, Arrow
#61229 - BUG: Fix #61222: Keep index name when resampling with pyarrow dtype
Pull Request -
State: closed - Opened by mthiboust 8 months ago
- 2 comments
Labels: Resample, Arrow
#61222 - BUG: Index name lost when using "resample" with pyarrow dtypes
Issue -
State: closed - Opened by mthiboust 8 months ago
- 2 comments
Labels: Bug, Resample, Arrow
#61193 - BUG: Fix pyarrow categoricals not working for pivot and multiindex
Pull Request -
State: closed - Opened by robin-mader-bis 8 months ago
- 2 comments
Labels: Reshaping, Arrow
#61193 - BUG: Fix pyarrow categoricals not working for pivot and multiindex
Pull Request -
State: closed - Opened by robin-mader-bis 8 months ago
- 3 comments
Labels: Reshaping, Arrow
#61144 - BUG(?): rolling sum with pyarrow types results in float64 instead of preserving integer type
Issue -
State: open - Opened by MarcoGorelli 8 months ago
- 4 comments
Labels: Bug, Dtype Conversions, Window, Arrow
#61105 - BUG: DataFrame.explode doesn't work for pyarrow.large_list type
Pull Request -
State: closed - Opened by snitish 9 months ago
- 1 comment
Labels: Arrow
#61105 - BUG: DataFrame.explode doesn't work for pyarrow.large_list type
Pull Request -
State: closed - Opened by snitish 9 months ago
Labels: Arrow
#61103 - Enh arrow json extension
Pull Request -
State: closed - Opened by asharmalik19 9 months ago
- 3 comments
Labels: Arrow
#61103 - Enh arrow json extension
Pull Request -
State: closed - Opened by asharmalik19 9 months ago
Labels: Arrow
#61102 - Backport PR #61098: CI/TST: Address TestArrowArray::test_reduce_series_numeric supporting skew
Pull Request -
State: closed - Opened by mroeschke 9 months ago
Labels: Testing, Arrow
#61102 - Backport PR #61098: CI/TST: Address TestArrowArray::test_reduce_series_numeric supporting skew
Pull Request -
State: closed - Opened by mroeschke 9 months ago
Labels: Testing, Arrow
#61098 - CI/TST: Address TestArrowArray::test_reduce_series_numeric supporting skew
Pull Request -
State: closed - Opened by mroeschke 9 months ago
- 1 comment
Labels: Testing, Arrow
#61098 - CI/TST: Address TestArrowArray::test_reduce_series_numeric supporting skew
Pull Request -
State: closed - Opened by mroeschke 9 months ago
Labels: Testing, Arrow
#61091 - BUG: DataFrame.explode doesn't explode when using pyarrow large list type
Issue -
State: closed - Opened by MarcoGorelli 9 months ago
- 1 comment
Labels: Bug, Reshaping, Arrow
#61073 - BUG: fix bug in str.fullmatch for Arrow backend with optional groups
Pull Request -
State: closed - Opened by ptth222 9 months ago
- 7 comments
Labels: Strings, Arrow
#61072 - BUG: str.fullmatch behavior is not the same for object dtype and string[pyarrow] dtype
Issue -
State: closed - Opened by ptth222 9 months ago
- 2 comments
Labels: Bug, Strings, Arrow
#60997 - BUG: fix read_json ignoring the dtype with the pyarrow engine
Pull Request -
State: closed - Opened by will-larkin 9 months ago
- 4 comments
Labels: IO JSON, Arrow
#60997 - BUG: fix read_json ignoring the dtype with the pyarrow engine
Pull Request -
State: closed - Opened by will-larkin 9 months ago
- 3 comments
Labels: IO JSON, Arrow
#60958 - ENH: Support `pa.json_` in arrow extension type
Issue -
State: closed - Opened by chelsea-lin 9 months ago
- 5 comments
Labels: Enhancement, Arrow
#60949 - Fix error `value_counts` result with pyarrow categorical columns
Pull Request -
State: closed - Opened by chilin0525 9 months ago
- 1 comment
Labels: Categorical, Arrow
#60838 - BUG: Unknown Error - Getting from Databricks SQL Python - From PyArrow module (pyarrow.lib.ArrowException)
Issue -
State: open - Opened by Gobi2511 10 months ago
- 1 comment
Labels: Bug, IO SQL, Needs Info, Arrow
#60778 - PERF: Avoid a numpy array copy in ArrowExtensionArray._to_datetimearray
Pull Request -
State: open - Opened by mroeschke 10 months ago
Labels: Performance, Arrow
#60778 - PERF: Avoid a numpy array copy in ArrowExtensionArray._to_datetimearray
Pull Request -
State: closed - Opened by mroeschke 10 months ago
- 1 comment
Labels: Performance, Arrow
#60777 - Fix arrow groupby na
Pull Request -
State: closed - Opened by asharmalik19 10 months ago
- 1 comment
Labels: Arrow
#60777 - Fix arrow groupby na
Pull Request -
State: closed - Opened by asharmalik19 10 months ago
- 2 comments
Labels: Arrow
#60729 - BUG: Series.mask incorrectly replaces positions of pd.NA in the cond argument
Issue -
State: open - Opened by kartoria 10 months ago
- 7 comments
Labels: Bug, Missing-data, NA - MaskedArrays, Arrow
#60716 - Update PyArrow conversion and arrow/parquet tests for pyarrow 19.0
Pull Request -
State: closed - Opened by jorisvandenbossche 11 months ago
- 2 comments
Labels: Compat, IO Parquet, Arrow
#60716 - Update PyArrow conversion and arrow/parquet tests for pyarrow 19.0
Pull Request -
State: closed - Opened by jorisvandenbossche 11 months ago
- 4 comments
Labels: Compat, IO Parquet, Arrow
#60633 - ENH: Implement cum* methods for PyArrow strings
Pull Request -
State: open - Opened by rhshadrach 11 months ago
Labels: Enhancement, Strings, Arrow, Transformations
#60633 - ENH: Implement cum* methods for PyArrow strings
Pull Request -
State: closed - Opened by rhshadrach 11 months ago
- 5 comments
Labels: Enhancement, Strings, Arrow, Transformations
#60569 - BUG: value_counts() check Index with PyArrow categorical columns
Pull Request -
State: closed - Opened by NOBODIDI 12 months ago
- 2 comments
Labels: Bug, Algos, Categorical, Stale, Arrow
#60569 - BUG: value_counts() check Index with PyArrow categorical columns
Pull Request -
State: closed - Opened by NOBODIDI 12 months ago
- 2 comments
Labels: Bug, Algos, Categorical, Stale, Arrow
#60567 - BUG: groupby with dropna=False and pa.dictionary drops NA values
Issue -
State: open - Opened by rhshadrach 12 months ago
Labels: Groupby, Missing-data, Arrow
#60545 - Backport PR #60544 on branch 2.3.x (CI/TST: Use tm.external_error_raised for test_from_arrow_respecting_given_dtype_unsafe)
Pull Request -
State: closed - Opened by meeseeksmachine 12 months ago
Labels: Testing, Arrow
#60545 - Backport PR #60544 on branch 2.3.x (CI/TST: Use tm.external_error_raised for test_from_arrow_respecting_given_dtype_unsafe)
Pull Request -
State: closed - Opened by meeseeksmachine 12 months ago
Labels: Testing, Arrow
#60544 - CI/TST: Use tm.external_error_raised for test_from_arrow_respecting_given_dtype_unsafe
Pull Request -
State: closed - Opened by mroeschke 12 months ago
- 1 comment
Labels: Testing, Arrow
#60527 - BUG: Fix `ListAccessor` methods to preserve original name
Pull Request -
State: closed - Opened by FBruzzesi 12 months ago
- 1 comment
Labels: Series, Arrow
#60527 - BUG: Fix `ListAccessor` methods to preserve original name
Pull Request -
State: closed - Opened by FBruzzesi 12 months ago
- 2 comments
Labels: Series, Arrow
#60522 - BUG: `ListAccessor` does not preserve series name
Issue -
State: closed - Opened by FBruzzesi 12 months ago
- 2 comments
Labels: Bug, Series, Arrow
#60413 - BUG: fix NameError raised when specifying dtype with string having "[pyarrow]" while PyArrow is not installed
Pull Request -
State: closed - Opened by yuanx749 about 1 year ago
Labels: Arrow
#60413 - BUG: fix NameError raised when specifying dtype with string having "[pyarrow]" while PyArrow is not installed
Pull Request -
State: closed - Opened by yuanx749 about 1 year ago
- 1 comment
Labels: Arrow
#60324 - REF: centralize pyarrow Table to pandas conversions and types_mapper handling
Pull Request -
State: closed - Opened by jorisvandenbossche about 1 year ago
Labels: Arrow
#60286 - TST: Add test for `pd.read_csv` date parsing not working with `dtype_backend="pyarrow"` and missing values
Pull Request -
State: closed - Opened by KevsterAmp about 1 year ago
- 1 comment
Labels: Testing, Missing-data, IO CSV, Arrow
#60286 - TST: Add test for `pd.read_csv` date parsing not working with `dtype_backend="pyarrow"` and missing values
Pull Request -
State: closed - Opened by KevsterAmp about 1 year ago
- 2 comments
Labels: Testing, Missing-data, IO CSV, Arrow
#60248 - Backport PR #60241 on branch 2.3.x (TST (string dtype): resolve xfail in arrow interface tests)
Pull Request -
State: closed - Opened by meeseeksmachine about 1 year ago
Labels: Strings, Arrow
#60241 - TST (string dtype): resolve xfail in arrow interface tests
Pull Request -
State: closed - Opened by jorisvandenbossche about 1 year ago
Labels: Strings, Arrow
#60235 - ENH (string dtype): convert string_view columns to future string dtype instead of object dtype in Parquet/Feather IO
Pull Request -
State: closed - Opened by jorisvandenbossche about 1 year ago
- 3 comments
Labels: Strings, IO Parquet, Arrow
#60235 - ENH (string dtype): convert string_view columns to future string dtype instead of object dtype in Parquet/Feather IO
Pull Request -
State: closed - Opened by jorisvandenbossche about 1 year ago
Labels: Strings, IO Parquet, Arrow
#60223 - Backport PR #60222 on branch 2.3.x (ENH (string dtype): accept string_view in addition to string/large_string for ArrowStringArray input)
Pull Request -
State: closed - Opened by meeseeksmachine about 1 year ago
Labels: Strings, Arrow
#60223 - Backport PR #60222 on branch 2.3.x (ENH (string dtype): accept string_view in addition to string/large_string for ArrowStringArray input)
Pull Request -
State: closed - Opened by meeseeksmachine about 1 year ago
Labels: Strings, Arrow
#60222 - ENH (string dtype): accept string_view in addition to string/large_string for ArrowStringArray input
Pull Request -
State: closed - Opened by jorisvandenbossche about 1 year ago
- 1 comment
Labels: Strings, Arrow
#60222 - ENH (string dtype): accept string_view in addition to string/large_string for ArrowStringArray input
Pull Request -
State: closed - Opened by jorisvandenbossche about 1 year ago
Labels: Strings, Arrow
#60105 - Fix BUG: read_sql tries to convert blob/varbinary to string with pyarrow backend
Pull Request -
State: closed - Opened by kastkeepitjumpinlikekangaroos about 1 year ago
- 6 comments
Labels: IO SQL, Stale, Arrow
#59904 - BUG: pd.read_csv date parsing not working with dtype_backend="pyarrow" and missing values
Issue -
State: closed - Opened by mhabets about 1 year ago
- 12 comments
Labels: Bug, Missing-data, IO CSV, good first issue, Needs Tests, Arrow
#59780 - Consider moving pyarrow's pandas compatibility and conversion code to the pandas project?
Issue -
State: open - Opened by jorisvandenbossche about 1 year ago
Labels: Needs Discussion, Arrow
#59771 - ENH: ArrowTemporalProperties' object has no attribute 'to_period'
Issue -
State: open - Opened by dbalabka about 1 year ago
- 4 comments
Labels: Enhancement, Datetime, Needs Discussion, Arrow
#59768 - BUG/API (string dtype): return float dtype for series[str].rank()
Pull Request -
State: closed - Opened by jorisvandenbossche about 1 year ago
- 2 comments
Labels: Strings, Arrow, Transformations, backported