GitHub / pandas-dev/pandas issues and pull requests
Labelled with: Strings
#59448 - String dtype: fix alignment sorting in case of python storage
Pull Request -
State: open - Opened by jorisvandenbossche over 1 year ago
Labels: Strings
#59443 - REF (string dtype): de-duplicate _str_map methods
Pull Request -
State: closed - Opened by jbrockmendel over 1 year ago
- 1 comment
Labels: Strings, backported
#59437 - TST (string dtype): add test build with future strings enabled without pyarrow
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: CI, Strings, backported
#59437 - TST (string dtype): add test build with future strings enabled without pyarrow
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: CI, Strings
#59433 - TST (string dtype): un-xfail string tests specific to object dtype
Pull Request -
State: closed - Opened by jbrockmendel over 1 year ago
- 2 comments
Labels: Strings
#59433 - TST (string dtype): un-xfail string tests specific to object dtype
Pull Request -
State: closed - Opened by jbrockmendel over 1 year ago
- 5 comments
Labels: Strings
#59430 - TST (string dtype): fix groupby xfails with using_infer_string + update error message
Pull Request -
State: closed - Opened by jbrockmendel over 1 year ago
- 12 comments
Labels: Groupby, Strings
#59430 - TST (string dtype): fix groupby xfails with using_infer_string + update error message
Pull Request -
State: closed - Opened by jbrockmendel over 1 year ago
- 9 comments
Labels: Groupby, Strings
#59414 - API/TST: expand tests for string any/all reduction + fix pyarrow-based implementation
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 2 comments
Labels: API Design, Strings, Reduction Operations
#59414 - API/TST: expand tests for string any/all reduction + fix pyarrow-based implementation
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 3 comments
Labels: API Design, Strings, Reduction Operations, backported
#59388 - String dtype: use 'str' string alias and representation for NaN-variant of the dtype
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Strings
#59388 - String dtype: use 'str' string alias and representation for NaN-variant of the dtype
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Strings, backported
#59376 - String dtype: restrict options.mode.string_storage to python|pyarrow (remove pyarrow_numpy)
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Strings, backported
#59376 - String dtype: restrict options.mode.string_storage to python|pyarrow (remove pyarrow_numpy)
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: Strings
#59375 - TST (string dtype): replace string_storage fixture with explicit storage/na_value keyword arguments for dtype creation
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: Testing, Strings
#59375 - TST (string dtype): replace string_storage fixture with explicit storage/na_value keyword arguments for dtype creation
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings, backported
#59368 - TST (string dtype): remove usage of arrow_string_storage fixture
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings, backported
#59368 - TST (string dtype): remove usage of arrow_string_storage fixture
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: Testing, Strings
#59352 - TST (string dtype): follow-up on GH-59329 fixing new xfails
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings
#59352 - TST (string dtype): follow-up on GH-59329 fixing new xfails
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 2 comments
Labels: Testing, Strings, backported
#59345 - TST (string dtype): change any_string_dtype fixture to use actual dtype instances
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings
#59345 - TST (string dtype): change any_string_dtype fixture to use actual dtype instances
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 2 comments
Labels: Testing, Strings, backported
#59330 - String dtype: rename the storage options and add `na_value` keyword in `StringDtype()`
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 2 comments
Labels: Strings
#59330 - String dtype: rename the storage options and add `na_value` keyword in `StringDtype()`
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 3 comments
Labels: Strings, backported
#59329 - TST (string dtype): xfail all currently failing tests with future.infer_string
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings
#59329 - TST (string dtype): xfail all currently failing tests with future.infer_string
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 4 comments
Labels: Testing, Strings, backported
#59328 - String dtype: overview of breaking behaviour changes
Issue -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 19 comments
Labels: API Design, Strings
#59323 - TST (string dtype): clean-up xpasssing tests with future string dtype
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings, backported
#59323 - TST (string dtype): clean-up xpasssing tests with future string dtype
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings
#59320 - REF (string dtype): rename using_pyarrow_string_dtype to using_string_dtype
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 1 comment
Labels: Testing, Strings, backported
#59320 - REF (string dtype): rename using_pyarrow_string_dtype to using_string_dtype
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: Testing, Strings
#58678 - Potential regression induced by "CLN: Simplify map_infer_mask (#58483)"
Issue -
State: closed - Opened by DeaMariaLeon over 1 year ago
- 1 comment
Labels: Strings, Benchmark
#58613 - Default string dtype (PDEP-14): naming convention to distinguish the dtype variants
Issue -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 28 comments
Labels: API Design, Strings, Needs Discussion
#58597 - Backport PR #58590 on branch 2.2.x (BUG: Use large_string in string array consistently)
Pull Request -
State: closed - Opened by meeseeksmachine over 1 year ago
Labels: Strings, Arrow
#58597 - Backport PR #58590 on branch 2.2.x (BUG: Use large_string in string array consistently)
Pull Request -
State: closed - Opened by meeseeksmachine over 1 year ago
Labels: Strings, Arrow
#58590 - BUG: Use large_string in string array consistently
Pull Request -
State: closed - Opened by phofl over 1 year ago
Labels: Strings, Arrow
#58581 - Default string dtype should not raise fallback performance warnings
Issue -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 5 comments
Labels: Strings, Arrow
#58578 - Allow StringArray[python] to be backed by numpy StringDType in numpy 2.0
Pull Request -
State: closed - Opened by lithomas1 over 1 year ago
- 3 comments
Labels: Strings, Compat, Stale
#58459 - TST / string dtype: add env variable to enable future_string and add test build
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 4 comments
Labels: Testing, Strings, backported
#58451 - String dtype: implement object-dtype based StringArray variant with NumPy semantics
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 2 comments
Labels: Strings, backported
#58451 - String dtype: implement object-dtype based StringArray variant with NumPy semantics
Pull Request -
State: closed - Opened by jorisvandenbossche over 1 year ago
- 2 comments
Labels: Strings
#58418 - Bug: fix Series.str.split when 'regex=None' for series having 'pd.ArrowDtype(pa.string())' dtype
Pull Request -
State: closed - Opened by yuanx749 over 1 year ago
- 3 comments
Labels: Strings, Stale, Arrow
#58394 - ENH: Add support for numpy 2's string dtype
Pull Request -
State: closed - Opened by lithomas1 over 1 year ago
- 3 comments
Labels: Enhancement, Strings, Stale
#58394 - ENH: Add support for numpy 2's string dtype
Pull Request -
State: closed - Opened by lithomas1 over 1 year ago
- 3 comments
Labels: Enhancement, Strings, Stale
#58321 - BUG: Series.str.split broken with pyarrow strings and regex argument
Issue -
State: open - Opened by WillAyd over 1 year ago
- 3 comments
Labels: Bug, Strings, Arrow
#58215 - BUG: pandas.Series.unique() does not return correct unique values on non UTF8 enodeable strings
Pull Request -
State: closed - Opened by mroeschke over 1 year ago
Labels: Strings
#58215 - BUG: pandas.Series.unique() does not return correct unique values on non UTF8 enodeable strings
Pull Request -
State: closed - Opened by mroeschke over 1 year ago
Labels: Strings
#57733 - PERF: improve StringArray.isna
Pull Request -
State: open - Opened by jorisvandenbossche over 1 year ago
- 2 comments
Labels: Performance, Strings, Stale
#57542 - PERF: Return RangeIndex columns in str.extract when possible
Pull Request -
State: closed - Opened by mroeschke over 1 year ago
- 1 comment
Labels: Performance, Strings
#57542 - PERF: Return RangeIndex columns in str.extract when possible
Pull Request -
State: closed - Opened by mroeschke over 1 year ago
Labels: Performance, Strings
#57465 - PERF: Return RangeIndex columns in str.extract when possible
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
- 1 comment
Labels: Performance, Strings
#57465 - PERF: Return RangeIndex columns in str.extract when possible
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
- 1 comment
Labels: Performance, Strings
#57212 - BUG: ensure_string_array might modify read-only array inplace
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Bug, Strings
#57212 - BUG: ensure_string_array might modify read-only array inplace
Pull Request -
State: closed - Opened by phofl almost 2 years ago
Labels: Bug, Strings
#57120 - Backport PR #57089 on branch 2.2.x (BUG: wide_to_long with string columns)
Pull Request -
State: closed - Opened by meeseeksmachine almost 2 years ago
Labels: Reshaping, Strings
#57089 - BUG: wide_to_long with string columns
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
Labels: Reshaping, Strings
#57089 - BUG: wide_to_long with string columns
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
- 1 comment
Labels: Reshaping, Strings
#57066 - BUG: df.str.match(pattern) fails with pd.options.future.infer_string = True with re.compile()
Issue -
State: closed - Opened by ErichMarx almost 2 years ago
- 1 comment
Labels: Bug, Regression, Strings, Arrow
#56997 - PERF: StringEngine for string dtype indexing ops
Pull Request -
State: closed - Opened by lukemanley almost 2 years ago
Labels: Performance, Strings, Index
#56997 - PERF: StringEngine for string dtype indexing ops
Pull Request -
State: closed - Opened by lukemanley almost 2 years ago
- 1 comment
Labels: Performance, Strings, Index
#56938 - Backport PR #56445: Adjust merge tests for new string option
Pull Request -
State: closed - Opened by lithomas1 almost 2 years ago
Labels: Strings
#56938 - Backport PR #56445: Adjust merge tests for new string option
Pull Request -
State: closed - Opened by lithomas1 almost 2 years ago
Labels: Strings
#56792 - Series.str.find fix for pd.ArrowDtype(pa.string())
Pull Request -
State: closed - Opened by rohanjain101 almost 2 years ago
- 4 comments
Labels: Strings, Arrow
#56792 - Series.str.find fix for pd.ArrowDtype(pa.string())
Pull Request -
State: closed - Opened by rohanjain101 almost 2 years ago
- 5 comments
Labels: Strings, Arrow
#56754 - BUG: Interchange protocol uses `u` for string format code but offets are 8 bytes
Issue -
State: closed - Opened by WillAyd almost 2 years ago
- 1 comment
Labels: Bug, Strings, Interchange
#56715 - Backport PR #56691 on branch 2.2.x (Bug pyarrow implementation of str.fullmatch matches partial string. issue #56652)
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
Labels: Strings, Arrow
#56715 - Backport PR #56691 on branch 2.2.x (Bug pyarrow implementation of str.fullmatch matches partial string. issue #56652)
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
Labels: Strings, Arrow
#56691 - Bug pyarrow implementation of str.fullmatch matches partial string. issue #56652
Pull Request -
State: closed - Opened by JackCollins91 almost 2 years ago
- 6 comments
Labels: Strings, Arrow
#56691 - Bug pyarrow implementation of str.fullmatch matches partial string. issue #56652
Pull Request -
State: closed - Opened by JackCollins91 almost 2 years ago
- 4 comments
Labels: Strings, Arrow
#56663 - Pyarrow stringmatch fix
Pull Request -
State: closed - Opened by neha3004 almost 2 years ago
- 4 comments
Labels: Strings, Arrow
#56663 - Pyarrow stringmatch fix
Pull Request -
State: closed - Opened by neha3004 almost 2 years ago
- 4 comments
Labels: Strings, Arrow
#56652 - BUG: Pyarrow implementation of str.fullmatc matches partial string
Issue -
State: open - Opened by sjnarmstrong almost 2 years ago
- 4 comments
Labels: Bug, Strings, good first issue, Arrow
#56580 - support tuple in startswith/endswith for arrow strings
Pull Request -
State: closed - Opened by rohanjain101 almost 2 years ago
- 1 comment
Labels: Strings, Arrow
#56579 - BUG: Series.str.startswith/endswith don't support tuple pattern for pd.ArrowDtype(pa.string())
Issue -
State: closed - Opened by rohanjain101 almost 2 years ago
- 2 comments
Labels: Bug, Strings
#56538 - Support multiplication of pd.ArrowDtype(pa.string()) and integral value where integral value is a series
Pull Request -
State: closed - Opened by rohanjain101 almost 2 years ago
- 1 comment
Labels: Strings, Arrow
#56535 - Adjust format tests for new string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Strings
#56534 - TST: Adjust excel tests for new string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: IO Excel, Strings
#56529 - Adjust pivot tests for new string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Strings
#56528 - BUG: pivot dropping wrong column level with numeric columns and ea dtype
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Bug, Reshaping, Strings
#56527 - Adjust dummies tests for new string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Strings
#56526 - Adjust crosstab tests for new string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Strings
#56505 - PERF: casting to the new String dtype could be faster by leveraging pyarrow
Issue -
State: open - Opened by jorisvandenbossche almost 2 years ago
- 1 comment
Labels: Performance, Strings
#56446 - Adjust concat tests for string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Reshaping, Strings
#56445 - Adjust merge tests for new string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 3 comments
Labels: Strings
#56444 - BUG: merge_asof raising incorrect error for strings
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Reshaping, Strings
#56442 - BUG: merge not sorting for new string dtype
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 4 comments
Labels: Reshaping, Strings, Arrow
#56442 - BUG: merge not sorting for new string dtype
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 6 comments
Labels: Reshaping, Strings, Arrow
#56441 - BUG: merge not raising for String and numeric merges
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 1 comment
Labels: Reshaping, Strings
#56414 - Adjust groupby tests for string option
Issue -
State: closed - Opened by phofl almost 2 years ago
- 4 comments
Labels: Testing, Strings, Stale
#56414 - Adjust groupby tests for string option
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 4 comments
Labels: Testing, Strings, Stale
#56412 - Series.str.find fix for arrow strings when start < 0
Pull Request -
State: closed - Opened by rohanjain101 almost 2 years ago
- 1 comment
Labels: Strings
#56368 - BUG: Series.__mul__ for pyarrow strings
Pull Request -
State: open - Opened by mroeschke almost 2 years ago
Labels: Numeric Operations, Strings, Arrow
#56348 - Backport PR #56332: BUG: str.split for ArrowDtype with pat=None
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
Labels: Strings, Arrow
#56334 - ENH: Implement str.extract for ArrowDtype
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
Labels: Strings, Arrow
#56332 - BUG: str.split for ArrowDtype with pat=None
Pull Request -
State: closed - Opened by mroeschke almost 2 years ago
- 1 comment
Labels: Strings, Arrow
#56271 - BUG: `.str.split()` fails with pyarrow strings
Issue -
State: closed - Opened by mattharrison almost 2 years ago
- 5 comments
Labels: Bug, Strings, Arrow
#56268 - BUG: `str.extract` Method Not Implemented for `pd.ArrowDtype(pa.string())`
Issue -
State: closed - Opened by mattharrison almost 2 years ago
- 6 comments
Labels: Bug, Strings, Arrow
#56263 - Backport PR #56179 on branch 2.1.x (BUG: to_numeric casting to ea for new string dtype)
Pull Request -
State: closed - Opened by meeseeksmachine almost 2 years ago
Labels: Dtype Conversions, Strings
#56259 - BUG: new string dtype fails with >2 GB of data in a single column
Issue -
State: closed - Opened by jorisvandenbossche almost 2 years ago
- 6 comments
Labels: Bug, Strings, Arrow
#56245 - BUG: __eq__ raising for new arrow string dtype for incompatible objects
Pull Request -
State: closed - Opened by phofl almost 2 years ago
- 2 comments
Labels: Numeric Operations, Strings