An open API service for providing issue and pull request metadata for open source projects.

GitHub / pandas-dev/pandas issues and pull requests

Labelled with: Strings

#59448 - String dtype: fix alignment sorting in case of python storage

Pull Request - State: open - Opened by jorisvandenbossche over 1 year ago
Labels: Strings

#59443 - REF (string dtype): de-duplicate _str_map methods

Pull Request - State: closed - Opened by jbrockmendel over 1 year ago - 1 comment
Labels: Strings, backported

#59437 - TST (string dtype): add test build with future strings enabled without pyarrow

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: CI, Strings, backported

#59437 - TST (string dtype): add test build with future strings enabled without pyarrow

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: CI, Strings

#59433 - TST (string dtype): un-xfail string tests specific to object dtype

Pull Request - State: closed - Opened by jbrockmendel over 1 year ago - 2 comments
Labels: Strings

#59433 - TST (string dtype): un-xfail string tests specific to object dtype

Pull Request - State: closed - Opened by jbrockmendel over 1 year ago - 5 comments
Labels: Strings

#59430 - TST (string dtype): fix groupby xfails with using_infer_string + update error message

Pull Request - State: closed - Opened by jbrockmendel over 1 year ago - 12 comments
Labels: Groupby, Strings

#59430 - TST (string dtype): fix groupby xfails with using_infer_string + update error message

Pull Request - State: closed - Opened by jbrockmendel over 1 year ago - 9 comments
Labels: Groupby, Strings

#59414 - API/TST: expand tests for string any/all reduction + fix pyarrow-based implementation

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 2 comments
Labels: API Design, Strings, Reduction Operations

#59414 - API/TST: expand tests for string any/all reduction + fix pyarrow-based implementation

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 3 comments
Labels: API Design, Strings, Reduction Operations, backported

#59388 - String dtype: use 'str' string alias and representation for NaN-variant of the dtype

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Strings

#59388 - String dtype: use 'str' string alias and representation for NaN-variant of the dtype

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Strings, backported

#59376 - String dtype: restrict options.mode.string_storage to python|pyarrow (remove pyarrow_numpy)

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Strings, backported

#59375 - TST (string dtype): replace string_storage fixture with explicit storage/na_value keyword arguments for dtype creation

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings, backported

#59368 - TST (string dtype): remove usage of arrow_string_storage fixture

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings, backported

#59368 - TST (string dtype): remove usage of arrow_string_storage fixture

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: Testing, Strings

#59352 - TST (string dtype): follow-up on GH-59329 fixing new xfails

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings

#59352 - TST (string dtype): follow-up on GH-59329 fixing new xfails

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 2 comments
Labels: Testing, Strings, backported

#59345 - TST (string dtype): change any_string_dtype fixture to use actual dtype instances

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings

#59345 - TST (string dtype): change any_string_dtype fixture to use actual dtype instances

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 2 comments
Labels: Testing, Strings, backported

#59330 - String dtype: rename the storage options and add `na_value` keyword in `StringDtype()`

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 2 comments
Labels: Strings

#59330 - String dtype: rename the storage options and add `na_value` keyword in `StringDtype()`

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 3 comments
Labels: Strings, backported

#59329 - TST (string dtype): xfail all currently failing tests with future.infer_string

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings

#59329 - TST (string dtype): xfail all currently failing tests with future.infer_string

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 4 comments
Labels: Testing, Strings, backported

#59328 - String dtype: overview of breaking behaviour changes

Issue - State: closed - Opened by jorisvandenbossche over 1 year ago - 19 comments
Labels: API Design, Strings

#59323 - TST (string dtype): clean-up xpasssing tests with future string dtype

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings, backported

#59323 - TST (string dtype): clean-up xpasssing tests with future string dtype

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings

#59320 - REF (string dtype): rename using_pyarrow_string_dtype to using_string_dtype

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 1 comment
Labels: Testing, Strings, backported

#59320 - REF (string dtype): rename using_pyarrow_string_dtype to using_string_dtype

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago
Labels: Testing, Strings

#58678 - Potential regression induced by "CLN: Simplify map_infer_mask (#58483)"

Issue - State: closed - Opened by DeaMariaLeon over 1 year ago - 1 comment
Labels: Strings, Benchmark

#58613 - Default string dtype (PDEP-14): naming convention to distinguish the dtype variants

Issue - State: closed - Opened by jorisvandenbossche over 1 year ago - 28 comments
Labels: API Design, Strings, Needs Discussion

#58597 - Backport PR #58590 on branch 2.2.x (BUG: Use large_string in string array consistently)

Pull Request - State: closed - Opened by meeseeksmachine over 1 year ago
Labels: Strings, Arrow

#58597 - Backport PR #58590 on branch 2.2.x (BUG: Use large_string in string array consistently)

Pull Request - State: closed - Opened by meeseeksmachine over 1 year ago
Labels: Strings, Arrow

#58590 - BUG: Use large_string in string array consistently

Pull Request - State: closed - Opened by phofl over 1 year ago
Labels: Strings, Arrow

#58581 - Default string dtype should not raise fallback performance warnings

Issue - State: closed - Opened by jorisvandenbossche over 1 year ago - 5 comments
Labels: Strings, Arrow

#58578 - Allow StringArray[python] to be backed by numpy StringDType in numpy 2.0

Pull Request - State: closed - Opened by lithomas1 over 1 year ago - 3 comments
Labels: Strings, Compat, Stale

#58459 - TST / string dtype: add env variable to enable future_string and add test build

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 4 comments
Labels: Testing, Strings, backported

#58451 - String dtype: implement object-dtype based StringArray variant with NumPy semantics

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 2 comments
Labels: Strings, backported

#58451 - String dtype: implement object-dtype based StringArray variant with NumPy semantics

Pull Request - State: closed - Opened by jorisvandenbossche over 1 year ago - 2 comments
Labels: Strings

#58418 - Bug: fix Series.str.split when 'regex=None' for series having 'pd.ArrowDtype(pa.string())' dtype

Pull Request - State: closed - Opened by yuanx749 over 1 year ago - 3 comments
Labels: Strings, Stale, Arrow

#58394 - ENH: Add support for numpy 2's string dtype

Pull Request - State: closed - Opened by lithomas1 over 1 year ago - 3 comments
Labels: Enhancement, Strings, Stale

#58394 - ENH: Add support for numpy 2's string dtype

Pull Request - State: closed - Opened by lithomas1 over 1 year ago - 3 comments
Labels: Enhancement, Strings, Stale

#58321 - BUG: Series.str.split broken with pyarrow strings and regex argument

Issue - State: open - Opened by WillAyd over 1 year ago - 3 comments
Labels: Bug, Strings, Arrow

#57733 - PERF: improve StringArray.isna

Pull Request - State: open - Opened by jorisvandenbossche over 1 year ago - 2 comments
Labels: Performance, Strings, Stale

#57542 - PERF: Return RangeIndex columns in str.extract when possible

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Performance, Strings

#57542 - PERF: Return RangeIndex columns in str.extract when possible

Pull Request - State: closed - Opened by mroeschke over 1 year ago
Labels: Performance, Strings

#57465 - PERF: Return RangeIndex columns in str.extract when possible

Pull Request - State: closed - Opened by mroeschke almost 2 years ago - 1 comment
Labels: Performance, Strings

#57465 - PERF: Return RangeIndex columns in str.extract when possible

Pull Request - State: closed - Opened by mroeschke almost 2 years ago - 1 comment
Labels: Performance, Strings

#57212 - BUG: ensure_string_array might modify read-only array inplace

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Bug, Strings

#57212 - BUG: ensure_string_array might modify read-only array inplace

Pull Request - State: closed - Opened by phofl almost 2 years ago
Labels: Bug, Strings

#57120 - Backport PR #57089 on branch 2.2.x (BUG: wide_to_long with string columns)

Pull Request - State: closed - Opened by meeseeksmachine almost 2 years ago
Labels: Reshaping, Strings

#57089 - BUG: wide_to_long with string columns

Pull Request - State: closed - Opened by mroeschke almost 2 years ago
Labels: Reshaping, Strings

#57089 - BUG: wide_to_long with string columns

Pull Request - State: closed - Opened by mroeschke almost 2 years ago - 1 comment
Labels: Reshaping, Strings

#57066 - BUG: df.str.match(pattern) fails with pd.options.future.infer_string = True with re.compile()

Issue - State: closed - Opened by ErichMarx almost 2 years ago - 1 comment
Labels: Bug, Regression, Strings, Arrow

#56997 - PERF: StringEngine for string dtype indexing ops

Pull Request - State: closed - Opened by lukemanley almost 2 years ago
Labels: Performance, Strings, Index

#56997 - PERF: StringEngine for string dtype indexing ops

Pull Request - State: closed - Opened by lukemanley almost 2 years ago - 1 comment
Labels: Performance, Strings, Index

#56938 - Backport PR #56445: Adjust merge tests for new string option

Pull Request - State: closed - Opened by lithomas1 almost 2 years ago
Labels: Strings

#56938 - Backport PR #56445: Adjust merge tests for new string option

Pull Request - State: closed - Opened by lithomas1 almost 2 years ago
Labels: Strings

#56792 - Series.str.find fix for pd.ArrowDtype(pa.string())

Pull Request - State: closed - Opened by rohanjain101 almost 2 years ago - 4 comments
Labels: Strings, Arrow

#56792 - Series.str.find fix for pd.ArrowDtype(pa.string())

Pull Request - State: closed - Opened by rohanjain101 almost 2 years ago - 5 comments
Labels: Strings, Arrow

#56754 - BUG: Interchange protocol uses `u` for string format code but offets are 8 bytes

Issue - State: closed - Opened by WillAyd almost 2 years ago - 1 comment
Labels: Bug, Strings, Interchange

#56691 - Bug pyarrow implementation of str.fullmatch matches partial string. issue #56652

Pull Request - State: closed - Opened by JackCollins91 almost 2 years ago - 6 comments
Labels: Strings, Arrow

#56691 - Bug pyarrow implementation of str.fullmatch matches partial string. issue #56652

Pull Request - State: closed - Opened by JackCollins91 almost 2 years ago - 4 comments
Labels: Strings, Arrow

#56663 - Pyarrow stringmatch fix

Pull Request - State: closed - Opened by neha3004 almost 2 years ago - 4 comments
Labels: Strings, Arrow

#56663 - Pyarrow stringmatch fix

Pull Request - State: closed - Opened by neha3004 almost 2 years ago - 4 comments
Labels: Strings, Arrow

#56652 - BUG: Pyarrow implementation of str.fullmatc matches partial string

Issue - State: open - Opened by sjnarmstrong almost 2 years ago - 4 comments
Labels: Bug, Strings, good first issue, Arrow

#56580 - support tuple in startswith/endswith for arrow strings

Pull Request - State: closed - Opened by rohanjain101 almost 2 years ago - 1 comment
Labels: Strings, Arrow

#56579 - BUG: Series.str.startswith/endswith don't support tuple pattern for pd.ArrowDtype(pa.string())

Issue - State: closed - Opened by rohanjain101 almost 2 years ago - 2 comments
Labels: Bug, Strings

#56538 - Support multiplication of pd.ArrowDtype(pa.string()) and integral value where integral value is a series

Pull Request - State: closed - Opened by rohanjain101 almost 2 years ago - 1 comment
Labels: Strings, Arrow

#56535 - Adjust format tests for new string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Strings

#56534 - TST: Adjust excel tests for new string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: IO Excel, Strings

#56529 - Adjust pivot tests for new string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Strings

#56528 - BUG: pivot dropping wrong column level with numeric columns and ea dtype

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Bug, Reshaping, Strings

#56527 - Adjust dummies tests for new string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Strings

#56526 - Adjust crosstab tests for new string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Strings

#56505 - PERF: casting to the new String dtype could be faster by leveraging pyarrow

Issue - State: open - Opened by jorisvandenbossche almost 2 years ago - 1 comment
Labels: Performance, Strings

#56446 - Adjust concat tests for string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Reshaping, Strings

#56445 - Adjust merge tests for new string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 3 comments
Labels: Strings

#56444 - BUG: merge_asof raising incorrect error for strings

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Reshaping, Strings

#56442 - BUG: merge not sorting for new string dtype

Pull Request - State: closed - Opened by phofl almost 2 years ago - 4 comments
Labels: Reshaping, Strings, Arrow

#56442 - BUG: merge not sorting for new string dtype

Pull Request - State: closed - Opened by phofl almost 2 years ago - 6 comments
Labels: Reshaping, Strings, Arrow

#56441 - BUG: merge not raising for String and numeric merges

Pull Request - State: closed - Opened by phofl almost 2 years ago - 1 comment
Labels: Reshaping, Strings

#56414 - Adjust groupby tests for string option

Issue - State: closed - Opened by phofl almost 2 years ago - 4 comments
Labels: Testing, Strings, Stale

#56414 - Adjust groupby tests for string option

Pull Request - State: closed - Opened by phofl almost 2 years ago - 4 comments
Labels: Testing, Strings, Stale

#56412 - Series.str.find fix for arrow strings when start < 0

Pull Request - State: closed - Opened by rohanjain101 almost 2 years ago - 1 comment
Labels: Strings

#56368 - BUG: Series.__mul__ for pyarrow strings

Pull Request - State: open - Opened by mroeschke almost 2 years ago
Labels: Numeric Operations, Strings, Arrow

#56348 - Backport PR #56332: BUG: str.split for ArrowDtype with pat=None

Pull Request - State: closed - Opened by mroeschke almost 2 years ago
Labels: Strings, Arrow

#56334 - ENH: Implement str.extract for ArrowDtype

Pull Request - State: closed - Opened by mroeschke almost 2 years ago
Labels: Strings, Arrow

#56332 - BUG: str.split for ArrowDtype with pat=None

Pull Request - State: closed - Opened by mroeschke almost 2 years ago - 1 comment
Labels: Strings, Arrow

#56271 - BUG: `.str.split()` fails with pyarrow strings

Issue - State: closed - Opened by mattharrison almost 2 years ago - 5 comments
Labels: Bug, Strings, Arrow

#56268 - BUG: `str.extract` Method Not Implemented for `pd.ArrowDtype(pa.string())`

Issue - State: closed - Opened by mattharrison almost 2 years ago - 6 comments
Labels: Bug, Strings, Arrow

#56263 - Backport PR #56179 on branch 2.1.x (BUG: to_numeric casting to ea for new string dtype)

Pull Request - State: closed - Opened by meeseeksmachine almost 2 years ago
Labels: Dtype Conversions, Strings

#56259 - BUG: new string dtype fails with >2 GB of data in a single column

Issue - State: closed - Opened by jorisvandenbossche almost 2 years ago - 6 comments
Labels: Bug, Strings, Arrow

#56245 - BUG: __eq__ raising for new arrow string dtype for incompatible objects

Pull Request - State: closed - Opened by phofl almost 2 years ago - 2 comments
Labels: Numeric Operations, Strings