GitHub / pandas-dev/pandas issues and pull requests
Labelled with: Strings
#54537 - REF: Refactor using_pyarrow check for string tests
Pull Request -
State: closed - Opened by phofl over 2 years ago
- 1 comment
Labels: Refactor, Strings
#54536 - REF: Move checks to object into a variable
Pull Request -
State: closed - Opened by phofl over 2 years ago
Labels: Refactor, Strings
#54530 - REF: Factor out string methods from ArrowExtensionArray
Pull Request -
State: closed - Opened by phofl over 2 years ago
- 1 comment
Labels: Strings, Arrow
#54512 - Backport PR #54498 on branch 2.1.x (Speed up string inference in maybe_convert_objects)
Pull Request -
State: closed - Opened by meeseeksmachine over 2 years ago
Labels: Strings
#54498 - Speed up string inference in maybe_convert_objects
Pull Request -
State: closed - Opened by phofl over 2 years ago
- 3 comments
Labels: Strings
#54496 - Fix inference for fixed with numpy strings with arrow string option
Pull Request -
State: closed - Opened by phofl over 2 years ago
- 3 comments
Labels: Strings
#54430 - ENH: allow opt-in to inferring pyarrow strings
Pull Request -
State: closed - Opened by phofl over 2 years ago
- 4 comments
Labels: Strings, Arrow
#54300 - BUG: from_dummies always returning object data
Pull Request -
State: closed - Opened by mroeschke over 2 years ago
- 1 comment
Labels: Dtype Conversions, Strings
#54190 - BUG: `testing.assert_frame_equal` unhelpful error message for `string[pyarrow]`
Issue -
State: open - Opened by randolf-scholz over 2 years ago
Labels: Bug, Strings, IO Parquet, Arrow
#53846 - BUG: Series.str.extractall with arrow string returns object dtype
Issue -
State: closed - Opened by wirable23 over 2 years ago
- 4 comments
Labels: Bug, Strings, Arrow, pyarrow dtype retention
#53833 - BUG: Arrow backed series can't be displayed after string slicing it
Issue -
State: closed - Opened by wirable23 over 2 years ago
- 5 comments
Labels: Bug, Strings, Upstream issue, Arrow
#53720 - BUG: Series.unique() terminates strings prematurely on null Bytes
Issue -
State: closed - Opened by Nadrons over 2 years ago
- 2 comments
Labels: Bug, Algos, Strings
#53646 - ENH: Series.str.join for ArrowDtype(pa.string())
Pull Request -
State: closed - Opened by lukemanley over 2 years ago
- 1 comment
Labels: Strings, Arrow
#53585 - PERF: Series.str.split(expand=True) for pyarrow-backed strings
Pull Request -
State: closed - Opened by lukemanley over 2 years ago
- 1 comment
Labels: Performance, Strings, Arrow
#53549 - Backport PR #53532 on branch 2.0.x (BUG: Series.str.split(expand=True) for ArrowDtype(pa.string()))
Pull Request -
State: closed - Opened by lukemanley over 2 years ago
Labels: Bug, Strings, Arrow
#53431 - TST: Add test for series str decode GH#22613
Pull Request -
State: closed - Opened by liang3zy22 over 2 years ago
- 1 comment
Labels: Testing, Strings
#53024 - BUG: Series.str.get with arrow String Dtype allows index to be floating point value
Issue -
State: closed - Opened by wirable23 over 2 years ago
- 2 comments
Labels: Bug, Error Reporting, Strings, Upstream issue, Arrow
#52987 - BUG: Series[python_str].__setitem__(series[pyarrow_string]
Issue -
State: open - Opened by jbrockmendel over 2 years ago
- 2 comments
Labels: Bug, Strings, Arrow
#52434 - BUG: String slicing produces different results with pyarrow string datatype compared to python string type
Issue -
State: closed - Opened by rohanjain101 over 2 years ago
- 8 comments
Labels: Bug, Strings, Upstream issue, Arrow
#52429 - API/DEPR: dtype=(str|bytes) interpret as pyarrow
Issue -
State: closed - Opened by jbrockmendel over 2 years ago
- 8 comments
Labels: Dtype Conversions, Strings, Deprecate, Arrow
#52037 - BUG: Different result with ẞ in Series.str.upper when using Arrow string dtype vs pd.StringDtype()
Issue -
State: closed - Opened by rohanjain101 over 2 years ago
- 8 comments
Labels: Docs, Strings, Upstream issue, Arrow
#51948 - ENH: Can pandas.Series.str.len returns a nullable pd.Int64Dtype rather than float64
Issue -
State: open - Opened by chelsea-lin over 2 years ago
- 2 comments
Labels: Enhancement, Strings, NA - MaskedArrays
#51939 - BUG: `any()` and `all()` raise with extension strings
Issue -
State: open - Opened by jrbourbeau over 2 years ago
- 10 comments
Labels: Bug, Strings, Arrow
#51748 - ENH: Allow dictionaries to be passed to pandas.Series.str.replace
Issue -
State: closed - Opened by lukefeilberg over 2 years ago
- 14 comments
Labels: Enhancement, API Design, Strings, API - Consistency
#51675 - BUG: bytes of special nan value are lost after pd.concat
Issue -
State: open - Opened by boxblox over 2 years ago
- 2 comments
Labels: Bug, Missing-data, Reshaping, Strings
#51400 - BUG: Undefined behavior of Series.str.split
Issue -
State: open - Opened by YarShev almost 3 years ago
- 4 comments
Labels: Bug, Strings
#50099 - BUG: `DataFrame.dtypes` doesn't include backend for `string` columns
Issue -
State: open - Opened by jrbourbeau almost 3 years ago
- 3 comments
Labels: Bug, Output-Formatting, Strings, DataFrame
#50074 - BUG: `string[pyarrow]` dtype doesn't roundtrip through `pyarrow`
Issue -
State: open - Opened by jrbourbeau almost 3 years ago
- 9 comments
Labels: Bug, Strings, Arrow
#48964 - PERF: `pd.util.hash_pandas_object` slower on `string[pyarrow]` than `object` dtypes
Issue -
State: open - Opened by jrbourbeau about 3 years ago
- 9 comments
Labels: Performance, Strings, hashing, Arrow
#48638 - BUG: StringArray is a subclass of PandasArray
Issue -
State: closed - Opened by ehsantn about 3 years ago
- 5 comments
Labels: Internals, Strings, ExtensionArray, Closing Candidate
#48469 - ENH: Remove ArrowStringArray and StringDtype("pyarrow")
Issue -
State: closed - Opened by gsheni about 3 years ago
- 6 comments
Labels: Enhancement, Strings, Deprecate, Needs Discussion, Arrow
#47884 - Pandas string dtype needs from NumPy - prototyping & plan of attack
Issue -
State: open - Opened by rgommers over 3 years ago
- 10 comments
Labels: API Design, Strings
#47872 - ENH: Allow different `dtype` in `pandas.Series.str.get_dummies`
Issue -
State: closed - Opened by JeffersonQin over 3 years ago
- 1 comment
Labels: Enhancement, Performance, Strings
#47667 - BUG: caching in CachedAccessor is problematic and (probably) unnecessary
Issue -
State: closed - Opened by kdebrab over 3 years ago
- 3 comments
Labels: Bug, Performance, Strings, Accessors
#45962 - DOC: pandas.Series.str.is(digit|alpha|lower|upper|...) source links point to wrapper function
Issue -
State: open - Opened by janosh almost 4 years ago
- 1 comment
Labels: Docs, Strings
#45929 - BUG: pandas.Series.unique() does not return correct unique values on \u string
Issue -
State: open - Opened by cbhushan almost 4 years ago
- 4 comments
Labels: Bug, Algos, Strings
#44203 - ENH: Set default storage for strings globaly
Issue -
State: closed - Opened by taoufik07 about 4 years ago
- 4 comments
Labels: Enhancement, Strings
#44104 - BUG: DataFrame.update do not preserve string dtype
Issue -
State: open - Opened by ali-cetin-4ss about 4 years ago
- 5 comments
Labels: Bug, Dtype Conversions, Strings, good first issue, Needs Tests, pyarrow dtype retention, combine/combine_first/update
#42918 - BUG: "ValueError: Must provide strings." when using "string" as dtype
Issue -
State: closed - Opened by omgMath over 4 years ago
- 6 comments
Labels: Bug, Missing-data, Strings
#42863 - ArrowStringArray: delay/conditionally import pyarrow.compute
Issue -
State: open - Opened by jorisvandenbossche over 4 years ago
- 4 comments
Labels: Bug, Strings, Compat, Closing Candidate
#42664 - BUG: `DataFrame.to_parquet` doesn't round-trip pyarrow StringDtype
Issue -
State: open - Opened by TomAugspurger over 4 years ago
- 16 comments
Labels: Docs, Strings, IO Parquet
#41973 - BUG: numpy.char funcs error with string array citing non-string array.
Issue -
State: open - Opened by attack68 over 4 years ago
Labels: Bug, Strings, Compat, ufuncs
#37963 - BUG: Some string methods treat "." as regex, others don't
Issue -
State: open - Opened by giangiacomosanna about 5 years ago
- 2 comments
Labels: Docs, Strings, API - Consistency
#37893 - BUG: fixed length string data type specifiers, are ignored on astype calls, if they are passed in a dictionary
Issue -
State: closed - Opened by okz about 5 years ago
- 1 comment
Labels: Bug, Dtype Conversions, Strings
#33269 - Consider adding isnative property to StringDtype?
Issue -
State: open - Opened by krassowski over 5 years ago
- 1 comment
Labels: Enhancement, Strings, Needs Discussion, ExtensionArray, Closing Candidate
#32993 - df.drop_duplicates() not working as expected
Issue -
State: closed - Opened by kenjioman over 5 years ago
- 9 comments
Labels: Bug, Strings, duplicated
#32835 - Feature request: Allow regex flags for `str.split`
Issue -
State: closed - Opened by tsoernes over 5 years ago
- 2 comments
Labels: Enhancement, Strings
#32233 - [FEATURE] Ability to pass regex module flags in DataFrame.replace while regex=True
Issue -
State: open - Opened by vishnu-dev over 5 years ago
- 2 comments
Labels: Enhancement, Strings, replace
#31516 - query works for string data type columns with missing values but not for boolean indexing
Issue -
State: open - Opened by tdpetrou almost 6 years ago
- 2 comments
Labels: Strings, good first issue, ExtensionArray, Needs Tests, expressions
#31356 - AssertionError creating array with StringDtype instead of StringDtype(), can we have a friendlier error message?
Issue -
State: closed - Opened by gerritholl almost 6 years ago
- 1 comment
Labels: Enhancement, Error Reporting, Strings
#25353 - astype(str) / astype_unicode: np.nan converted to "nan" (checknull, skipna)
Issue -
State: closed - Opened by ThibTrip almost 7 years ago
- 18 comments
Labels: Bug, Missing-data, Strings, Astype
#23800 - ENH: Add `pd.DataFrame.strformat` method for simplified string series construction
Issue -
State: closed - Opened by ms7463 about 7 years ago
- 3 comments
Labels: Enhancement, Strings, Needs Discussion, DataFrame
#22613 - Series.str.decode() turns arrays of strings to NaN and fails on byte strings
Issue -
State: closed - Opened by abrakababra about 7 years ago
- 10 comments
Labels: Strings, good first issue, Needs Tests
#22496 - Add support for "regex" library
Issue -
State: open - Opened by pmav99 about 7 years ago
- 22 comments
Labels: Enhancement, Strings
#19671 - auto convert from string to datetime64 in iterrows.
Issue -
State: closed - Opened by xiaoluffy almost 8 years ago
- 7 comments
Labels: Bug, Reshaping, Dtype Conversions, Strings
#13992 - SeriesGroupby.cumsum raises on object dtype
Issue -
State: closed - Opened by agraboso over 9 years ago
- 3 comments
Labels: Bug, Groupby, Strings