An open API service for providing issue and pull request metadata for open source projects.

GitHub / pandas-dev/pandas issues and pull requests

Labelled with: Strings

#54537 - REF: Refactor using_pyarrow check for string tests

Pull Request - State: closed - Opened by phofl over 2 years ago - 1 comment
Labels: Refactor, Strings

#54536 - REF: Move checks to object into a variable

Pull Request - State: closed - Opened by phofl over 2 years ago
Labels: Refactor, Strings

#54530 - REF: Factor out string methods from ArrowExtensionArray

Pull Request - State: closed - Opened by phofl over 2 years ago - 1 comment
Labels: Strings, Arrow

#54498 - Speed up string inference in maybe_convert_objects

Pull Request - State: closed - Opened by phofl over 2 years ago - 3 comments
Labels: Strings

#54496 - Fix inference for fixed with numpy strings with arrow string option

Pull Request - State: closed - Opened by phofl over 2 years ago - 3 comments
Labels: Strings

#54430 - ENH: allow opt-in to inferring pyarrow strings

Pull Request - State: closed - Opened by phofl over 2 years ago - 4 comments
Labels: Strings, Arrow

#54300 - BUG: from_dummies always returning object data

Pull Request - State: closed - Opened by mroeschke over 2 years ago - 1 comment
Labels: Dtype Conversions, Strings

#54190 - BUG: `testing.assert_frame_equal` unhelpful error message for `string[pyarrow]`

Issue - State: open - Opened by randolf-scholz over 2 years ago
Labels: Bug, Strings, IO Parquet, Arrow

#53846 - BUG: Series.str.extractall with arrow string returns object dtype

Issue - State: closed - Opened by wirable23 over 2 years ago - 4 comments
Labels: Bug, Strings, Arrow, pyarrow dtype retention

#53833 - BUG: Arrow backed series can't be displayed after string slicing it

Issue - State: closed - Opened by wirable23 over 2 years ago - 5 comments
Labels: Bug, Strings, Upstream issue, Arrow

#53720 - BUG: Series.unique() terminates strings prematurely on null Bytes

Issue - State: closed - Opened by Nadrons over 2 years ago - 2 comments
Labels: Bug, Algos, Strings

#53646 - ENH: Series.str.join for ArrowDtype(pa.string())

Pull Request - State: closed - Opened by lukemanley over 2 years ago - 1 comment
Labels: Strings, Arrow

#53585 - PERF: Series.str.split(expand=True) for pyarrow-backed strings

Pull Request - State: closed - Opened by lukemanley over 2 years ago - 1 comment
Labels: Performance, Strings, Arrow

#53549 - Backport PR #53532 on branch 2.0.x (BUG: Series.str.split(expand=True) for ArrowDtype(pa.string()))

Pull Request - State: closed - Opened by lukemanley over 2 years ago
Labels: Bug, Strings, Arrow

#53431 - TST: Add test for series str decode GH#22613

Pull Request - State: closed - Opened by liang3zy22 over 2 years ago - 1 comment
Labels: Testing, Strings

#53024 - BUG: Series.str.get with arrow String Dtype allows index to be floating point value

Issue - State: closed - Opened by wirable23 over 2 years ago - 2 comments
Labels: Bug, Error Reporting, Strings, Upstream issue, Arrow

#52987 - BUG: Series[python_str].__setitem__(series[pyarrow_string]

Issue - State: open - Opened by jbrockmendel over 2 years ago - 2 comments
Labels: Bug, Strings, Arrow

#52434 - BUG: String slicing produces different results with pyarrow string datatype compared to python string type

Issue - State: closed - Opened by rohanjain101 over 2 years ago - 8 comments
Labels: Bug, Strings, Upstream issue, Arrow

#52429 - API/DEPR: dtype=(str|bytes) interpret as pyarrow

Issue - State: closed - Opened by jbrockmendel over 2 years ago - 8 comments
Labels: Dtype Conversions, Strings, Deprecate, Arrow

#52037 - BUG: Different result with ẞ in Series.str.upper when using Arrow string dtype vs pd.StringDtype()

Issue - State: closed - Opened by rohanjain101 over 2 years ago - 8 comments
Labels: Docs, Strings, Upstream issue, Arrow

#51948 - ENH: Can pandas.Series.str.len returns a nullable pd.Int64Dtype rather than float64

Issue - State: open - Opened by chelsea-lin over 2 years ago - 2 comments
Labels: Enhancement, Strings, NA - MaskedArrays

#51939 - BUG: `any()` and `all()` raise with extension strings

Issue - State: open - Opened by jrbourbeau over 2 years ago - 10 comments
Labels: Bug, Strings, Arrow

#51748 - ENH: Allow dictionaries to be passed to pandas.Series.str.replace

Issue - State: closed - Opened by lukefeilberg over 2 years ago - 14 comments
Labels: Enhancement, API Design, Strings, API - Consistency

#51675 - BUG: bytes of special nan value are lost after pd.concat

Issue - State: open - Opened by boxblox over 2 years ago - 2 comments
Labels: Bug, Missing-data, Reshaping, Strings

#51400 - BUG: Undefined behavior of Series.str.split

Issue - State: open - Opened by YarShev almost 3 years ago - 4 comments
Labels: Bug, Strings

#50099 - BUG: `DataFrame.dtypes` doesn't include backend for `string` columns

Issue - State: open - Opened by jrbourbeau almost 3 years ago - 3 comments
Labels: Bug, Output-Formatting, Strings, DataFrame

#50074 - BUG: `string[pyarrow]` dtype doesn't roundtrip through `pyarrow`

Issue - State: open - Opened by jrbourbeau almost 3 years ago - 9 comments
Labels: Bug, Strings, Arrow

#48964 - PERF: `pd.util.hash_pandas_object` slower on `string[pyarrow]` than `object` dtypes

Issue - State: open - Opened by jrbourbeau about 3 years ago - 9 comments
Labels: Performance, Strings, hashing, Arrow

#48638 - BUG: StringArray is a subclass of PandasArray

Issue - State: closed - Opened by ehsantn about 3 years ago - 5 comments
Labels: Internals, Strings, ExtensionArray, Closing Candidate

#48469 - ENH: Remove ArrowStringArray and StringDtype("pyarrow")

Issue - State: closed - Opened by gsheni about 3 years ago - 6 comments
Labels: Enhancement, Strings, Deprecate, Needs Discussion, Arrow

#47884 - Pandas string dtype needs from NumPy - prototyping & plan of attack

Issue - State: open - Opened by rgommers over 3 years ago - 10 comments
Labels: API Design, Strings

#47872 - ENH: Allow different `dtype` in `pandas.Series.str.get_dummies`

Issue - State: closed - Opened by JeffersonQin over 3 years ago - 1 comment
Labels: Enhancement, Performance, Strings

#47667 - BUG: caching in CachedAccessor is problematic and (probably) unnecessary

Issue - State: closed - Opened by kdebrab over 3 years ago - 3 comments
Labels: Bug, Performance, Strings, Accessors

#45962 - DOC: pandas.Series.str.is(digit|alpha|lower|upper|...) source links point to wrapper function

Issue - State: open - Opened by janosh almost 4 years ago - 1 comment
Labels: Docs, Strings

#45929 - BUG: pandas.Series.unique() does not return correct unique values on \u string

Issue - State: open - Opened by cbhushan almost 4 years ago - 4 comments
Labels: Bug, Algos, Strings

#44203 - ENH: Set default storage for strings globaly

Issue - State: closed - Opened by taoufik07 about 4 years ago - 4 comments
Labels: Enhancement, Strings

#44104 - BUG: DataFrame.update do not preserve string dtype

Issue - State: open - Opened by ali-cetin-4ss about 4 years ago - 5 comments
Labels: Bug, Dtype Conversions, Strings, good first issue, Needs Tests, pyarrow dtype retention, combine/combine_first/update

#42918 - BUG: "ValueError: Must provide strings." when using "string" as dtype

Issue - State: closed - Opened by omgMath over 4 years ago - 6 comments
Labels: Bug, Missing-data, Strings

#42863 - ArrowStringArray: delay/conditionally import pyarrow.compute

Issue - State: open - Opened by jorisvandenbossche over 4 years ago - 4 comments
Labels: Bug, Strings, Compat, Closing Candidate

#42664 - BUG: `DataFrame.to_parquet` doesn't round-trip pyarrow StringDtype

Issue - State: open - Opened by TomAugspurger over 4 years ago - 16 comments
Labels: Docs, Strings, IO Parquet

#41973 - BUG: numpy.char funcs error with string array citing non-string array.

Issue - State: open - Opened by attack68 over 4 years ago
Labels: Bug, Strings, Compat, ufuncs

#37963 - BUG: Some string methods treat "." as regex, others don't

Issue - State: open - Opened by giangiacomosanna about 5 years ago - 2 comments
Labels: Docs, Strings, API - Consistency

#37893 - BUG: fixed length string data type specifiers, are ignored on astype calls, if they are passed in a dictionary

Issue - State: closed - Opened by okz about 5 years ago - 1 comment
Labels: Bug, Dtype Conversions, Strings

#33269 - Consider adding isnative property to StringDtype?

Issue - State: open - Opened by krassowski over 5 years ago - 1 comment
Labels: Enhancement, Strings, Needs Discussion, ExtensionArray, Closing Candidate

#32993 - df.drop_duplicates() not working as expected

Issue - State: closed - Opened by kenjioman over 5 years ago - 9 comments
Labels: Bug, Strings, duplicated

#32835 - Feature request: Allow regex flags for `str.split`

Issue - State: closed - Opened by tsoernes over 5 years ago - 2 comments
Labels: Enhancement, Strings

#32233 - [FEATURE] Ability to pass regex module flags in DataFrame.replace while regex=True

Issue - State: open - Opened by vishnu-dev over 5 years ago - 2 comments
Labels: Enhancement, Strings, replace

#31516 - query works for string data type columns with missing values but not for boolean indexing

Issue - State: open - Opened by tdpetrou almost 6 years ago - 2 comments
Labels: Strings, good first issue, ExtensionArray, Needs Tests, expressions

#31356 - AssertionError creating array with StringDtype instead of StringDtype(), can we have a friendlier error message?

Issue - State: closed - Opened by gerritholl almost 6 years ago - 1 comment
Labels: Enhancement, Error Reporting, Strings

#25353 - astype(str) / astype_unicode: np.nan converted to "nan" (checknull, skipna)

Issue - State: closed - Opened by ThibTrip almost 7 years ago - 18 comments
Labels: Bug, Missing-data, Strings, Astype

#23800 - ENH: Add `pd.DataFrame.strformat` method for simplified string series construction

Issue - State: closed - Opened by ms7463 about 7 years ago - 3 comments
Labels: Enhancement, Strings, Needs Discussion, DataFrame

#22613 - Series.str.decode() turns arrays of strings to NaN and fails on byte strings

Issue - State: closed - Opened by abrakababra about 7 years ago - 10 comments
Labels: Strings, good first issue, Needs Tests

#22496 - Add support for "regex" library

Issue - State: open - Opened by pmav99 about 7 years ago - 22 comments
Labels: Enhancement, Strings

#19671 - auto convert from string to datetime64 in iterrows.

Issue - State: closed - Opened by xiaoluffy almost 8 years ago - 7 comments
Labels: Bug, Reshaping, Dtype Conversions, Strings

#13992 - SeriesGroupby.cumsum raises on object dtype

Issue - State: closed - Opened by agraboso over 9 years ago - 3 comments
Labels: Bug, Groupby, Strings