An open API service for providing issue and pull request metadata for open source projects.

GitHub / pandas-dev/pandas issues and pull requests

Labelled with: Needs Discussion

#62309 - PERF: `to_numpy` is >3 orders of magnitude slower on tz aware datetimes

Issue - State: open - Opened by joshdunnlime 3 months ago - 2 comments
Labels: Performance, Needs Discussion, Warnings

#62268 - ENH: Make ExtensionDtype.numpy_dtype part of the interface?

Issue - State: closed - Opened by mroeschke 3 months ago - 3 comments
Labels: Enhancement, Needs Discussion, ExtensionArray

#61890 - Output formatting: the repr of the Categorical categories (quoted or unquoted strings?)

Issue - State: open - Opened by jorisvandenbossche 4 months ago
Labels: Output-Formatting, Categorical, Needs Discussion

#61825 - PERF: Brainstorming read_csv perf improvements

Issue - State: open - Opened by jbrockmendel 5 months ago - 3 comments
Labels: Performance, IO CSV, Needs Discussion

#61819 - BUG: Series created from pre-2.1 legacy pickles lose their names during .copy operations

Issue - State: open - Opened by Liam3851 5 months ago - 2 comments
Labels: Bug, Needs Discussion, IO Pickle

#61809 - BUG: Pandas Series with Xarray slow print time.

Issue - State: open - Opened by chaoyupeng 5 months ago - 1 comment
Labels: Bug, Performance, Output-Formatting, Needs Discussion

#61707 - BUG: .describe() doesn't work for EAs

Issue - State: open - Opened by andrewgsavage 5 months ago - 5 comments
Labels: Bug, Needs Discussion, ExtensionArray

#61618 - Moving to PyArrow dtypes by default

Issue - State: open - Opened by datapythonista 6 months ago
Labels: Needs Discussion

#61538 - usecols investigation for various I/O functions

Issue - State: closed - Opened by eicchen 6 months ago - 1 comment
Labels: IO CSV, Needs Discussion

#61522 - ENH: Implement select to select columns

Issue - State: open - Opened by datapythonista 6 months ago
Labels: Indexing, Needs Discussion

#61502 - BUG: Print alignement problem with some unicode characters

Issue - State: open - Opened by mhooreman 6 months ago - 4 comments
Labels: Bug, Unicode, Needs Discussion

#61496 - BUG: Passing string[pyarrow] to the dtype parameter of e.g. csv_read() does produce a string type Series

Issue - State: open - Opened by ClauPet 6 months ago - 3 comments
Labels: Dtype Conversions, IO CSV, Strings, Needs Discussion, Arrow

#61466 - BUG: Series.str.isdigit with pyarrow dtype doesn't honor unicode superscripts

Issue - State: open - Opened by GarrettWu 6 months ago - 6 comments
Labels: Bug, Strings, Needs Discussion, Upstream issue, Arrow

#61433 - BUG: Some `ExtensionArray`s can return 0-d Elements

Issue - State: open - Opened by ilan-gold 7 months ago - 4 comments
Labels: Bug, Indexing, Needs Discussion, ExtensionArray

#61431 - BUG: documented usage of of `str.split(...).str.get` fails on dtype `large_string[pyarrow]`

Issue - State: open - Opened by SandroCasagrande 7 months ago - 9 comments
Labels: Bug, Strings, Needs Discussion

#61425 - BUG(string dtype): Arithmetic operations between Series with string dtype index

Issue - State: open - Opened by rhshadrach 7 months ago - 2 comments
Labels: Bug, Strings, Needs Discussion, API - Consistency

#61223 - BUG: setting item to iterable with .at fails when column doesn't exist or has wrong dtype

Issue - State: open - Opened by jbogar 8 months ago - 6 comments
Labels: Bug, Indexing, Needs Discussion

#61182 - BUG: Negation of `.str.isnumeric()` changes `dtype` when `pd.NA` is present

Issue - State: closed - Opened by noahblakesmith 8 months ago - 9 comments
Labels: Bug, Missing-data, Strings, Needs Discussion, Closing Candidate

#61177 - BUG: Inconsistency when converting week PeriodIndex into DatetimeIndex with astype("datetime64[ns]")

Issue - State: open - Opened by dbalabka 8 months ago - 4 comments
Labels: Bug, Datetime, Dtype Conversions, Period, Needs Discussion

#61157 - Preserve Complex Data Types for to_csv

Pull Request - State: closed - Opened by Jaspvr 8 months ago - 4 comments
Labels: Enhancement, IO CSV, Complex, Needs Discussion

#61145 - BUG: Inconsistent index type when using read_csv with string[pyarrow] dtype

Issue - State: open - Opened by ptth222 8 months ago - 4 comments
Labels: Bug, IO CSV, Needs Discussion, API - Consistency

#60973 - DOC: No warning in set_index() that previous index column is removed.

Issue - State: closed - Opened by ncotie 9 months ago - 3 comments
Labels: Docs, Indexing, Needs Discussion

#60920 - API: Design questions for HDFStore.append

Issue - State: open - Opened by JakeTT404 10 months ago - 3 comments
Labels: API Design, IO HDF5, Needs Discussion

#60816 - BUG: Union of two DateTimeIndexes is incorrectly calculated

Issue - State: open - Opened by filmor 10 months ago - 5 comments
Labels: Bug, Regression, Needs Discussion, Non-Nano

#60702 - ENH: Enable .mode to sort with NA values

Pull Request - State: closed - Opened by rhshadrach 11 months ago - 7 comments
Labels: Enhancement, Algos, Needs Discussion

#60702 - ENH: Enable .mode to sort with NA values

Pull Request - State: closed - Opened by rhshadrach 11 months ago - 3 comments
Labels: Enhancement, Algos, Needs Discussion

#60639 - API (string dtype): comparisons between different string classes

Issue - State: closed - Opened by rhshadrach 11 months ago - 27 comments
Labels: Numeric Operations, Strings, Needs Discussion, API - Consistency

#60622 - [WIP] `df.apply`: add support for `engine='bodo'`

Pull Request - State: closed - Opened by scott-routledge2 11 months ago - 5 comments
Labels: Needs Discussion

#60593 - API: `future.default_arg` for method-specific behavior changes

Issue - State: open - Opened by rhshadrach 11 months ago - 11 comments
Labels: API Design, Deprecate, Needs Discussion

#60560 - BUG: inconsistent return types from __getitem__ vs iteration

Issue - State: open - Opened by gboeing 12 months ago - 6 comments
Labels: Bug, Needs Discussion, API - Consistency

#60479 - ENH: Support pd.json_normalize for normalizing only meta fields

Issue - State: open - Opened by Ynjxsjmh 12 months ago - 4 comments
Labels: Enhancement, IO JSON, Needs Discussion

#60194 - BUG: Index.equals is not commutative for string and category dtypes

Issue - State: closed - Opened by DhruvBShetty about 1 year ago - 7 comments
Labels: Bug, Needs Discussion, ExtensionArray

#60106 - BUG: isna() does not catch np.NaN when datatype is Float64

Issue - State: closed - Opened by mortnstak about 1 year ago - 5 comments
Labels: Bug, Missing-data, Needs Discussion, NA - MaskedArrays, PDEP missing values

#59976 - DOC: Improve docs in the dt namespace

Issue - State: open - Opened by rhshadrach about 1 year ago
Labels: Docs, Needs Discussion

#59874 - ENH: Droping consecutive duplicates

Issue - State: closed - Opened by Yehuda-Bergstein about 1 year ago - 3 comments
Labels: Enhancement, Needs Discussion, Closing Candidate

#59839 - ENH: add a comments variable to pandas.DataFrame.to_csv

Issue - State: closed - Opened by ebo about 1 year ago - 11 comments
Labels: Enhancement, IO CSV, Needs Discussion, Closing Candidate

#59831 - ENH: Restore the functionality of `.fillna`

Issue - State: closed - Opened by tomprimozic about 1 year ago - 5 comments
Labels: Enhancement, Missing-data, Dtype Conversions, Needs Discussion, Needs Info, Closing Candidate

#59788 - DOC: Series.update throws a `FutureWarning` about `def[col] = df[col].method` but `.update` returns `None` and works `inplace`

Issue - State: open - Opened by spawn-guy about 1 year ago - 10 comments
Labels: Needs Discussion, Warnings, Copy / view semantics, combine/combine_first/update

#59780 - Consider moving pyarrow's pandas compatibility and conversion code to the pandas project?

Issue - State: open - Opened by jorisvandenbossche about 1 year ago
Labels: Needs Discussion, Arrow

#59775 - BUG: pd.concat() cannot concatenate more than 3 mixed-frequency dataframes with multi-index series

Issue - State: open - Opened by sakaiando about 1 year ago - 2 comments
Labels: Bug, Reshaping, Period, Needs Discussion

#59771 - ENH: ArrowTemporalProperties' object has no attribute 'to_period'

Issue - State: open - Opened by dbalabka about 1 year ago - 4 comments
Labels: Enhancement, Datetime, Needs Discussion, Arrow

#59737 - ENH: DataFrame.from_dict doesn't work with collections.UserDict objects

Issue - State: open - Opened by mesvam about 1 year ago - 6 comments
Labels: Enhancement, Needs Discussion, Constructors

#59585 - ENH: `DataFrame.struct.explode(column, *, separator=".")` method to pull struct subfields into the parent DataFrame

Issue - State: open - Opened by tswast over 1 year ago - 2 comments
Labels: Enhancement, Needs Discussion, Needs Info, Arrow

#59436 - BUG: preserve dtype to `bool[pyarrow]` when calling pyarrow backed `Series.isna()`

Pull Request - State: closed - Opened by KevsterAmp over 1 year ago - 3 comments
Labels: Needs Discussion, pyarrow dtype retention

#59436 - BUG: preserve dtype to `bool[pyarrow]` when calling pyarrow backed `Series.isna()`

Pull Request - State: closed - Opened by KevsterAmp over 1 year ago - 3 comments
Labels: Needs Discussion, pyarrow dtype retention

#58913 - BUG: DataFrame(data, ...) creates a copy when 'data' is a NumPy array (pandas 3.0+)

Issue - State: closed - Opened by jameslamb over 1 year ago - 9 comments
Labels: Bug, Needs Discussion, Constructors, Copy / view semantics

#58872 - BUG: Pandas Styler HTML not rendering properly

Issue - State: closed - Opened by trendyllama over 1 year ago - 4 comments
Labels: Bug, Needs Discussion, Styler, Closing Candidate

#58870 - ENH: skiprows after header in read_csv

Issue - State: closed - Opened by davetapley over 1 year ago - 2 comments
Labels: Enhancement, IO CSV, Needs Discussion

#58843 - ENH: Implemented det and rref for frame.py(determinant and row reduced form)

Pull Request - State: closed - Opened by DuarteMarques510 over 1 year ago - 4 comments
Labels: Needs Discussion

#58843 - ENH: Implemented det and rref for frame.py(determinant and row reduced form)

Pull Request - State: closed - Opened by DuarteMarques510 over 1 year ago - 4 comments
Labels: Needs Discussion

#58816 - DOC: Insufficient Project Background Information

Issue - State: closed - Opened by kevinliu249 over 1 year ago - 2 comments
Labels: Docs, Needs Discussion, Closing Candidate

#58772 - Make specific pandas dataframe column immuteable / not changeable

Issue - State: closed - Opened by weidinger-c over 1 year ago - 3 comments
Labels: Enhancement, Needs Discussion, Closing Candidate

#58636 - ENH: Allow `select_dtypes("category")` to identify `ArrowDtype(pa.dictionary))`

Issue - State: open - Opened by nachomaiz over 1 year ago - 2 comments
Labels: Enhancement, Categorical, Needs Discussion, Arrow

#58613 - Default string dtype (PDEP-14): naming convention to distinguish the dtype variants

Issue - State: closed - Opened by jorisvandenbossche over 1 year ago - 28 comments
Labels: API Design, Strings, Needs Discussion

#58547 - ENH: Add totality validation to merge method

Issue - State: closed - Opened by z3rone over 1 year ago - 6 comments
Labels: Enhancement, Reshaping, Needs Discussion

#58247 - DEPR: ExcelFile.parse

Issue - State: open - Opened by rhshadrach over 1 year ago
Labels: IO Excel, Deprecate, Needs Discussion

#58234 - ENH: sum() should default to numeric_only=True

Issue - State: closed - Opened by travisturenne over 1 year ago - 6 comments
Labels: Enhancement, API Design, Needs Discussion, Closing Candidate

#58225 - ENH: Should we support aggregating by-frame in DataFrameGroupBy.agg

Issue - State: open - Opened by rhshadrach over 1 year ago
Labels: Enhancement, Groupby, Needs Discussion, Apply

#58214 - DEPR: rename 'dtype_backend'

Issue - State: closed - Opened by jbrockmendel over 1 year ago - 9 comments
Labels: Bug, API Design, Deprecate, Needs Discussion

#58071 - BUG: TypeError("type Timedelta doesn't define __round__ method")

Issue - State: closed - Opened by JustAnotherVeryNormalDeveloper over 1 year ago - 7 comments
Labels: Enhancement, Timedelta, Needs Discussion

#57765 - BUG: Setting a numpy array as a column in Pandas uses only the first column of the array.

Issue - State: open - Opened by kgourgou over 1 year ago - 5 comments
Labels: Bug, Indexing, Needs Discussion

#57750 - BUG: .loc operation cannot locate existing index when having single string as index for dataframe ('string',)

Issue - State: open - Opened by carlonlv over 1 year ago - 4 comments
Labels: Bug, Indexing, Needs Discussion

#57736 - ENH: Assign for pandas Series

Issue - State: closed - Opened by cbrummitt over 1 year ago - 4 comments
Labels: Enhancement, Reshaping, Needs Discussion, Series, Closing Candidate

#57706 - BUG: groupby.agg should always agg

Pull Request - State: open - Opened by rhshadrach over 1 year ago - 6 comments
Labels: Enhancement, Groupby, API Design, Needs Discussion, Apply

#57706 - BUG: groupby.agg should always agg

Pull Request - State: closed - Opened by rhshadrach over 1 year ago - 7 comments
Labels: Enhancement, Groupby, API Design, Needs Discussion, Apply

#57661 - BUG: Transform() function returns unexpected results with list

Issue - State: open - Opened by J-ZW-Wang over 1 year ago - 3 comments
Labels: Bug, Groupby, Needs Discussion, Transformations

#57639 - Create median_absolute_deviation

Pull Request - State: closed - Opened by VasudevanS1906 over 1 year ago - 1 comment
Labels: Enhancement, Needs Discussion

#57607 - BUG: Possible bad api side effect of replacing FrozenList with tuple

Issue - State: closed - Opened by dalejung over 1 year ago - 8 comments
Labels: Bug, Blocker, Needs Discussion, Index

#57565 - DOC: Series.diff with boolean dtype does not return a series of dtype float

Issue - State: open - Opened by from-nowhere almost 2 years ago - 4 comments
Labels: Docs, Dtype Conversions, Needs Discussion, Transformations, Nullable by default

#57525 - ENH: Better documentation or default behavior for GroupBy for columns with non-sortable values

Issue - State: open - Opened by gabuzi almost 2 years ago - 6 comments
Labels: Enhancement, Groupby, Needs Discussion, Sorting

#57521 - BUG: pandas >= 2.0 parses 'May' in ambiguous way.

Issue - State: closed - Opened by ffyring almost 2 years ago - 8 comments
Labels: Bug, Needs Discussion, datetime.date

#57437 - BUG: Future limitation to prevent automatic type cast makes instantiation of empty series impossible

Issue - State: closed - Opened by KingOtto123 almost 2 years ago - 10 comments
Labels: Needs Discussion, Closing Candidate, PDEP6-related

#57335 - DEPR: pd.concat special cases DatetimeIndex to sort even when sort=False

Issue - State: open - Opened by lukemanley almost 2 years ago
Labels: Deprecate, Needs Discussion

#57303 - ENH: allows word / PPT / outlook formatting as output of to_clipboard

Issue - State: closed - Opened by lcrmorin almost 2 years ago - 3 comments
Labels: Enhancement, IO Data, Needs Discussion, Styler, Closing Candidate

#57181 - ENH: Add a functionality to scan csv files and show some useful info (eg-the record count) of a CSV file without loading entire file.

Issue - State: closed - Opened by KelumPerera almost 2 years ago - 3 comments
Labels: Enhancement, IO CSV, Needs Discussion

#57073 - DISC: Consider not requiring PyArrow in 3.0

Issue - State: open - Opened by MarcoGorelli almost 2 years ago - 68 comments
Labels: Needs Discussion

#56992 - ENH: Add groupby(...).agg_index

Pull Request - State: closed - Opened by rhshadrach almost 2 years ago - 7 comments
Labels: Enhancement, Groupby, Needs Discussion

#56992 - ENH: Add groupby(...).agg_index

Pull Request - State: closed - Opened by rhshadrach almost 2 years ago - 7 comments
Labels: Enhancement, Groupby, Needs Discussion

#56920 - ENH: Allow users to disable PerformanceWarning

Issue - State: closed - Opened by rhshadrach almost 2 years ago - 1 comment
Labels: Enhancement, Needs Discussion, Warnings

#56732 - Move interchange protocol implementation into a separate project

Issue - State: open - Opened by phofl almost 2 years ago
Labels: Needs Discussion, Interchange

#56619 - ENH/PERF: dispatch is_monotonic_increasing / decreasing ?

Issue - State: open - Opened by lukemanley almost 2 years ago - 4 comments
Labels: Performance, Needs Discussion, ExtensionArray

#56553 - ENH: Allow to set axis name in pd.concat

Issue - State: open - Opened by mullimanko almost 2 years ago - 1 comment
Labels: Enhancement, Reshaping, Needs Discussion

#56542 - DEPR: Deprecate xlrd and pyxlsb excel engines

Issue - State: open - Opened by phofl almost 2 years ago - 4 comments
Labels: IO Excel, Deprecate, Needs Discussion

#56533 - ENH: Add limit_area to groupby.ffill/bfill

Issue - State: open - Opened by rhshadrach almost 2 years ago
Labels: Enhancement, Groupby, Missing-data, Needs Discussion, API - Consistency

#56518 - DEPR: SparseDtype

Issue - State: open - Opened by jbrockmendel almost 2 years ago - 1 comment
Labels: Sparse, Deprecate, Needs Discussion

#56498 - ENH: `DataFrame.assign` allow passing dictionary directly

Issue - State: open - Opened by randolf-scholz almost 2 years ago
Labels: Enhancement, Needs Discussion

#56012 - DEPR: Series and Index shouldn't do inference on pandas objects

Issue - State: closed - Opened by phofl about 2 years ago - 2 comments
Labels: Deprecate, Needs Discussion

#56007 - DEPR: DataFrame.value_counts(sort=False) sorts

Issue - State: open - Opened by rhshadrach about 2 years ago
Labels: Algos, Deprecate, Needs Discussion

#55842 - Docs say `Series()` accepts an iterable, but all iterables are not accepted

Issue - State: closed - Opened by Dr-Irv about 2 years ago - 2 comments
Labels: Docs, Needs Discussion

#55571 - ENH: Add Zstd Compression Option for Parquet Output

Issue - State: open - Opened by alexozwald about 2 years ago - 1 comment
Labels: Enhancement, Needs Discussion, IO Parquet

#55551 - REF: groupby internal names

Issue - State: open - Opened by rhshadrach about 2 years ago - 2 comments
Labels: Refactor, Groupby, Needs Discussion

#55493 - STY: avoid accessing private attributes/methods/functions

Issue - State: open - Opened by jbrockmendel about 2 years ago
Labels: Code Style, Needs Discussion

#55462 - BUG: there is no year 0

Issue - State: open - Opened by jbrockmendel about 2 years ago - 8 comments
Labels: Timeseries, Needs Discussion

#55413 - DEPR: Index.format?

Issue - State: closed - Opened by jbrockmendel about 2 years ago - 9 comments
Labels: Deprecate, Needs Discussion, Index

#55312 - REF: Add tests.groupby.methods

Pull Request - State: closed - Opened by rhshadrach about 2 years ago - 8 comments
Labels: Refactor, Testing, Groupby, Needs Discussion

#55289 - ENH: Improve Filter function with Filter_Columns and Filter_Rows

Issue - State: open - Opened by speed650 about 2 years ago - 12 comments
Labels: Enhancement, Needs Discussion, Filters

#55264 - DEPR: is_decimal, is_interval

Issue - State: closed - Opened by jbrockmendel about 2 years ago - 3 comments
Labels: Deprecate, Needs Discussion

#55261 - ENH: Respect observed=False in groupby for non-categoricals

Issue - State: open - Opened by rhshadrach about 2 years ago
Labels: Enhancement, Groupby, Needs Discussion

#55226 - ENH: Support for list of formats in pd.to_datetime()

Issue - State: open - Opened by NickiForte about 2 years ago - 2 comments
Labels: Enhancement, Timeseries, Needs Discussion

#55196 - BUG: ExcelWriter set_column with num_format datetime doesn't work (but it works for xlsxwriter)

Issue - State: open - Opened by lrisch about 2 years ago - 12 comments
Labels: Bug, IO Excel, Needs Discussion, Closing Candidate