An open API service for providing issue and pull request metadata for open source projects.

GitHub / pandas-dev/pandas issues and pull requests

Labelled with: Performance

#62694 - PERF: rolling std/var x100 slower in current dev version

Issue - State: open - Opened by yolking about 1 month ago - 1 comment
Labels: Performance, Needs Triage

#62309 - PERF: `to_numpy` is >3 orders of magnitude slower on tz aware datetimes

Issue - State: open - Opened by joshdunnlime 2 months ago - 2 comments
Labels: Performance, Needs Discussion, Warnings

#62244 - PERF: `DataFrame.unstack()` and `DataFrame.pivot_table()` upcasting take up more memory than needed

Issue - State: closed - Opened by pabloknecht 3 months ago - 10 comments
Labels: Performance, Needs Triage, Nullable by default

#62119 - PERF: Pandas parser lib takes 50MB of memory on Windows

Issue - State: open - Opened by LucaCerina 3 months ago
Labels: Performance, Needs Triage

#61825 - PERF: Brainstorming read_csv perf improvements

Issue - State: open - Opened by jbrockmendel 4 months ago - 3 comments
Labels: Performance, IO CSV, Needs Discussion

#61809 - BUG: Pandas Series with Xarray slow print time.

Issue - State: open - Opened by chaoyupeng 4 months ago - 1 comment
Labels: Bug, Performance, Output-Formatting, Needs Discussion

#61783 - PERF: Unnecessary string interning in read_csv?

Issue - State: closed - Opened by jbrockmendel 5 months ago - 2 comments
Labels: Performance, Needs Triage

#61764 - ENH: speed up wide DataFrame.line plots by using a single LineCollection

Pull Request - State: open - Opened by EvMossan 5 months ago
Labels: Visualization, Performance

#61747 - PERF: Arrow dtypes are much slower than Numpy for DataFrame.apply

Issue - State: open - Opened by ehsantn 5 months ago - 8 comments
Labels: Performance, Apply, Arrow

#61532 - ENH: speed up `DataFrame.plot` using `LineCollection`

Issue - State: open - Opened by Abdelgha-4 6 months ago - 5 comments
Labels: Visualization, Performance

#61460 - PERF: Windows Unit Test Timing Out

Issue - State: open - Opened by MartinBraquet 6 months ago
Labels: Performance, Needs Triage

#61456 - PERF: Setting an item of incompatible dtype

Issue - State: open - Opened by muhannad125 6 months ago - 1 comment
Labels: Indexing, Performance, Needs Info

#61320 - PERF: Restore old performances with .isin() on columns typed as np.ui…

Pull Request - State: closed - Opened by pbrochart 7 months ago - 3 comments
Labels: Performance, Regression, isin

#61320 - PERF: Restore old performances with .isin() on columns typed as np.ui…

Pull Request - State: closed - Opened by pbrochart 7 months ago - 2 comments
Labels: Performance, Regression, isin

#61251 - PERF: future_stack is too slow

Issue - State: closed - Opened by auderson 8 months ago - 1 comment
Labels: Performance, Needs Triage

#61116 - Bug: Save original index and remap after function completes

Pull Request - State: closed - Opened by Jeffrharr 8 months ago - 3 comments
Labels: Performance, Filters

#61116 - Bug: Save original index and remap after function completes

Pull Request - State: closed - Opened by Jeffrharr 8 months ago - 4 comments
Labels: Performance, Filters

#61076 - PERF: why nlargest is so slower?

Issue - State: open - Opened by ZGarry 9 months ago
Labels: Performance, Needs Triage

#61046 - perf: improve membership check performance in column filtering

Pull Request - State: closed - Opened by allrob23 9 months ago - 6 comments
Labels: Performance, IO CSV

#61046 - perf: improve membership check performance in column filtering

Pull Request - State: closed - Opened by allrob23 9 months ago - 7 comments
Labels: Performance, IO CSV

#61045 - PERF: Optimize membership check in column filtering for better performance

Issue - State: closed - Opened by allrob23 9 months ago
Labels: Performance, IO CSV

#61014 - PERF: use `blk.dtype` in `where()` & `_setitem_frame()`

Pull Request - State: closed - Opened by auderson 9 months ago - 4 comments
Labels: Performance

#61010 - PERF: bottleneck in `where()`

Issue - State: closed - Opened by auderson 9 months ago - 5 comments
Labels: Performance, Needs Triage

#60871 - PERF: Fix groupby skipna performance

Pull Request - State: closed - Opened by snitish 10 months ago - 2 comments
Labels: Groupby, Performance

#60871 - PERF: Fix groupby skipna performance

Pull Request - State: closed - Opened by snitish 10 months ago - 1 comment
Labels: Groupby, Performance

#60870 - PERF: Regression in groupby ops from adding skipna

Issue - State: closed - Opened by rhshadrach 10 months ago - 2 comments
Labels: Groupby, Missing-data, Performance, Regression

#60778 - PERF: Avoid a numpy array copy in ArrowExtensionArray._to_datetimearray

Pull Request - State: closed - Opened by mroeschke 10 months ago - 1 comment
Labels: Performance, Arrow

#60778 - PERF: Avoid a numpy array copy in ArrowExtensionArray._to_datetimearray

Pull Request - State: open - Opened by mroeschke 10 months ago
Labels: Performance, Arrow

#60689 - PERF: Fix ASV CSV benchmarks

Pull Request - State: closed - Opened by rhshadrach 10 months ago - 1 comment
Labels: Performance, IO CSV

#60689 - PERF: Fix ASV CSV benchmarks

Pull Request - State: closed - Opened by rhshadrach 10 months ago - 1 comment
Labels: Performance, IO CSV

#60483 - Backport PR #60461 on branch 2.3.x (PERF: improve construct_1d_object_array_from_listlike)

Pull Request - State: closed - Opened by meeseeksmachine 12 months ago
Labels: Performance, Constructors

#60483 - Backport PR #60461 on branch 2.3.x (PERF: improve construct_1d_object_array_from_listlike)

Pull Request - State: closed - Opened by meeseeksmachine 12 months ago
Labels: Performance, Constructors

#60469 - PERF: regression on mean(axis=1) compared to old pandas version

Issue - State: open - Opened by Remy-ooo 12 months ago - 7 comments
Labels: Performance, Copy / view semantics

#60461 - PERF: improve construct_1d_object_array_from_listlike

Pull Request - State: closed - Opened by jorisvandenbossche 12 months ago - 3 comments
Labels: Performance, Constructors

#60461 - PERF: improve construct_1d_object_array_from_listlike

Pull Request - State: closed - Opened by jorisvandenbossche 12 months ago - 4 comments
Labels: Performance, Constructors

#60213 - PERF: index.unique much slower than get_level_values.drop_duplicates

Issue - State: open - Opened by jacek-pliszka about 1 year ago - 1 comment
Labels: Performance, Algos, Closing Candidate

#60203 - PERF: selection inside a row using a list of labels 30 slower than selection by the same individual labels

Issue - State: closed - Opened by xstreck1 about 1 year ago - 3 comments
Labels: Indexing, Performance, Closing Candidate

#60179 - PERF: `dropna` with `SparseArray` experiments a much worse time complexity

Issue - State: open - Opened by mflova about 1 year ago
Labels: Performance, Needs Triage

#60121 - PERF: faster _coerce_to_data_and_mask() for astype("Float64")

Pull Request - State: closed - Opened by auderson about 1 year ago - 4 comments
Labels: Performance, NA - MaskedArrays

#60121 - PERF: faster _coerce_to_data_and_mask() for astype("Float64")

Pull Request - State: closed - Opened by auderson about 1 year ago - 3 comments
Labels: Performance, NA - MaskedArrays

#60098 - PERF: Slowdowns with .isin() on columns typed as np.uint64

Issue - State: open - Opened by adrian17 about 1 year ago
Labels: Performance, Needs Triage

#60075 - PERF: DataFrame fragmentation when calling DataFrame.assign() with large kwargs

Issue - State: open - Opened by y-koj about 1 year ago
Labels: Performance, Needs Triage

#60070 - PERF: Extreme regression in 2.2 when aggregating a DataFrame with .attrs data

Issue - State: open - Opened by buhrmann about 1 year ago
Labels: Performance, Needs Triage

#60066 - PERF: df.astype("float64[pyarrow]") is slow, df.astype("Float64") is super slow

Issue - State: closed - Opened by auderson about 1 year ago - 4 comments
Labels: Performance, good first issue, ExtensionArray, Constructors

#59992 - Potential performance regression with "API: value_counts to consistently maintain order of input"

Issue - State: closed - Opened by DeaMariaLeon about 1 year ago - 1 comment
Labels: Groupby, Performance, Algos

#59969 - PERF: Possible Memory Leak when Importing Parquet File with PyArrow Engine in Pandas

Issue - State: closed - Opened by Voltagabbana about 1 year ago - 3 comments
Labels: Performance, Needs Info

#59657 - PERF: Melt 2x slower when future.infer_string option enabled

Issue - State: closed - Opened by maver1ck about 1 year ago - 4 comments
Labels: Performance, Reshaping, Strings

#59647 - PERF: CategoricalDtype.update_dtype

Pull Request - State: closed - Opened by mroeschke about 1 year ago - 2 comments
Labels: Performance, Categorical

#59647 - PERF: CategoricalDtype.update_dtype

Pull Request - State: closed - Opened by mroeschke about 1 year ago - 2 comments
Labels: Performance, Categorical

#59643 - PERF: Improve efficiency of `BlockValuesRefs._clear_dead_references(...)`

Pull Request - State: closed - Opened by Tolker-KU about 1 year ago
Labels: Performance, Copy / view semantics

#59643 - PERF: Improve efficiency of `BlockValuesRefs._clear_dead_references(...)`

Pull Request - State: closed - Opened by Tolker-KU about 1 year ago - 1 comment
Labels: Performance, Copy / view semantics

#59630 - PERF: avoid unnecessary copy

Pull Request - State: closed - Opened by jbrockmendel about 1 year ago - 2 comments
Labels: Performance, Strings

#59630 - PERF: avoid unnecessary copy

Pull Request - State: closed - Opened by jbrockmendel about 1 year ago - 3 comments
Labels: Performance, Strings

#59608 - PERF: Performance Improvement on `DataFrame.to_csv()` when `index=False`

Pull Request - State: closed - Opened by KevsterAmp about 1 year ago - 2 comments
Labels: Performance, IO CSV

#59608 - PERF: Performance Improvement on `DataFrame.to_csv()` when `index=False`

Pull Request - State: closed - Opened by KevsterAmp about 1 year ago - 3 comments
Labels: Performance, IO CSV

#59598 - PERF: Improve efficiency of `BlockValuesRefs`

Pull Request - State: closed - Opened by Tolker-KU about 1 year ago - 2 comments
Labels: Enhancement, Performance, Copy / view semantics

#59598 - PERF: Improve efficiency of `BlockValuesRefs`

Pull Request - State: closed - Opened by Tolker-KU about 1 year ago - 3 comments
Labels: Enhancement, Performance, Copy / view semantics

#59578 - PERF: Only clear cached .levels when setting MultiIndex.names

Pull Request - State: open - Opened by mroeschke about 1 year ago
Labels: Performance, MultiIndex

#59578 - PERF: Only clear cached .levels when setting MultiIndex.names

Pull Request - State: closed - Opened by mroeschke about 1 year ago - 1 comment
Labels: Performance, MultiIndex

#59573 - PERF: avoid calling `DataFrame.dtypes` in loop

Pull Request - State: closed - Opened by messense about 1 year ago
Labels: Performance

#59573 - PERF: avoid calling `DataFrame.dtypes` in loop

Pull Request - State: closed - Opened by messense about 1 year ago - 1 comment
Labels: Performance

#59565 - PERF: Native version of `is_hashable`

Pull Request - State: closed - Opened by Tolker-KU about 1 year ago - 6 comments
Labels: Performance, hashing, Stale

#59565 - PERF: Native version of `is_hashable`

Pull Request - State: closed - Opened by Tolker-KU about 1 year ago - 6 comments
Labels: Performance, hashing, Stale

#59494 - PERF: Excessive memory consumption in pd.read_parquet

Issue - State: closed - Opened by cotZhong over 1 year ago - 3 comments
Labels: Performance, Needs Info, IO Parquet

#59312 - PERF: Significant Performance Difference in DataFrame.to_csv() with and without Index Reset

Issue - State: closed - Opened by hogan-roblox over 1 year ago - 2 comments
Labels: Performance, IO CSV

#59279 - PERF:

Issue - State: open - Opened by buhtz over 1 year ago
Labels: Performance, Needs Triage

#59272 - BUG: CoW does not seem to work on an index with duplicated labels

Issue - State: closed - Opened by arnaudlegout over 1 year ago - 11 comments
Labels: Performance, Copy / view semantics, Closing Candidate

#59180 - PERF: to_excel slowing runtime

Issue - State: closed - Opened by OnindoK over 1 year ago - 2 comments
Labels: Performance, IO Excel, Needs Info

#59178 - PERF: Don't create a CategoricalIndex._engine in __contains__ if categories are RangeIndex

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Performance, Categorical

#59178 - PERF: Don't create a CategoricalIndex._engine in __contains__ if categories are RangeIndex

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Performance, Categorical

#59025 - PREF: Fix regression from #58984

Pull Request - State: closed - Opened by luke396 over 1 year ago - 2 comments
Labels: Groupby, Performance

#59025 - PREF: Fix regression from #58984

Pull Request - State: closed - Opened by luke396 over 1 year ago - 1 comment
Labels: Groupby, Performance

#59002 - Backport PR #58992 on branch 2.2.x (PERF: cache plotting date locators for DatetimeIndex plotting)

Pull Request - State: closed - Opened by meeseeksmachine over 1 year ago
Labels: Visualization, Performance

#59002 - Backport PR #58992 on branch 2.2.x (PERF: cache plotting date locators for DatetimeIndex plotting)

Pull Request - State: closed - Opened by meeseeksmachine over 1 year ago
Labels: Visualization, Performance

#58992 - PERF: cache plotting date locators for DatetimeIndex plotting

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Visualization, Performance

#58992 - PERF: cache plotting date locators for DatetimeIndex plotting

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 3 comments
Labels: Visualization, Performance

#58973 - PERF/CLN: Avoid ravel in plotting

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Visualization, Performance

#58973 - PERF/CLN: Avoid ravel in plotting

Pull Request - State: open - Opened by mroeschke over 1 year ago
Labels: Visualization, Performance

#58972 - PERF: Use reshape instead of ravel/flatten

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Performance, Code Style

#58972 - PERF: Use reshape instead of ravel/flatten

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Performance, Code Style

#58963 - PERF: 100% CPU utilization with matplotlib (macosx)

Issue - State: closed - Opened by cinsk over 1 year ago - 1 comment
Labels: Performance, Needs Triage

#58960 - PERF: Use shallow copies/defer copies in io

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: IO Data, Performance

#58960 - PERF: Use shallow copies/defer copies in io

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: IO Data, Performance

#58958 - PERF: Only copy in plotting when needed

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Visualization, Performance

#58956 - PERF: groupby is significantly slower for `DatetimeIndex` with timezone

Issue - State: closed - Opened by veenstrajelmer over 1 year ago - 2 comments
Labels: Performance, Needs Triage

#58817 - PERF: future_stack=True with non-MulitIndex columns

Pull Request - State: closed - Opened by rhshadrach over 1 year ago - 4 comments
Labels: Performance, Reshaping, Stale

#58817 - PERF: future_stack=True with non-MulitIndex columns

Pull Request - State: open - Opened by rhshadrach over 1 year ago
Labels: Performance, Reshaping

#58733 - PERF: Eliminate circular references in accessor attributes

Pull Request - State: closed - Opened by mroeschke over 1 year ago
Labels: Performance, Accessors

#58733 - PERF: Eliminate circular references in accessor attributes

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 2 comments
Labels: Performance, Accessors

#58411 - PERF: MultiIndex._engine use smaller dtypes

Pull Request - State: closed - Opened by GianlucaFicarelli over 1 year ago - 1 comment
Labels: Performance, MultiIndex

#58411 - PERF: MultiIndex._engine use smaller dtypes

Pull Request - State: closed - Opened by GianlucaFicarelli over 1 year ago - 2 comments
Labels: Performance, MultiIndex

#58391 - PERF: df.unstack() is 500 times slower since pandas>=2.1

Issue - State: open - Opened by sbonz over 1 year ago
Labels: Performance, Needs Triage

#58385 - PERF: MultiIndex.memory_usage shouldn't trigger the index engine

Pull Request - State: closed - Opened by GianlucaFicarelli over 1 year ago
Labels: Performance, Index

#58385 - PERF: MultiIndex.memory_usage shouldn't trigger the index engine

Pull Request - State: closed - Opened by GianlucaFicarelli over 1 year ago - 1 comment
Labels: Performance, Index

#58376 - PERF: RangeIndex.value_counts/searchsorted/to_numpy

Pull Request - State: open - Opened by mroeschke over 1 year ago
Labels: Performance, Index

#58376 - PERF: RangeIndex.value_counts/searchsorted/to_numpy

Pull Request - State: closed - Opened by mroeschke over 1 year ago - 1 comment
Labels: Performance, Index

#58288 - Potential regression induced by commit 1ee5832

Issue - State: closed - Opened by rhshadrach over 1 year ago - 2 comments
Labels: IO Data, Performance, Regression

#58287 - Potential regression induced by commit 9cd5e55

Issue - State: open - Opened by rhshadrach over 1 year ago
Labels: Performance, Regression

#58286 - Potential regression induced by commit eb55bca

Issue - State: closed - Opened by rhshadrach over 1 year ago - 2 comments
Labels: Performance, Regression, Timedelta

#58285 - Potential regression induced by commit 924f246

Issue - State: open - Opened by rhshadrach over 1 year ago
Labels: Performance, Output-Formatting, Regression