Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / dask/fastparquet issues and pull requests

#933 - Some compatibility fixes

Pull Request - State: open - Opened by martindurant about 1 month ago - 1 comment

#932 - feat: support for writing to buffers

Pull Request - State: closed - Opened by felixscherz about 1 month ago - 1 comment

#931 - pure-numpy interface to parquet

Pull Request - State: open - Opened by martindurant about 1 month ago

#930 - Support upcoming default pandas string dtype (pandas >= 3)

Issue - State: open - Opened by jorisvandenbossche about 1 month ago - 6 comments

#928 - No wheel file for fastparquet 2024.5.0 for Python 3.12 and Windows

Issue - State: closed - Opened by DrGFreeman about 2 months ago - 4 comments

#927 - fix(_dtypes): non pandas boolean numpy type was deprecated

Pull Request - State: closed - Opened by ThomasDsantos 2 months ago - 2 comments

#926 - Issues with filtering when using to_pandas

Issue - State: closed - Opened by jscottcronin 3 months ago - 4 comments

#925 - See what happens if we don't track thrift i32

Pull Request - State: open - Opened by martindurant 4 months ago

#924 - New release?

Issue - State: closed - Opened by jakirkham 4 months ago - 4 comments

#923 - Fastparquet raises on import with numpy 2.0 rc

Issue - State: closed - Opened by phofl 5 months ago - 5 comments

#922 - Use np.int64 type for day to nanosecond conversion (NEP50)

Pull Request - State: closed - Opened by bnavigator 5 months ago - 2 comments

#921 - Numpy 2: OverflowError with int96

Issue - State: closed - Opened by bnavigator 5 months ago - 4 comments

#920 - Categorical dtype not preserved with fastparquet-write, pyarrow-read

Issue - State: open - Opened by zmoon 8 months ago - 2 comments

#919 - Upcoming pandas (>2.2.0) raises "read-only" errors

Issue - State: open - Opened by martindurant 8 months ago - 3 comments

#918 - Update action versions

Pull Request - State: closed - Opened by martindurant 8 months ago

#917 - Loading List of List of Strings leads to nans

Issue - State: open - Opened by olegsinavski 8 months ago - 6 comments

#916 - Allow zoneinfo objects

Pull Request - State: closed - Opened by mroeschke 8 months ago - 7 comments

#915 - Support zoneinfo.ZoneInfo timezones

Issue - State: closed - Opened by mroeschke 8 months ago

#914 - Option to not close() after write() when writing to buffer

Issue - State: closed - Opened by luukburger 9 months ago - 3 comments

#912 - Rewrite delta bitpack reader

Pull Request - State: closed - Opened by martindurant 9 months ago - 1 comment

#910 - try for pandas CI

Pull Request - State: closed - Opened by martindurant 10 months ago

#909 - Fix reading timezones from metadata

Pull Request - State: closed - Opened by barbuz 10 months ago - 4 comments

#908 - Bug loading parquet files with timezone information

Issue - State: closed - Opened by barbuz 10 months ago - 6 comments

#907 - schema evolution when writing the row groups does not work

Issue - State: open - Opened by braindevices 10 months ago - 4 comments

#906 - Don't .data on numpy array

Pull Request - State: closed - Opened by martindurant 10 months ago

#905 - issue #904. Err when no props in metadata

Pull Request - State: closed - Opened by remi-sap 10 months ago

#903 - persist dataframe attrs

Pull Request - State: closed - Opened by martindurant 11 months ago

#902 - fix dtype-str concatenation bug in empty()

Pull Request - State: closed - Opened by cshaley 11 months ago - 1 comment

#901 - Nullable types for 1 row vs multiple rows

Issue - State: closed - Opened by yoav-orca 11 months ago - 3 comments

#900 - attrs persistance for Pandas

Issue - State: closed - Opened by davetapley 11 months ago - 1 comment

#899 - Further _from_sequence

Pull Request - State: closed - Opened by martindurant 11 months ago - 4 comments

#898 - Fix dt regression in empty()

Pull Request - State: closed - Opened by martindurant 11 months ago - 8 comments

#897 - Regression due to `_from_sequence`

Issue - State: closed - Opened by martindurant 11 months ago - 1 comment

#896 - Some `fastparquet`-related tests are failing on Python 3.10

Issue - State: open - Opened by jrbourbeau 11 months ago - 10 comments

#895 - update test versions

Pull Request - State: closed - Opened by martindurant 11 months ago

#894 - changelog & unpin build actions

Pull Request - State: closed - Opened by martindurant 11 months ago

#893 - Use dt units in empty() (tz path)

Pull Request - State: closed - Opened by martindurant 11 months ago - 1 comment

#892 - a python-3.12 windows wheel

Issue - State: closed - Opened by stonebig 11 months ago - 13 comments

#891 - BUG: dataframe.empty with non-nano pd.DatetimeTZDtype

Issue - State: closed - Opened by jbrockmendel 11 months ago - 2 comments

#890 - Be more defensive about inplace decompression in V2

Pull Request - State: closed - Opened by martindurant 12 months ago - 4 comments

#888 - Allow categorical column with no categories

Pull Request - State: closed - Opened by martindurant 12 months ago

#887 - fastparquet cannot read a categorical column that contains NaNs only

Issue - State: closed - Opened by apamplifi 12 months ago - 2 comments

#886 - fixes for numpy2

Pull Request - State: closed - Opened by graingert 12 months ago - 2 comments

#885 - Allow RLE for bools in v1 pages

Pull Request - State: closed - Opened by martindurant about 1 year ago - 7 comments

#884 - BUG: reading boolean column with RLE encoding gives wrong values

Issue - State: closed - Opened by jorisvandenbossche about 1 year ago - 4 comments

#883 - Allow DELTA for V1 pages

Pull Request - State: closed - Opened by martindurant about 1 year ago

#882 - fastparquet encoding issue.

Issue - State: closed - Opened by venkatsura about 1 year ago - 20 comments

#881 - release notes

Pull Request - State: closed - Opened by martindurant about 1 year ago

#881 - release notes

Pull Request - State: closed - Opened by martindurant about 1 year ago

#880 - Fastparquet not outputting DATE logical type while using to_parquet of pandas

Issue - State: closed - Opened by bsikander about 1 year ago - 4 comments

#879 - Potential Parquet File Metadata Corruption After Process Timeout

Issue - State: open - Opened by alordthorsen about 1 year ago - 7 comments

#878 - API Documentation Page Is Empty

Issue - State: closed - Opened by rawrgulmuffins about 1 year ago - 1 comment

#877 - docstring for ParquetFile updated

Pull Request - State: closed - Opened by terr01 about 1 year ago - 1 comment

#876 - pandas nullable opt-out not working as intented

Issue - State: closed - Opened by terr01 about 1 year ago - 3 comments

#874 - Use non-ns units on timestamps declared the old way

Pull Request - State: closed - Opened by martindurant about 1 year ago

#873 - row_filter=True fails if parquet file wasn't made by fastparquet

Issue - State: closed - Opened by zmbc about 1 year ago - 1 comment

#872 - Incorrect value returned for overflow timestamps in micros format for V1 footers

Issue - State: closed - Opened by revans2 about 1 year ago - 4 comments

#871 - Add RTDv2 config

Pull Request - State: closed - Opened by martindurant about 1 year ago

#870 - Add test case for reading non-pandas parquet file

Pull Request - State: closed - Opened by piotrb5e3 over 1 year ago - 4 comments

#868 - write to buffer support

Issue - State: closed - Opened by strongbugman over 1 year ago - 2 comments

#867 - TypeError: 'NoneType' object is not iterable

Issue - State: open - Opened by davetapley over 1 year ago - 2 comments

#866 - Extra field when cloning ParquetFile

Pull Request - State: closed - Opened by martindurant over 1 year ago

#865 - Can't read parquet files created by pyspark using ParquetFile.iter_row_groups()

Issue - State: closed - Opened by igozali over 1 year ago - 2 comments

#864 - sdist for 2023.4.0 is missing on PyPI

Issue - State: closed - Opened by sunpoet over 1 year ago - 2 comments

#863 - Allow selecting column for cat output when only one row-group

Pull Request - State: closed - Opened by martindurant over 1 year ago

#862 - Allow dict loading for single row-group

Issue - State: closed - Opened by martindurant over 1 year ago

#861 - Honor column dtype from input dataframe when roundtripping

Pull Request - State: closed - Opened by phofl over 1 year ago - 4 comments

#860 - Change ``fastpath=True`` to ``Categorical.from_codes``

Pull Request - State: closed - Opened by phofl over 1 year ago - 5 comments

#859 - Preserve columns dtype when columnns=[] is given

Pull Request - State: closed - Opened by phofl over 1 year ago - 3 comments

#858 - Don't convert to str for bytes-per-item estimate

Pull Request - State: closed - Opened by martindurant over 1 year ago

#857 - Deprecate ._data

Pull Request - State: closed - Opened by martindurant over 1 year ago

#856 - Select

Pull Request - State: closed - Opened by martindurant over 1 year ago

#855 - Memory spike from converting Pandas StringDtype to Numpy unicode array.

Issue - State: closed - Opened by mshober over 1 year ago - 3 comments

#853 - release notes

Pull Request - State: closed - Opened by martindurant over 1 year ago

#852 - Revert single-level list of filters to AND

Pull Request - State: closed - Opened by martindurant over 1 year ago - 1 comment

#851 - BUG single list of filters does not appear to AND properly

Issue - State: closed - Opened by beckermr over 1 year ago - 7 comments

#850 - Use bigger int type for dereferencing dicts in V2

Pull Request - State: closed - Opened by martindurant over 1 year ago - 2 comments

#849 - Reading a Parquet file produced by pyarrow results to corrupted data read

Issue - State: closed - Opened by miohtama over 1 year ago - 5 comments

#848 - Fix heading in releasenotes.rst

Pull Request - State: closed - Opened by GianlucaFicarelli over 1 year ago

#847 - Allow mix of nested and None in infer_object_encoding

Pull Request - State: closed - Opened by martindurant over 1 year ago

#846 - fastparquet 2023.1.0 may fail to dump dataframes with nested objects

Issue - State: closed - Opened by GianlucaFicarelli over 1 year ago - 2 comments

#845 - enable V2 page row-filters

Pull Request - State: closed - Opened by martindurant over 1 year ago

#844 - release notes

Pull Request - State: closed - Opened by martindurant over 1 year ago

#843 - [do not merge] try tests

Pull Request - State: closed - Opened by martindurant over 1 year ago

#842 - Pages

Pull Request - State: closed - Opened by martindurant over 1 year ago - 1 comment

#841 - Squash pandas warnings and improve write speed

Pull Request - State: closed - Opened by martindurant over 1 year ago

#840 - Speed up Parquet Writing?

Issue - State: closed - Opened by marklit over 1 year ago - 7 comments

#839 - OverflowError with a 3GB, 11M-line JSONL file

Issue - State: closed - Opened by marklit over 1 year ago - 6 comments

#838 - Fixes for Pandas 2.0

Pull Request - State: closed - Opened by martindurant over 1 year ago

#837 - Fails to rountrip non-`ns` `datetime64` with `pandas` 2.0

Issue - State: closed - Opened by jrbourbeau almost 2 years ago - 3 comments

#836 - Roundtrip tz for multi-index/categorical columns

Pull Request - State: closed - Opened by martindurant almost 2 years ago

#835 - Delta test

Pull Request - State: closed - Opened by martindurant almost 2 years ago - 2 comments