Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / dask/fastparquet issues and pull requests
#942 - Publish Python 3.13 wheels
Issue -
State: open - Opened by edgarrmondragon 27 days ago
- 1 comment
#941 - micromamba -> miniconda
Pull Request -
State: closed - Opened by martindurant 27 days ago
#940 - Statistics on ``ParquetFile`` subset.
Pull Request -
State: closed - Opened by yohplala 27 days ago
- 2 comments
#939 - Linting in fastparquet?
Issue -
State: open - Opened by yohplala 27 days ago
- 3 comments
#938 - ``statistics`` does not work on a ParquetFile subset?
Issue -
State: closed - Opened by yohplala 28 days ago
- 2 comments
#937 - The parquet format specification is not followed for Interval type (i.e. timedeltas)
Issue -
State: open - Opened by mgab about 2 months ago
- 3 comments
#936 - timezone aware timestamps in micro-seconds (us) read as '1970-01-01'
Issue -
State: closed - Opened by jlequeux about 2 months ago
- 1 comment
#935 - iter_row_groups raises KeyError when dots in column names and data is not primitive
Issue -
State: open - Opened by adrienDog 3 months ago
- 11 comments
#934 - Fix Spark Example link
Pull Request -
State: closed - Opened by jimwhite 3 months ago
#933 - Some compatibility fixes
Pull Request -
State: closed - Opened by martindurant 3 months ago
- 1 comment
#932 - feat: support for writing to buffers
Pull Request -
State: closed - Opened by felixscherz 3 months ago
- 1 comment
#931 - pure-numpy interface to parquet
Pull Request -
State: open - Opened by martindurant 3 months ago
- 4 comments
#930 - Support upcoming default pandas string dtype (pandas >= 3)
Issue -
State: closed - Opened by jorisvandenbossche 3 months ago
- 6 comments
#929 - BUG: reading datetimeindex with time zone gives wrong values (all "1970-01-01 01:00:00")
Issue -
State: closed - Opened by jorisvandenbossche 3 months ago
- 2 comments
#928 - No wheel file for fastparquet 2024.5.0 for Python 3.12 and Windows
Issue -
State: closed - Opened by DrGFreeman 3 months ago
- 4 comments
#927 - fix(_dtypes): non pandas boolean numpy type was deprecated
Pull Request -
State: closed - Opened by ThomasDsantos 4 months ago
- 2 comments
#926 - Issues with filtering when using to_pandas
Issue -
State: closed - Opened by jscottcronin 5 months ago
- 4 comments
#925 - See what happens if we don't track thrift i32
Pull Request -
State: open - Opened by martindurant 6 months ago
#924 - New release?
Issue -
State: closed - Opened by jakirkham 6 months ago
- 4 comments
#923 - Fastparquet raises on import with numpy 2.0 rc
Issue -
State: closed - Opened by phofl 7 months ago
- 5 comments
#922 - Use np.int64 type for day to nanosecond conversion (NEP50)
Pull Request -
State: closed - Opened by bnavigator 7 months ago
- 2 comments
#921 - Numpy 2: OverflowError with int96
Issue -
State: closed - Opened by bnavigator 7 months ago
- 4 comments
#920 - Categorical dtype not preserved with fastparquet-write, pyarrow-read
Issue -
State: open - Opened by zmoon 10 months ago
- 2 comments
#919 - Upcoming pandas (>2.2.0) raises "read-only" errors
Issue -
State: open - Opened by martindurant 10 months ago
- 3 comments
#918 - Update action versions
Pull Request -
State: closed - Opened by martindurant 10 months ago
#917 - Loading List of List of Strings leads to nans
Issue -
State: open - Opened by olegsinavski 10 months ago
- 6 comments
#916 - Allow zoneinfo objects
Pull Request -
State: closed - Opened by mroeschke 10 months ago
- 7 comments
#915 - Support zoneinfo.ZoneInfo timezones
Issue -
State: closed - Opened by mroeschke 10 months ago
#914 - Option to not close() after write() when writing to buffer
Issue -
State: closed - Opened by luukburger 11 months ago
- 3 comments
#913 - PyArrow will become a required dependency with pandas 3.0
Issue -
State: open - Opened by davetapley 11 months ago
#912 - Rewrite delta bitpack reader
Pull Request -
State: closed - Opened by martindurant 11 months ago
- 1 comment
#911 - When changing to a larger dtype, its size must be a advisor of the total size in bytes of the last axis of the array
Issue -
State: closed - Opened by ymatrix-jt 12 months ago
- 6 comments
#910 - try for pandas CI
Pull Request -
State: closed - Opened by martindurant 12 months ago
#909 - Fix reading timezones from metadata
Pull Request -
State: closed - Opened by barbuz 12 months ago
- 4 comments
#908 - Bug loading parquet files with timezone information
Issue -
State: closed - Opened by barbuz 12 months ago
- 6 comments
#907 - schema evolution when writing the row groups does not work
Issue -
State: open - Opened by braindevices about 1 year ago
- 4 comments
#906 - Don't .data on numpy array
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#905 - issue #904. Err when no props in metadata
Pull Request -
State: closed - Opened by remi-sap about 1 year ago
#904 - update_file_custom_metadata error when file has no properties.
Issue -
State: closed - Opened by remi-sap about 1 year ago
#903 - persist dataframe attrs
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#902 - fix dtype-str concatenation bug in empty()
Pull Request -
State: closed - Opened by cshaley about 1 year ago
- 1 comment
#901 - Nullable types for 1 row vs multiple rows
Issue -
State: closed - Opened by yoav-orca about 1 year ago
- 3 comments
#900 - attrs persistance for Pandas
Issue -
State: closed - Opened by davetapley about 1 year ago
- 1 comment
#899 - Further _from_sequence
Pull Request -
State: closed - Opened by martindurant about 1 year ago
- 4 comments
#898 - Fix dt regression in empty()
Pull Request -
State: closed - Opened by martindurant about 1 year ago
- 8 comments
#897 - Regression due to `_from_sequence`
Issue -
State: closed - Opened by martindurant about 1 year ago
- 1 comment
#896 - Some `fastparquet`-related tests are failing on Python 3.10
Issue -
State: open - Opened by jrbourbeau about 1 year ago
- 10 comments
#895 - update test versions
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#894 - changelog & unpin build actions
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#893 - Use dt units in empty() (tz path)
Pull Request -
State: closed - Opened by martindurant about 1 year ago
- 1 comment
#892 - a python-3.12 windows wheel
Issue -
State: closed - Opened by stonebig about 1 year ago
- 13 comments
#891 - BUG: dataframe.empty with non-nano pd.DatetimeTZDtype
Issue -
State: closed - Opened by jbrockmendel about 1 year ago
- 2 comments
#890 - Be more defensive about inplace decompression in V2
Pull Request -
State: closed - Opened by martindurant about 1 year ago
- 4 comments
#889 - to_pandas(): cramjam.DecompressionError: snappy: output buffer (size = 262144) is smaller than required (size = 1048576)
Issue -
State: closed - Opened by miohtama about 1 year ago
- 1 comment
#888 - Allow categorical column with no categories
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#887 - fastparquet cannot read a categorical column that contains NaNs only
Issue -
State: closed - Opened by apamplifi about 1 year ago
- 2 comments
#886 - fixes for numpy2
Pull Request -
State: closed - Opened by graingert about 1 year ago
- 2 comments
#885 - Allow RLE for bools in v1 pages
Pull Request -
State: closed - Opened by martindurant about 1 year ago
- 7 comments
#884 - BUG: reading boolean column with RLE encoding gives wrong values
Issue -
State: closed - Opened by jorisvandenbossche about 1 year ago
- 4 comments
#883 - Allow DELTA for V1 pages
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#882 - fastparquet encoding issue.
Issue -
State: closed - Opened by venkatsura about 1 year ago
- 20 comments
#881 - release notes
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#881 - release notes
Pull Request -
State: closed - Opened by martindurant about 1 year ago
#880 - Fastparquet not outputting DATE logical type while using to_parquet of pandas
Issue -
State: closed - Opened by bsikander about 1 year ago
- 4 comments
#879 - Potential Parquet File Metadata Corruption After Process Timeout
Issue -
State: open - Opened by alordthorsen over 1 year ago
- 7 comments
#878 - API Documentation Page Is Empty
Issue -
State: closed - Opened by rawrgulmuffins over 1 year ago
- 1 comment
#877 - docstring for ParquetFile updated
Pull Request -
State: closed - Opened by terr01 over 1 year ago
- 1 comment
#876 - pandas nullable opt-out not working as intented
Issue -
State: closed - Opened by terr01 over 1 year ago
- 3 comments
#875 - ValueError: Seek before start of file using custom open_with function or S3 file object
Issue -
State: open - Opened by soerenbrandt over 1 year ago
- 3 comments
#874 - Use non-ns units on timestamps declared the old way
Pull Request -
State: closed - Opened by martindurant over 1 year ago
#873 - row_filter=True fails if parquet file wasn't made by fastparquet
Issue -
State: closed - Opened by zmbc over 1 year ago
- 1 comment
#872 - Incorrect value returned for overflow timestamps in micros format for V1 footers
Issue -
State: closed - Opened by revans2 over 1 year ago
- 4 comments
#871 - Add RTDv2 config
Pull Request -
State: closed - Opened by martindurant over 1 year ago
#870 - Add test case for reading non-pandas parquet file
Pull Request -
State: closed - Opened by piotrb5e3 over 1 year ago
- 4 comments
#869 - AttributeError: 'ParquetFile' object has no attribute '_columns_dtype' when reading files without pandas metadata
Issue -
State: open - Opened by piotrb5e3 over 1 year ago
- 1 comment
#868 - write to buffer support
Issue -
State: closed - Opened by strongbugman over 1 year ago
- 2 comments
#867 - TypeError: 'NoneType' object is not iterable
Issue -
State: open - Opened by davetapley over 1 year ago
- 2 comments
#866 - Extra field when cloning ParquetFile
Pull Request -
State: closed - Opened by martindurant over 1 year ago
#865 - Can't read parquet files created by pyspark using ParquetFile.iter_row_groups()
Issue -
State: closed - Opened by igozali over 1 year ago
- 2 comments
#864 - sdist for 2023.4.0 is missing on PyPI
Issue -
State: closed - Opened by sunpoet over 1 year ago
- 2 comments
#863 - Allow selecting column for cat output when only one row-group
Pull Request -
State: closed - Opened by martindurant over 1 year ago
#862 - Allow dict loading for single row-group
Issue -
State: closed - Opened by martindurant over 1 year ago
#861 - Honor column dtype from input dataframe when roundtripping
Pull Request -
State: closed - Opened by phofl over 1 year ago
- 4 comments
#860 - Change ``fastpath=True`` to ``Categorical.from_codes``
Pull Request -
State: closed - Opened by phofl over 1 year ago
- 5 comments
#859 - Preserve columns dtype when columnns=[] is given
Pull Request -
State: closed - Opened by phofl over 1 year ago
- 3 comments
#858 - Don't convert to str for bytes-per-item estimate
Pull Request -
State: closed - Opened by martindurant over 1 year ago
#857 - Deprecate ._data
Pull Request -
State: closed - Opened by martindurant over 1 year ago
#856 - Select
Pull Request -
State: closed - Opened by martindurant over 1 year ago
#855 - Memory spike from converting Pandas StringDtype to Numpy unicode array.
Issue -
State: closed - Opened by mshober over 1 year ago
- 3 comments
#854 - [Question] Support for `arrow` data types when reading data with `fastparquet`
Issue -
State: open - Opened by j-bennet over 1 year ago
- 5 comments
#853 - release notes
Pull Request -
State: closed - Opened by martindurant almost 2 years ago
#852 - Revert single-level list of filters to AND
Pull Request -
State: closed - Opened by martindurant almost 2 years ago
- 1 comment
#851 - BUG single list of filters does not appear to AND properly
Issue -
State: closed - Opened by beckermr almost 2 years ago
- 7 comments
#850 - Use bigger int type for dereferencing dicts in V2
Pull Request -
State: closed - Opened by martindurant almost 2 years ago
- 2 comments
#849 - Reading a Parquet file produced by pyarrow results to corrupted data read
Issue -
State: closed - Opened by miohtama almost 2 years ago
- 5 comments
#848 - Fix heading in releasenotes.rst
Pull Request -
State: closed - Opened by GianlucaFicarelli almost 2 years ago
#847 - Allow mix of nested and None in infer_object_encoding
Pull Request -
State: closed - Opened by martindurant almost 2 years ago
#846 - fastparquet 2023.1.0 may fail to dump dataframes with nested objects
Issue -
State: closed - Opened by GianlucaFicarelli almost 2 years ago
- 2 comments
#845 - enable V2 page row-filters
Pull Request -
State: closed - Opened by martindurant almost 2 years ago
#844 - release notes
Pull Request -
State: closed - Opened by martindurant almost 2 years ago