Ecosyste.ms: Issues

An open API service that provides issue and pull request metadata for open source projects.

GitHub / mosaicml/streaming issues and pull requests

#829 - fix f string

Pull Request - State: open - Opened by XiaohanZhangCMU 7 days ago

#828 - Missing f-string

Issue - State: open - Opened by kevin-hanselman 8 days ago - 1 comment

#827 - Pipeline Parallelism (Supported? How to?)

Issue - State: open - Opened by casper-hansen 9 days ago - 2 comments
Labels: enhancement

#826 - Consistent errors for unused streams in batching methods

Pull Request - State: closed - Opened by snarayan21 12 days ago

#825 - Update setuptools requirement from <68.0.0 to <76.0.0

Pull Request - State: closed - Opened by dependabot[bot] 12 days ago
Labels: dependencies

#824 - Cannot Load MDS Dataset

Issue - State: open - Opened by naston 12 days ago - 13 comments
Labels: bug

#823 - Add upper bound for prefix_int

Pull Request - State: open - Opened by XiaohanZhangCMU 17 days ago - 1 comment

#822 - Update FAQs to indicate wrapping not supported

Pull Request - State: closed - Opened by milocress 18 days ago

#821 - Update pytest-cov requirement from <6,>=4 to >=4,<7

Pull Request - State: closed - Opened by dependabot[bot] 19 days ago
Labels: dependencies

#820 - UnicodeDecodeError: ... Efficient way to debug the dataset with streaming?

Issue - State: open - Opened by TAYmit 21 days ago - 3 comments
Labels: enhancement

#819 - Bump version to 0.10.1

Pull Request - State: closed - Opened by XiaohanZhangCMU 21 days ago

#818 - add jpeg quality option

Pull Request - State: open - Opened by cabreraalex 25 days ago - 5 comments

#817 - refactored the download module to have reusable clients

Pull Request - State: closed - Opened by ethantang-db 25 days ago

#816 - Fix device_per_stream bugs

Pull Request - State: closed - Opened by quentindrx 26 days ago - 7 comments

#815 - Bump fastapi from 0.115.2 to 0.115.4

Pull Request - State: closed - Opened by dependabot[bot] 26 days ago
Labels: dependencies

#814 - Update huggingface-hub requirement from <0.26,>=0.23.4 to >=0.23.4,<0.27

Pull Request - State: closed - Opened by dependabot[bot] 26 days ago
Labels: dependencies

#813 - Fix shared memory permission issue in a shared pod environment

Pull Request - State: closed - Opened by XiaohanZhangCMU 28 days ago - 2 comments

#812 - write to S3 is very slow

Issue - State: open - Opened by charliedream1 29 days ago - 3 comments
Labels: bug

#811 - Choose JPEG compression level

Issue - State: open - Opened by cabreraalex 29 days ago - 1 comment
Labels: enhancement

#810 - Bump pytest-split from 0.9.0 to 0.10.0

Pull Request - State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies

#809 - Bump uvicorn from 0.31.1 to 0.32.0

Pull Request - State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies

#808 - Fix logo png

Pull Request - State: closed - Opened by XiaohanZhangCMU about 1 month ago - 1 comment

#807 - Issue: NCCL ProcessGroup Device Mapping Bug when using streaming with accelerate

Issue - State: closed - Opened by wangyanhui666 about 1 month ago - 5 comments
Labels: bug

#806 - Add better error message for shared prefix

Pull Request - State: closed - Opened by XiaohanZhangCMU about 1 month ago

#805 - Introducing Streaming Guru on Gurubase.io

Pull Request - State: closed - Opened by kursataktas about 1 month ago

#804 - Bump fastapi from 0.115.0 to 0.115.2

Pull Request - State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies

#803 - Bump uvicorn from 0.31.0 to 0.31.1

Pull Request - State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies

#801 - MosaicML-Streaming on Databricks

Issue - State: open - Opened by gtmdotme about 1 month ago - 9 comments

#800 - Support for on-the-fly filtering

Issue - State: open - Opened by ColinToft about 1 month ago
Labels: enhancement

#799 - Warning -> info about defaults from v0.7.0

Pull Request - State: closed - Opened by snarayan21 about 2 months ago

#798 - Fix dataset.size() typo in docs

Pull Request - State: closed - Opened by snarayan21 about 2 months ago

#797 - Update pre-commit requirement from <4,>=2.18.1 to >=2.18.1,<5

Pull Request - State: open - Opened by dependabot[bot] about 2 months ago - 2 comments
Labels: dependencies

#796 - Degraded shuffle quality near the end of an epoch

Issue - State: closed - Opened by thayes427 about 2 months ago - 4 comments
Labels: bug

#795 - Shard evict fix

Pull Request - State: closed - Opened by snarayan21 about 2 months ago - 1 comment

#794 - Fixed broken links in README.md

Pull Request - State: closed - Opened by LukaszSztukiewicz about 2 months ago

#793 - Bump uvicorn from 0.30.6 to 0.31.0

Pull Request - State: closed - Opened by dependabot[bot] about 2 months ago
Labels: dependencies

#792 - Make `epoch_sample_ids` cachable

Issue - State: open - Opened by janEbert about 2 months ago - 2 comments
Labels: enhancement

#791 - Search local shards directly to find shard to evict

Pull Request - State: closed - Opened by coryMosaicML about 2 months ago - 5 comments

#790 - Bump main branch to 0.10.0.dev0

Pull Request - State: closed - Opened by dakinggg about 2 months ago

#789 - Guide on using with HuggingFace accelerate/Trainer

Issue - State: closed - Opened by shimizust about 2 months ago - 6 comments
Labels: enhancement

#788 - remove v0.7.0 warning

Pull Request - State: closed - Opened by eitanturok 2 months ago - 1 comment

#787 - Update huggingface-hub requirement from <0.25,>=0.23.4 to >=0.23.4,<0.26

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago - 1 comment
Labels: dependencies

#786 - Bump fastapi from 0.114.2 to 0.115.0

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago
Labels: dependencies

#785 - Bump pydantic from 2.9.1 to 2.9.2

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago - 1 comment
Labels: dependencies

#784 - Update datasets requirement from <3,>=2.4.0 to >=2.4.0,<4

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago - 1 comment
Labels: dependencies

#783 - Bump fastapi from 0.114.0 to 0.114.2

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago - 1 comment
Labels: dependencies

#782 - Bump pytest from 8.3.2 to 8.3.3

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago - 1 comment
Labels: dependencies

#781 - Dataset does not work after stopping training

Issue - State: open - Opened by AugustDev 2 months ago - 1 comment
Labels: bug

#780 - Sparse Numpy Arrays

Issue - State: open - Opened by Matagi1996 2 months ago
Labels: enhancement

#779 - Bump fastapi from 0.112.2 to 0.114.0

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago - 1 comment
Labels: dependencies

#778 - Bump pydantic from 2.8.2 to 2.9.1

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago - 1 comment
Labels: dependencies

#777 - Allow JSON encoder to handle ndarray

Pull Request - State: closed - Opened by srowen 3 months ago
Labels: enhancement

#776 - Add MapType as JSON-compatible

Pull Request - State: closed - Opened by srowen 3 months ago - 2 comments
Labels: enhancement

#775 - JointWriter: Allow shard file appending

Issue - State: open - Opened by janEbert 3 months ago - 2 comments
Labels: bug

#774 - GCS Auth ERROR/Download timeout

Issue - State: closed - Opened by rishabhm12 3 months ago - 1 comment
Labels: bug

#773 - Refactor spanner to avoid creating large array

Pull Request - State: open - Opened by XiaohanZhangCMU 3 months ago - 10 comments

#772 - Bump jupyter from 1.0.0 to 1.1.1

Pull Request - State: closed - Opened by dependabot[bot] 3 months ago - 1 comment
Labels: dependencies

#771 - MemoryError: Unable to allocate

Issue - State: open - Opened by AugustDev 3 months ago - 1 comment
Labels: bug

#770 - Bump ci testing

Pull Request - State: closed - Opened by snarayan21 3 months ago

#769 - Is it possible to add additional columns or metadata into an exists dataset?

Issue - State: closed - Opened by zhou13 3 months ago - 2 comments
Labels: enhancement

#768 - Bump fastapi from 0.112.1 to 0.112.2

Pull Request - State: closed - Opened by dependabot[bot] 3 months ago - 3 comments
Labels: dependencies

#767 - File exists: '/000000_epoch_shape' when using the ddp strategy from pytorch lightning

Issue - State: open - Opened by elbamos 3 months ago - 23 comments
Labels: bug

#766 - Version 0.8.1 bump!

Pull Request - State: closed - Opened by snarayan21 3 months ago

#765 - Fix size missing in encoding sample

Pull Request - State: closed - Opened by XiaohanZhangCMU 3 months ago

#764 - Fix linting for numpy 2.1.0

Pull Request - State: closed - Opened by snarayan21 3 months ago

#763 - Why does StreamingDataset.state_dict() have to be told how many samples have been yielded?

Issue - State: closed - Opened by thayes427 3 months ago - 2 comments
Labels: enhancement

#762 - Bump uvicorn from 0.30.5 to 0.30.6

Pull Request - State: closed - Opened by dependabot[bot] 3 months ago - 1 comment
Labels: dependencies

#761 - Bump databricks-sdk from 0.29.0 to 0.30.0

Pull Request - State: open - Opened by dependabot[bot] 3 months ago - 1 comment
Labels: dependencies

#760 - Bump fastapi from 0.112.0 to 0.112.1

Pull Request - State: closed - Opened by dependabot[bot] 3 months ago - 1 comment
Labels: dependencies

#759 - Possible to bypass/re-download checksum mismatched shards?

Issue - State: closed - Opened by huxuan 3 months ago - 1 comment
Labels: enhancement

#758 - Memory leak using download_file with DDP or FSDP

Issue - State: closed - Opened by nagadit 3 months ago - 9 comments
Labels: bug

#757 - Can't output Vector or Array spark types

Issue - State: closed - Opened by elbamos 3 months ago - 6 comments
Labels: bug

#756 - Ruff rule to remove unused imports

Pull Request - State: closed - Opened by snarayan21 3 months ago

#755 - Add pycln to pre commit to remove unused imports

Pull Request - State: closed - Opened by snarayan21 3 months ago - 5 comments

#754 - Throw exception when event.is_set() after write()s

Pull Request - State: closed - Opened by srowen 3 months ago - 12 comments

#752 - Type hints conformant with pep 585

Pull Request - State: closed - Opened by snarayan21 3 months ago

#751 - Check file size within LocalUploader

Pull Request - State: open - Opened by XiaohanZhangCMU 3 months ago - 1 comment

#749 - Different lists of examples when shuffle == False

Issue - State: closed - Opened by experiencor 3 months ago - 2 comments
Labels: bug

#748 - Add default compression, and warning about local paths to dataframe_to_mds

Pull Request - State: closed - Opened by srowen 3 months ago - 5 comments

#747 - Bump ci-testing to v0.1.2

Pull Request - State: closed - Opened by snarayan21 4 months ago

#746 - Patching conf.py due to Sphinx deprecating config manipulation

Pull Request - State: closed - Opened by snarayan21 4 months ago

#745 - Bump ci-testing to v0.1.0

Pull Request - State: closed - Opened by snarayan21 4 months ago

#744 - Bump fastapi from 0.111.1 to 0.112.0

Pull Request - State: closed - Opened by dependabot[bot] 4 months ago
Labels: dependencies

#743 - Bump uvicorn from 0.30.3 to 0.30.5

Pull Request - State: closed - Opened by dependabot[bot] 4 months ago - 1 comment
Labels: dependencies

#742 - Estimate total shards at the beginning of data conversion

Issue - State: open - Opened by abhijithneilabraham 4 months ago - 1 comment
Labels: enhancement

#741 - Fix dataloader hang at the end of an epoch

Pull Request - State: closed - Opened by XiaohanZhangCMU 4 months ago - 1 comment

#740 - How to use with multi GPU training?

Issue - State: closed - Opened by MaxxP0 4 months ago - 3 comments

#739 - Make Pytest log in color in Github Action

Pull Request - State: closed - Opened by eitanturok 4 months ago

#738 - Bump Streaming Version to 0.8.0

Pull Request - State: closed - Opened by mvpatel2000 4 months ago

#737 - Resume of data conversion?

Issue - State: closed - Opened by huxuan 4 months ago - 5 comments
Labels: enhancement

#736 - ndarray is not supported by dataframe_to_mds [mosaicml-streaming==0.7.6]

Issue - State: closed - Opened by akshat-suwalka-dream11 4 months ago - 13 comments
Labels: bug

#735 - Bump pytest from 8.2.2 to 8.3.2

Pull Request - State: closed - Opened by dependabot[bot] 4 months ago - 1 comment
Labels: dependencies

#734 - huge temp files while uploading data using MDS writer

Issue - State: open - Opened by MaxxP0 4 months ago - 2 comments
Labels: bug

#733 - fix azure container name and blob name in download_from_azure

Pull Request - State: closed - Opened by jaehwana2z 4 months ago - 1 comment

#732 - [Question] StreamingOutsideDTWebVid cache_limit for video

Issue - State: closed - Opened by nagadit 4 months ago - 3 comments

#731 - Bump pytest from 8.2.2 to 8.3.1

Pull Request - State: closed - Opened by dependabot[bot] 4 months ago - 1 comment
Labels: dependencies

#730 - Bump uvicorn from 0.30.1 to 0.30.3

Pull Request - State: closed - Opened by dependabot[bot] 4 months ago
Labels: dependencies