Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / mosaicml/streaming issues and pull requests
#829 - fix f string
Pull Request -
State: open - Opened by XiaohanZhangCMU 7 days ago
#828 - Missing f-string
Issue -
State: open - Opened by kevin-hanselman 8 days ago
- 1 comment
#827 - Pipeline Parallelism (Supported? How to?)
Issue -
State: open - Opened by casper-hansen 9 days ago
- 2 comments
Labels: enhancement
#826 - Consistent errors for unused streams in batching methods
Pull Request -
State: closed - Opened by snarayan21 12 days ago
#825 - Update setuptools requirement from <68.0.0 to <76.0.0
Pull Request -
State: closed - Opened by dependabot[bot] 12 days ago
Labels: dependencies
#824 - Cannot Load MDS Dataset
Issue -
State: open - Opened by naston 12 days ago
- 13 comments
Labels: bug
#823 - Add upper bound for prefix_int
Pull Request -
State: open - Opened by XiaohanZhangCMU 17 days ago
- 1 comment
#822 - Update FAQs to indicate wrapping not supported
Pull Request -
State: closed - Opened by milocress 18 days ago
#821 - Update pytest-cov requirement from <6,>=4 to >=4,<7
Pull Request -
State: closed - Opened by dependabot[bot] 19 days ago
Labels: dependencies
#820 - UnicodeDecodeError: ... Efficient way to debug the dataset with streaming?
Issue -
State: open - Opened by TAYmit 21 days ago
- 3 comments
Labels: enhancement
#819 - Bump version to 0.10.1
Pull Request -
State: closed - Opened by XiaohanZhangCMU 21 days ago
#818 - add jpeg quality option
Pull Request -
State: open - Opened by cabreraalex 25 days ago
- 5 comments
#817 - refactored the download module to have reusable clients
Pull Request -
State: closed - Opened by ethantang-db 25 days ago
#816 - Fix device_per_stream bugs
Pull Request -
State: closed - Opened by quentindrx 26 days ago
- 7 comments
#815 - Bump fastapi from 0.115.2 to 0.115.4
Pull Request -
State: closed - Opened by dependabot[bot] 26 days ago
Labels: dependencies
#814 - Update huggingface-hub requirement from <0.26,>=0.23.4 to >=0.23.4,<0.27
Pull Request -
State: closed - Opened by dependabot[bot] 26 days ago
Labels: dependencies
#813 - Fix shared memory permission issue in a shared pod environment
Pull Request -
State: closed - Opened by XiaohanZhangCMU 28 days ago
- 2 comments
#812 - write to S3 is very slow
Issue -
State: open - Opened by charliedream1 29 days ago
- 3 comments
Labels: bug
#811 - Choose JPEG compression level
Issue -
State: open - Opened by cabreraalex 29 days ago
- 1 comment
Labels: enhancement
#810 - Bump pytest-split from 0.9.0 to 0.10.0
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies
#809 - Bump uvicorn from 0.31.1 to 0.32.0
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies
#808 - Fix logo png
Pull Request -
State: closed - Opened by XiaohanZhangCMU about 1 month ago
- 1 comment
#807 - Issue: NCCL ProcessGroup Device Mapping Bug when using streaming with accelerate
Issue -
State: closed - Opened by wangyanhui666 about 1 month ago
- 5 comments
Labels: bug
#806 - Add better error message for shared prefix
Pull Request -
State: closed - Opened by XiaohanZhangCMU about 1 month ago
#805 - Introducing Streaming Guru on Gurubase.io
Pull Request -
State: closed - Opened by kursataktas about 1 month ago
#804 - Bump fastapi from 0.115.0 to 0.115.2
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies
#803 - Bump uvicorn from 0.31.0 to 0.31.1
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies
#802 - Running into "FileExistsError: [Errno 17] File exists: '/000000_epoch_shape'" even with single GPU
Issue -
State: open - Opened by deepanshu-a2z about 1 month ago
- 8 comments
Labels: bug
#801 - MosaicML-Streaming on Databricks
Issue -
State: open - Opened by gtmdotme about 1 month ago
- 9 comments
#800 - Support for on-the-fly filtering
Issue -
State: open - Opened by ColinToft about 1 month ago
Labels: enhancement
#799 - Warning -> info about defaults from v0.7.0
Pull Request -
State: closed - Opened by snarayan21 about 2 months ago
#798 - Fix dataset.size() typo in docs
Pull Request -
State: closed - Opened by snarayan21 about 2 months ago
#797 - Update pre-commit requirement from <4,>=2.18.1 to >=2.18.1,<5
Pull Request -
State: open - Opened by dependabot[bot] about 2 months ago
- 2 comments
Labels: dependencies
#796 - Degraded shuffle quality near the end of an epoch
Issue -
State: closed - Opened by thayes427 about 2 months ago
- 4 comments
Labels: bug
#795 - Shard evict fix
Pull Request -
State: closed - Opened by snarayan21 about 2 months ago
- 1 comment
#794 - Fixed broken links in README.md
Pull Request -
State: closed - Opened by LukaszSztukiewicz about 2 months ago
#793 - Bump uvicorn from 0.30.6 to 0.31.0
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
Labels: dependencies
#792 - Make `epoch_sample_ids` cachable
Issue -
State: open - Opened by janEbert about 2 months ago
- 2 comments
Labels: enhancement
#791 - Search local shards directly to find shard to evict
Pull Request -
State: closed - Opened by coryMosaicML about 2 months ago
- 5 comments
#790 - Bump main branch to 0.10.0.dev0
Pull Request -
State: closed - Opened by dakinggg about 2 months ago
#789 - Guide on using with HuggingFace accelerate/Trainer
Issue -
State: closed - Opened by shimizust about 2 months ago
- 6 comments
Labels: enhancement
#788 - remove v0.7.0 warning
Pull Request -
State: closed - Opened by eitanturok 2 months ago
- 1 comment
#787 - Update huggingface-hub requirement from <0.25,>=0.23.4 to >=0.23.4,<0.26
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
- 1 comment
Labels: dependencies
#786 - Bump fastapi from 0.114.2 to 0.115.0
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
Labels: dependencies
#785 - Bump pydantic from 2.9.1 to 2.9.2
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
- 1 comment
Labels: dependencies
#784 - Update datasets requirement from <3,>=2.4.0 to >=2.4.0,<4
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
- 1 comment
Labels: dependencies
#783 - Bump fastapi from 0.114.0 to 0.114.2
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
- 1 comment
Labels: dependencies
#782 - Bump pytest from 8.3.2 to 8.3.3
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
- 1 comment
Labels: dependencies
#781 - Dataset does not work after stopping training
Issue -
State: open - Opened by AugustDev 2 months ago
- 1 comment
Labels: bug
#780 - Sparse Numpy Arrays
Issue -
State: open - Opened by Matagi1996 2 months ago
Labels: enhancement
#779 - Bump fastapi from 0.112.2 to 0.114.0
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
- 1 comment
Labels: dependencies
#778 - Bump pydantic from 2.8.2 to 2.9.1
Pull Request -
State: closed - Opened by dependabot[bot] 2 months ago
- 1 comment
Labels: dependencies
#777 - Allow JSON encoder to handle ndarray
Pull Request -
State: closed - Opened by srowen 3 months ago
Labels: enhancement
#776 - Add MapType as JSON-compatible
Pull Request -
State: closed - Opened by srowen 3 months ago
- 2 comments
Labels: enhancement
#775 - JointWriter: Allow shard file appending
Issue -
State: open - Opened by janEbert 3 months ago
- 2 comments
Labels: bug
#774 - GCS Auth ERROR/Download timeout
Issue -
State: closed - Opened by rishabhm12 3 months ago
- 1 comment
Labels: bug
#773 - Refactor spanner to avoid creating large array
Pull Request -
State: open - Opened by XiaohanZhangCMU 3 months ago
- 10 comments
#772 - Bump jupyter from 1.0.0 to 1.1.1
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
- 1 comment
Labels: dependencies
#771 - MemoryError: Unable to allocate
Issue -
State: open - Opened by AugustDev 3 months ago
- 1 comment
Labels: bug
#770 - Bump ci testing
Pull Request -
State: closed - Opened by snarayan21 3 months ago
#769 - Is it possible to add additional columns or metadata into an exists dataset?
Issue -
State: closed - Opened by zhou13 3 months ago
- 2 comments
Labels: enhancement
#768 - Bump fastapi from 0.112.1 to 0.112.2
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
- 3 comments
Labels: dependencies
#767 - File exists: '/000000_epoch_shape' when using the ddp strategy from pytorch lightning
Issue -
State: open - Opened by elbamos 3 months ago
- 23 comments
Labels: bug
#766 - Version 0.8.1 bump!
Pull Request -
State: closed - Opened by snarayan21 3 months ago
#765 - Fix size missing in encoding sample
Pull Request -
State: closed - Opened by XiaohanZhangCMU 3 months ago
#764 - Fix linting for numpy 2.1.0
Pull Request -
State: closed - Opened by snarayan21 3 months ago
#763 - Why does StreamingDataset.state_dict() have to be told how many samples have been yielded?
Issue -
State: closed - Opened by thayes427 3 months ago
- 2 comments
Labels: enhancement
#762 - Bump uvicorn from 0.30.5 to 0.30.6
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
- 1 comment
Labels: dependencies
#761 - Bump databricks-sdk from 0.29.0 to 0.30.0
Pull Request -
State: open - Opened by dependabot[bot] 3 months ago
- 1 comment
Labels: dependencies
#760 - Bump fastapi from 0.112.0 to 0.112.1
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
- 1 comment
Labels: dependencies
#759 - Possible to bypass/re-download checksum mismatched shards?
Issue -
State: closed - Opened by huxuan 3 months ago
- 1 comment
Labels: enhancement
#758 - Memory leak using download_file with DDP or FSDP
Issue -
State: closed - Opened by nagadit 3 months ago
- 9 comments
Labels: bug
#757 - Can't output Vector or Array spark types
Issue -
State: closed - Opened by elbamos 3 months ago
- 6 comments
Labels: bug
#756 - Ruff rule to remove unused imports
Pull Request -
State: closed - Opened by snarayan21 3 months ago
#755 - Add pycln to pre commit to remove unused imports
Pull Request -
State: closed - Opened by snarayan21 3 months ago
- 5 comments
#754 - Throw exception when event.is_set() after write()s
Pull Request -
State: closed - Opened by srowen 3 months ago
- 12 comments
#753 - Writers can finish 'successfully' even if there is a failure uploading from local to remote
Issue -
State: closed - Opened by srowen 3 months ago
Labels: bug
#752 - Type hints conformant with pep 585
Pull Request -
State: closed - Opened by snarayan21 3 months ago
#751 - Check file size within LocalUploader
Pull Request -
State: open - Opened by XiaohanZhangCMU 3 months ago
- 1 comment
#750 - Ensure deterministic sample order between epochs when `shuffle=False`
Pull Request -
State: closed - Opened by snarayan21 3 months ago
#749 - Different lists of examples when shuffle == False
Issue -
State: closed - Opened by experiencor 3 months ago
- 2 comments
Labels: bug
#748 - Add default compression, and warning about local paths to dataframe_to_mds
Pull Request -
State: closed - Opened by srowen 3 months ago
- 5 comments
#747 - Bump ci-testing to v0.1.2
Pull Request -
State: closed - Opened by snarayan21 4 months ago
#746 - Patching conf.py due to Sphinx deprecating config manipulation
Pull Request -
State: closed - Opened by snarayan21 4 months ago
#745 - Bump ci-testing to v0.1.0
Pull Request -
State: closed - Opened by snarayan21 4 months ago
#744 - Bump fastapi from 0.111.1 to 0.112.0
Pull Request -
State: closed - Opened by dependabot[bot] 4 months ago
Labels: dependencies
#743 - Bump uvicorn from 0.30.3 to 0.30.5
Pull Request -
State: closed - Opened by dependabot[bot] 4 months ago
- 1 comment
Labels: dependencies
#742 - Estimate total shards at the beginning of data conversion
Issue -
State: open - Opened by abhijithneilabraham 4 months ago
- 1 comment
Labels: enhancement
#741 - Fix dataloader hang at the end of an epoch
Pull Request -
State: closed - Opened by XiaohanZhangCMU 4 months ago
- 1 comment
#740 - How to use with multi GPU training?
Issue -
State: closed - Opened by MaxxP0 4 months ago
- 3 comments
#739 - Make Pytest log in color in Github Action
Pull Request -
State: closed - Opened by eitanturok 4 months ago
#738 - Bump Streaming Version to 0.8.0
Pull Request -
State: closed - Opened by mvpatel2000 4 months ago
#737 - Resume of data conversion?
Issue -
State: closed - Opened by huxuan 4 months ago
- 5 comments
Labels: enhancement
#736 - ndarray is not supported by dataframe_to_mds [mosaicml-streaming==0.7.6]
Issue -
State: closed - Opened by akshat-suwalka-dream11 4 months ago
- 13 comments
Labels: bug
#735 - Bump pytest from 8.2.2 to 8.3.2
Pull Request -
State: closed - Opened by dependabot[bot] 4 months ago
- 1 comment
Labels: dependencies
#734 - huge temp files while uploading data using MDS writer
Issue -
State: open - Opened by MaxxP0 4 months ago
- 2 comments
Labels: bug
#733 - fix azure container name and blob name in download_from_azure
Pull Request -
State: closed - Opened by jaehwana2z 4 months ago
- 1 comment
#732 - [Question] StreamingOutsideDTWebVid cache_limit for video
Issue -
State: closed - Opened by nagadit 4 months ago
- 3 comments
#731 - Bump pytest from 8.2.2 to 8.3.1
Pull Request -
State: closed - Opened by dependabot[bot] 4 months ago
- 1 comment
Labels: dependencies
#730 - Bump uvicorn from 0.30.1 to 0.30.3
Pull Request -
State: closed - Opened by dependabot[bot] 4 months ago
Labels: dependencies