Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / mosaicml/streaming issues and pull requests

#629 - Bump databricks-sdk from 0.14.0 to 0.22.0

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago - 1 comment
Labels: dependencies

#628 - Bump databricks-sdk from 0.14.0 to 0.21.0

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#627 - Update google-cloud-storage requirement from <2.11.0,>=2.9.0 to >=2.9.0,<2.16.0

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#626 - Bump uvicorn from 0.27.1 to 0.28.0

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago
Labels: dependencies

#625 - Bump pytest from 7.4.4 to 8.1.1

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago - 3 comments
Labels: dependencies

#624 - You must set batch size. There is no other way.

Pull Request - State: closed - Opened by snarayan21 9 months ago - 1 comment

#623 - Add ndarray type

Pull Request - State: closed - Opened by XiaohanZhangCMU 9 months ago - 1 comment

#622 - Reuse S3 session

Issue - State: open - Opened by wouterzwerink 9 months ago - 9 comments
Labels: enhancement

#621 - Update __init__.py

Pull Request - State: closed - Opened by b-chu 9 months ago

#620 - Update __init__.py

Pull Request - State: closed - Opened by b-chu 9 months ago

#619 - Make streaming use the correct number of unique samples with SP/TP

Pull Request - State: closed - Opened by snarayan21 9 months ago - 2 comments

#618 - merge_index will not merge

Issue - State: closed - Opened by stillmatic 9 months ago - 1 comment
Labels: bug

#617 - StreamingDataset hangs after first epoch is over

Issue - State: closed - Opened by apkumar 9 months ago - 1 comment
Labels: bug

#616 - Switch linting workflows to ci-testing repo

Pull Request - State: closed - Opened by b-chu 9 months ago - 2 comments

#615 - Time to yield samples?

Issue - State: closed - Opened by cinjon 9 months ago - 1 comment

#614 - Add allow_unsafe_types args to StreamingCOCO

Pull Request - State: closed - Opened by karan6181 9 months ago - 4 comments

#613 - Unexpected mds format data for json encoding / how to encode list of strings

Issue - State: open - Opened by ssharpe42 9 months ago - 2 comments
Labels: bug

#612 - Add support for registering custom Cloud Uploaders

Issue - State: open - Opened by JAEarly 9 months ago - 4 comments
Labels: enhancement

#611 - Update careers link

Pull Request - State: closed - Opened by milocress 9 months ago

#610 - Bump fastapi from 0.109.0 to 0.110.0

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago
Labels: dependencies

#609 - Bump databricks-sdk from 0.14.0 to 0.20.0

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#608 - Update ruff to 0.2.2

Pull Request - State: closed - Opened by Skylion007 9 months ago - 2 comments

#607 - Expanded replication testing + documentation

Pull Request - State: closed - Opened by snarayan21 9 months ago - 1 comment

#606 - Column logical (not physical) type and allow_schema_mismatch

Pull Request - State: open - Opened by knighton 9 months ago

#605 - Is it possible to store shard metadata with the shard itself?

Issue - State: closed - Opened by universome 9 months ago - 6 comments
Labels: enhancement

#604 - Use type int when initializing SharedMemory size

Pull Request - State: closed - Opened by bchiang2 9 months ago - 3 comments

#603 - Bump databricks-sdk from 0.14.0 to 0.19.1

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#602 - Bump fastapi from 0.109.0 to 0.109.2

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#601 - Bump yamllint from 1.33.0 to 1.35.1

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago
Labels: dependencies

#600 - Bump databricks-sdk from 0.14.0 to 0.19.0

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago - 1 comment
Labels: dependencies

#599 - Bump uvicorn from 0.26.0 to 0.27.1

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies

#598 - [Mistake] Sorry! I was trying to push to my own fork.

Pull Request - State: closed - Opened by denizokt 10 months ago - 1 comment

#597 - Replicating samples across devices (SP / TP enablement)

Pull Request - State: closed - Opened by knighton 10 months ago - 7 comments

#596 - [easy typo fix] fix f-string

Pull Request - State: closed - Opened by bigning 10 months ago - 1 comment

#595 - Bump version to 0.7.4

Pull Request - State: closed - Opened by snarayan21 10 months ago

#594 - Allow writers to overwrite existing data

Pull Request - State: closed - Opened by JAEarly 10 months ago - 4 comments

#592 - Updated documentation for S3-compatible object stores

Pull Request - State: closed - Opened by AIproj 10 months ago

#591 - ValueError: invalid literal for int() with base 10

Issue - State: closed - Opened by murthyrudra 10 months ago - 3 comments
Labels: bug

#590 - parallel merge index

Pull Request - State: open - Opened by XiaohanZhangCMU 10 months ago - 3 comments

#589 - Failed to merge index on multiple MDS on cloudflare R2

Issue - State: closed - Opened by atamano 10 months ago - 3 comments
Labels: bug

#588 - Remove .ci folder and move FILE_HEADER and CODEOWNERS

Pull Request - State: closed - Opened by irenedea 10 months ago

#587 - Change comparison in partitions to include equals

Pull Request - State: closed - Opened by JAEarly 10 months ago - 2 comments

#586 - Add support for Python 3.11 and deprecate Python 3.8

Pull Request - State: closed - Opened by karan6181 10 months ago

#585 - Canonical File Transformations

Pull Request - State: closed - Opened by knighton 10 months ago - 1 comment

#584 - Update misplaced params of _format_remote_index_files

Pull Request - State: closed - Opened by lsongx 10 months ago

#583 - Make yamllint consistent with Composer

Pull Request - State: closed - Opened by b-chu 10 months ago - 1 comment

#582 - Bump uvicorn from 0.26.0 to 0.27.0.post1

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago - 1 comment
Labels: dependencies

#581 - Bump pytest-split from 0.8.1 to 0.8.2

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies

#580 - Update moto requirement from <5,>=4.0 to >=4.0,<6

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies

#579 - Bump databricks-sdk from 0.14.0 to 0.18.0

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago - 1 comment
Labels: dependencies

#578 - On the fly sample filtering and limiting of datasets

Issue - State: open - Opened by ssharpe42 10 months ago - 3 comments
Labels: enhancement

#577 - Fix merge remote index + Pyright Issue Exp

Pull Request - State: closed - Opened by XiaohanZhangCMU 10 months ago

#576 - fix(merge_index): scheme was not well formatted

Pull Request - State: closed - Opened by fwertel 10 months ago - 9 comments

#575 - Download jitter

Pull Request - State: closed - Opened by mvpatel2000 10 months ago - 1 comment

#574 - Add varint to MDS

Pull Request - State: open - Opened by knighton 10 months ago

#573 - Proper cleanup after failed runs

Issue - State: closed - Opened by eldarkurtic 10 months ago - 3 comments
Labels: bug

#572 - Bump uvicorn from 0.25.0 to 0.26.0

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies

#571 - Bump sphinx-tabs from 3.4.4 to 3.4.5

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies

#570 - Use `tempfile.gettempdir()` instead of a hardcoded temp root.

Pull Request - State: closed - Opened by knighton 10 months ago

#569 - Add options to precompute the epoch

Pull Request - State: open - Opened by knighton 10 months ago

#568 - Update license

Pull Request - State: closed - Opened by b-chu 10 months ago - 1 comment

#567 - COCO dataset converter wrongly setup?

Issue - State: closed - Opened by Data-drone 10 months ago - 6 comments
Labels: bug

#566 - Download to temporary path from azure

Pull Request - State: closed - Opened by philipnrmn 10 months ago - 2 comments

#565 - Bump gitpython from 3.1.40 to 3.1.41

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#564 - Bump fastapi from 0.108.0 to 0.109.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#562 - Bump version to 0.7.3

Pull Request - State: closed - Opened by karan6181 11 months ago

#561 - Bump databricks-sdk from 0.14.0 to 0.17.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago - 1 comment
Labels: dependencies

#560 - Update copyright: 2023 -> 2022-2024.

Pull Request - State: closed - Opened by knighton 11 months ago

#559 - merge_index breaks on gs:// downloads

Issue - State: closed - Opened by bryanzwu-diffuse 11 months ago - 6 comments
Labels: bug

#558 - Bump pytest from 7.4.3 to 7.4.4

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#557 - Bump fastapi from 0.104.1 to 0.108.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#556 - Nuke 1) torch dist, 2) shared memory, and 3) filelock

Pull Request - State: open - Opened by knighton 11 months ago

#555 - Add fine-grained timings to Writers

Pull Request - State: open - Opened by knighton 11 months ago

#554 - Q: Can I load local files without uploading them to cloud storage?

Issue - State: closed - Opened by Spico197 11 months ago - 2 comments
Labels: enhancement

#553 - Removing stray print statement

Pull Request - State: closed - Opened by snarayan21 11 months ago

#552 - Let's blow away dist, and also shared memory

Pull Request - State: open - Opened by knighton 11 months ago

#551 - Making Streaming Dataset framework agnostic: Removing PyTorch dependency

Issue - State: open - Opened by Abhijit-2592 11 months ago - 5 comments
Labels: enhancement

#550 - Bump databricks-sdk from 0.14.0 to 0.16.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago - 1 comment
Labels: dependencies

#549 - Bump uvicorn from 0.24.0.post1 to 0.25.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#548 - Bump pydantic from 2.5.2 to 2.5.3

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#546 - Make caching location optional.

Issue - State: open - Opened by PengWenChen 11 months ago - 9 comments

#545 - Advice Needed: handling significant amount of streams

Issue - State: open - Opened by suessmann 11 months ago - 3 comments

#544 - Fixed condition for warning when partitioning over tiny datasets.

Pull Request - State: closed - Opened by snarayan21 11 months ago

#543 - Logging messages from new defaults only show once per rank.

Pull Request - State: closed - Opened by snarayan21 11 months ago

#542 - Bump databricks-sdk from 0.14.0 to 0.15.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago - 1 comment
Labels: dependencies

#541 - Bump fastapi from 0.104.1 to 0.105.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago - 1 comment
Labels: dependencies

#540 - Update google-cloud-storage requirement from <2.11.0,>=2.9.0 to >=2.9.0,<2.15.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago - 1 comment
Labels: dependencies

#539 - Distributed Key Value Tensor Store

Issue - State: open - Opened by OrenLeung 12 months ago - 2 comments
Labels: enhancement

#538 - Parquet streaming [WIP]

Pull Request - State: open - Opened by knighton 12 months ago

#537 - Improve naming: JSON shards are actually JSONL, etc.

Pull Request - State: closed - Opened by knighton 12 months ago

#536 - New storage APIs

Pull Request - State: closed - Opened by knighton 12 months ago

#534 - Replicate allow unsafe types in dev

Pull Request - State: closed - Opened by knighton 12 months ago

#533 - Add benchmarking suite for all backends and formats

Pull Request - State: closed - Opened by knighton 12 months ago

#532 - Bump version to 0.7.2

Pull Request - State: closed - Opened by karan6181 12 months ago

#531 - Add allow_unsafe_types parameter to the streaming regression tests

Pull Request - State: closed - Opened by karan6181 12 months ago

#530 - Redo/generalize/tighten args shorthand

Pull Request - State: closed - Opened by knighton 12 months ago

#529 - Fancy overly long lines command.

Pull Request - State: closed - Opened by knighton 12 months ago - 1 comment