Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / tensorflow/datasets issues and pull requests

#5622 - Internal change

Pull Request - State: closed - Opened by copybara-service[bot] 4 days ago - 1 comment

#5621 - Move mlcroissant installation to setup.py as we use pip now.

Pull Request - State: closed - Opened by copybara-service[bot] 4 days ago - 1 comment

#5620 - Move mlcroissant installation to setup.py as we use pip now.

Pull Request - State: closed - Opened by copybara-service[bot] 4 days ago - 1 comment

#5619 - Set both names and IDs in mlc's test fixtures.

Pull Request - State: closed - Opened by copybara-service[bot] 5 days ago - 1 comment

#5617 - Add the test split to TAO dataset.

Pull Request - State: open - Opened by copybara-service[bot] 5 days ago - 1 comment

#5616 - internal

Pull Request - State: open - Opened by copybara-service[bot] 5 days ago - 1 comment

#5615 - Support Beam in Croissant preparation.

Pull Request - State: closed - Opened by copybara-service[bot] 5 days ago

#5613 - Fix the example for convert_format.py.

Pull Request - State: closed - Opened by copybara-service[bot] 7 days ago - 1 comment

#5609 - Refactor download_manager.py

Pull Request - State: closed - Opened by copybara-service[bot] 7 days ago

#5608 - Corrupt JPEG data: 211 extraneous bytes before marker 0xd9

Issue - State: open - Opened by wen020 7 days ago
Labels: help

#5606 - Use epath.Path in downloader.py

Pull Request - State: closed - Opened by copybara-service[bot] 11 days ago

#5604 - Refactor download_manager.py

Pull Request - State: closed - Opened by copybara-service[bot] 11 days ago

#5603 - Allow pickling support for sub classes of DatasetBuilder.

Pull Request - State: open - Opened by copybara-service[bot] 11 days ago - 1 comment

#5602 - Remove deprecated variants from imdb_reviews.

Pull Request - State: closed - Opened by copybara-service[bot] 13 days ago - 1 comment

#5601 - Internal

Pull Request - State: closed - Opened by copybara-service[bot] 14 days ago

#5600 - Internal

Pull Request - State: closed - Opened by copybara-service[bot] 14 days ago

#5599 - Add visibility descriptions.

Pull Request - State: closed - Opened by copybara-service[bot] 14 days ago

#5597 - Internal change

Pull Request - State: closed - Opened by copybara-service[bot] 14 days ago

#5595 - Add FineWeb-Edu dataset to TFDS.

Pull Request - State: open - Opened by copybara-service[bot] 19 days ago

#5594 - Add RedPajama-V2 dataset to TFDS.

Pull Request - State: open - Opened by copybara-service[bot] 19 days ago

#5593 - Append dataset name to download dir.

Pull Request - State: open - Opened by copybara-service[bot] 19 days ago

#5592 - Add relative_download_dir to Resource class.

Pull Request - State: open - Opened by copybara-service[bot] 19 days ago

#5590 - Support datasets >= 3.0.0

Pull Request - State: closed - Opened by copybara-service[bot] 20 days ago

#5588 - Fix typehints in download modules.

Pull Request - State: closed - Opened by copybara-service[bot] 21 days ago

#5587 - Use constants.METADATA_FILENAME when loading the metadata file.

Pull Request - State: closed - Opened by copybara-service[bot] 21 days ago - 1 comment

#5586 - Add support for multi-threaded use of reraise_with_context.

Pull Request - State: closed - Opened by copybara-service[bot] 21 days ago - 1 comment

#5585 - Add progress bar to convert_format_utils.py

Pull Request - State: closed - Opened by copybara-service[bot] 22 days ago - 1 comment

#5584 - schizophrenia MRI dataset

Issue - State: open - Opened by Ezza01 23 days ago
Labels: dataset request

#5583 - Refactor Croissant preparation

Pull Request - State: closed - Opened by copybara-service[bot] 25 days ago

#5581 - Add `get_file_spec` method to `DatasetBuilder`.

Pull Request - State: closed - Opened by copybara-service[bot] 26 days ago

#5580 - Use the file suffix instead file format enum value

Pull Request - State: closed - Opened by copybara-service[bot] 27 days ago

#5579 - Internal change

Pull Request - State: closed - Opened by copybara-service[bot] 27 days ago - 1 comment

#5578 - Add conversion functions to convert_format_utils in TFDS.

Pull Request - State: closed - Opened by copybara-service[bot] 28 days ago - 1 comment

#5577 - Add a dry_run flag to the TFDS CLI.

Pull Request - State: closed - Opened by copybara-service[bot] 28 days ago

#5576 - Fix validating checksums path.

Pull Request - State: closed - Opened by copybara-service[bot] 29 days ago

#5575 - Add description to feature repr.

Pull Request - State: closed - Opened by copybara-service[bot] 29 days ago - 1 comment

#5574 - Move checksums.tsv to constants.CHECKSUMS_FILENAME

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5573 - Fix `build_test.py`

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5572 - Migrate to simple_parsing

Pull Request - State: open - Opened by copybara-service[bot] about 1 month ago

#5571 - Add a mechanism to register dataset builder providers.

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5570 - Fix tfds builders that try to access gcs even though the data is local.

Pull Request - State: open - Opened by copybara-service[bot] about 1 month ago - 1 comment

#5569 - Fix typo in the external tfrecord documentation

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago - 1 comment

#5568 - Add Dolma dataset to TFDS.

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5567 - Improve Robustness and Error Handling in ImageFolder Dataset Builder

Pull Request - State: open - Opened by swalehmwadime about 1 month ago - 1 comment

#5564 - fix apply_colormap to use with numpy2.

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5563 - fix bounding boxes features for numpy2.

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5562 - Update beam_utils_test to support numpy2

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5561 - Refactor pytest workflow to use a template workflow.

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5560 - Rename smart_buildings_dataset to smart_buildings

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5559 - Draft optional tests workflow.

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5558 - Add method to get the split info from a dataset info proto.

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5557 - Add file spec method to SplitInfo

Pull Request - State: closed - Opened by copybara-service[bot] about 1 month ago

#5555 - Updated BigEarthNet v1.0 S2 download link

Pull Request - State: closed - Opened by kteedle about 1 month ago - 2 comments
Labels: copybara-import

#5554 - Add a wrapper around itertools.tee to make it a thread-safe

Pull Request - State: closed - Opened by copybara-service[bot] about 2 months ago

#5553 - display reraised error more clearly

Pull Request - State: closed - Opened by copybara-service[bot] about 2 months ago

#5552 - Add smart buildings dataset to tensorflow datasets.

Pull Request - State: closed - Opened by copybara-service[bot] about 2 months ago - 1 comment

#5551 - Fix nltk version for c4 dataset.

Pull Request - State: closed - Opened by copybara-service[bot] about 2 months ago

#5550 - Avoid rewriting Oxford IIIT Pet images to disk.

Pull Request - State: closed - Opened by copybara-service[bot] about 2 months ago

#5549 - Include `trust_remote_code` flag for code-based HF builders preparation.

Pull Request - State: open - Opened by copybara-service[bot] about 2 months ago - 1 comment

#5548 - Add `is_valid_name` function.

Pull Request - State: closed - Opened by copybara-service[bot] about 2 months ago

#5546 - CroissantBuilder does not work on Windows machines

Issue - State: open - Opened by zwouter about 2 months ago - 6 comments
Labels: bug

#5544 - caltech_birds2011 url has changed.

Issue - State: closed - Opened by h-0-0 2 months ago - 1 comment
Labels: help

#5543 - Handle None values for `description` or `license`.

Pull Request - State: closed - Opened by copybara-service[bot] 2 months ago

#5542 - Fix bug for when the license is None.

Pull Request - State: closed - Opened by copybara-service[bot] 2 months ago - 1 comment

#5541 - Add a test to check the gated datasets warnings.

Pull Request - State: open - Opened by copybara-service[bot] 2 months ago - 1 comment

#5538 - [data request] <telecoms network performance>

Issue - State: closed - Opened by MngomeT 2 months ago - 1 comment
Labels: dataset request

#5535 - Checksum not matching when building the300w_lp dataset

Issue - State: closed - Opened by albertxcastro 2 months ago
Labels: bug

#5533 - Restore 775 as default access mode

Pull Request - State: closed - Opened by copybara-service[bot] 2 months ago

#5532 - Update GitHub actions to use mlcroissant==1.0.7

Pull Request - State: closed - Opened by copybara-service[bot] 2 months ago - 1 comment

#5529 - Dealing with Splits in CroissantBuilder

Pull Request - State: closed - Opened by copybara-service[bot] 2 months ago - 1 comment

#5528 - Update GitHub actions to use mlcroissant==1.0.6

Pull Request - State: closed - Opened by copybara-service[bot] 2 months ago - 1 comment

#5526 - Update Google Drive URL to in `the300w_lp_dataset_builder.py` with confirmation

Pull Request - State: closed - Opened by Inokinoki 3 months ago - 2 comments
Labels: copybara-import

#5525 - Failed to download and load `the300w_lp` dataset through the current Google Drive URL

Issue - State: open - Opened by Inokinoki 3 months ago - 1 comment
Labels: bug

#5523 - Update copy_dataset_info_files.py docstring.

Pull Request - State: closed - Opened by copybara-service[bot] 3 months ago - 1 comment