Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / tensorflow/datasets issues and pull requests
#5622 - Internal change
Pull Request -
State: closed - Opened by copybara-service[bot] 4 days ago
- 1 comment
#5621 - Move mlcroissant installation to setup.py as we use pip now.
Pull Request -
State: closed - Opened by copybara-service[bot] 4 days ago
- 1 comment
#5620 - Move mlcroissant installation to setup.py as we use pip now.
Pull Request -
State: closed - Opened by copybara-service[bot] 4 days ago
- 1 comment
#5619 - Set both names and IDs in mlc's test fixtures.
Pull Request -
State: closed - Opened by copybara-service[bot] 5 days ago
- 1 comment
#5618 - Set both names and IDs in mlc's test fixtures.
Pull Request -
State: open - Opened by copybara-service[bot] 5 days ago
#5617 - Add the test split to TAO dataset.
Pull Request -
State: open - Opened by copybara-service[bot] 5 days ago
- 1 comment
#5616 - internal
Pull Request -
State: open - Opened by copybara-service[bot] 5 days ago
- 1 comment
#5615 - Support Beam in Croissant preparation.
Pull Request -
State: closed - Opened by copybara-service[bot] 5 days ago
#5614 - Use unique paths for writing shard info for different splits in SplitBuilder.submit_shard_based_generation
Pull Request -
State: closed - Opened by copybara-service[bot] 6 days ago
#5613 - Fix the example for convert_format.py.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 days ago
- 1 comment
#5612 - Move the logic to read the dataset info from the constructor to the info property.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 days ago
#5611 - Add `builder_config_name` property to `DatasetBuilder`
Pull Request -
State: closed - Opened by copybara-service[bot] 7 days ago
#5610 - Remove `dataset` argument from `list_data_dirs` and `get_default_data_dir`.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 days ago
#5609 - Refactor download_manager.py
Pull Request -
State: closed - Opened by copybara-service[bot] 7 days ago
#5608 - Corrupt JPEG data: 211 extraneous bytes before marker 0xd9
Issue -
State: open - Opened by wen020 7 days ago
Labels: help
#5607 - Support register checksums for manually downloaded files.
Pull Request -
State: closed - Opened by copybara-service[bot] 8 days ago
#5606 - Use epath.Path in downloader.py
Pull Request -
State: closed - Opened by copybara-service[bot] 11 days ago
#5605 - Add a ShardDatasetBuilder that creates shards directly.
Pull Request -
State: closed - Opened by copybara-service[bot] 11 days ago
#5604 - Refactor download_manager.py
Pull Request -
State: closed - Opened by copybara-service[bot] 11 days ago
#5603 - Allow pickling support for sub classes of DatasetBuilder.
Pull Request -
State: open - Opened by copybara-service[bot] 11 days ago
- 1 comment
#5602 - Remove deprecated variants from imdb_reviews.
Pull Request -
State: closed - Opened by copybara-service[bot] 13 days ago
- 1 comment
#5601 - Internal
Pull Request -
State: closed - Opened by copybara-service[bot] 14 days ago
#5600 - Internal
Pull Request -
State: closed - Opened by copybara-service[bot] 14 days ago
#5599 - Add visibility descriptions.
Pull Request -
State: closed - Opened by copybara-service[bot] 14 days ago
#5598 - Add more information to the progress bars in convert_format
Pull Request -
State: closed - Opened by copybara-service[bot] 14 days ago
#5597 - Internal change
Pull Request -
State: closed - Opened by copybara-service[bot] 14 days ago
#5596 - Fix the following beam error at pickling for dynamic config builder classes.
Pull Request -
State: open - Opened by copybara-service[bot] 17 days ago
- 1 comment
#5595 - Add FineWeb-Edu dataset to TFDS.
Pull Request -
State: open - Opened by copybara-service[bot] 19 days ago
#5594 - Add RedPajama-V2 dataset to TFDS.
Pull Request -
State: open - Opened by copybara-service[bot] 19 days ago
#5593 - Append dataset name to download dir.
Pull Request -
State: open - Opened by copybara-service[bot] 19 days ago
#5592 - Add relative_download_dir to Resource class.
Pull Request -
State: open - Opened by copybara-service[bot] 19 days ago
#5591 - Make path variables purpose clearer in download_manager.
Pull Request -
State: closed - Opened by copybara-service[bot] 19 days ago
#5590 - Support datasets >= 3.0.0
Pull Request -
State: closed - Opened by copybara-service[bot] 20 days ago
#5589 - Add support for specifying how data should be deserialized in tfds.data_source
Pull Request -
State: closed - Opened by copybara-service[bot] 20 days ago
#5588 - Fix typehints in download modules.
Pull Request -
State: closed - Opened by copybara-service[bot] 21 days ago
#5587 - Use constants.METADATA_FILENAME when loading the metadata file.
Pull Request -
State: closed - Opened by copybara-service[bot] 21 days ago
- 1 comment
#5586 - Add support for multi-threaded use of reraise_with_context.
Pull Request -
State: closed - Opened by copybara-service[bot] 21 days ago
- 1 comment
#5585 - Add progress bar to convert_format_utils.py
Pull Request -
State: closed - Opened by copybara-service[bot] 22 days ago
- 1 comment
#5584 - schizophrenia MRI dataset
Issue -
State: open - Opened by Ezza01 23 days ago
Labels: dataset request
#5583 - Refactor Croissant preparation
Pull Request -
State: closed - Opened by copybara-service[bot] 25 days ago
#5582 - Use mlcroissant's Beam Reader in TFDS in CroissantBuilder.
Pull Request -
State: closed - Opened by copybara-service[bot] 26 days ago
#5581 - Add `get_file_spec` method to `DatasetBuilder`.
Pull Request -
State: closed - Opened by copybara-service[bot] 26 days ago
#5580 - Use the file suffix instead file format enum value
Pull Request -
State: closed - Opened by copybara-service[bot] 27 days ago
#5579 - Internal change
Pull Request -
State: closed - Opened by copybara-service[bot] 27 days ago
- 1 comment
#5578 - Add conversion functions to convert_format_utils in TFDS.
Pull Request -
State: closed - Opened by copybara-service[bot] 28 days ago
- 1 comment
#5577 - Add a dry_run flag to the TFDS CLI.
Pull Request -
State: closed - Opened by copybara-service[bot] 28 days ago
#5576 - Fix validating checksums path.
Pull Request -
State: closed - Opened by copybara-service[bot] 29 days ago
#5575 - Add description to feature repr.
Pull Request -
State: closed - Opened by copybara-service[bot] 29 days ago
- 1 comment
#5574 - Move checksums.tsv to constants.CHECKSUMS_FILENAME
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5573 - Fix `build_test.py`
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5572 - Migrate to simple_parsing
Pull Request -
State: open - Opened by copybara-service[bot] about 1 month ago
#5571 - Add a mechanism to register dataset builder providers.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5570 - Fix tfds builders that try to access gcs even though the data is local.
Pull Request -
State: open - Opened by copybara-service[bot] about 1 month ago
- 1 comment
#5569 - Fix typo in the external tfrecord documentation
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
- 1 comment
#5568 - Add Dolma dataset to TFDS.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5567 - Improve Robustness and Error Handling in ImageFolder Dataset Builder
Pull Request -
State: open - Opened by swalehmwadime about 1 month ago
- 1 comment
#5566 - Fix duke_ultrasound dataset timestamp_id from %Y%m%d%H%M%S to posix timestamp.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5565 - Fix asqa_dataset_builder for numpy2 by using int64 for sample_id feature.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5564 - fix apply_colormap to use with numpy2.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5563 - fix bounding boxes features for numpy2.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5562 - Update beam_utils_test to support numpy2
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5561 - Refactor pytest workflow to use a template workflow.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5560 - Rename smart_buildings_dataset to smart_buildings
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5559 - Draft optional tests workflow.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5558 - Add method to get the split info from a dataset info proto.
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5557 - Add file spec method to SplitInfo
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5556 - Add helper function to check whether data has been generated in a specific file format
Pull Request -
State: closed - Opened by copybara-service[bot] about 1 month ago
#5555 - Updated BigEarthNet v1.0 S2 download link
Pull Request -
State: closed - Opened by kteedle about 1 month ago
- 2 comments
Labels: copybara-import
#5554 - Add a wrapper around itertools.tee to make it a thread-safe
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
#5553 - display reraised error more clearly
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
#5552 - Add smart buildings dataset to tensorflow datasets.
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
- 1 comment
#5551 - Fix nltk version for c4 dataset.
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
#5550 - Avoid rewriting Oxford IIIT Pet images to disk.
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
#5549 - Include `trust_remote_code` flag for code-based HF builders preparation.
Pull Request -
State: open - Opened by copybara-service[bot] about 2 months ago
- 1 comment
#5548 - Add `is_valid_name` function.
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
#5547 - Update Caltech Birds URL to point to the dataset's official website.
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
#5546 - CroissantBuilder does not work on Windows machines
Issue -
State: open - Opened by zwouter about 2 months ago
- 6 comments
Labels: bug
#5545 - Use conversion_utils.to_tfds_name to keep the current convention that xxx/yyy becomes xxx__yyy.
Pull Request -
State: closed - Opened by copybara-service[bot] about 2 months ago
#5544 - caltech_birds2011 url has changed.
Issue -
State: closed - Opened by h-0-0 2 months ago
- 1 comment
Labels: help
#5543 - Handle None values for `description` or `license`.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
#5542 - Fix bug for when the license is None.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5541 - Add a test to check the gated datasets warnings.
Pull Request -
State: open - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5540 - For HuggingFace builders, add gated text to the description and license of gated datasets. Also adds homepage to the dataset info.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5539 - Stream from Hugging Face instead of downloading and preparing everything.
Pull Request -
State: open - Opened by copybara-service[bot] 2 months ago
#5538 - [data request] <telecoms network performance>
Issue -
State: closed - Opened by MngomeT 2 months ago
- 1 comment
Labels: dataset request
#5537 - Add information about blocked versions and configs to dataset_info and restore this information in our ReadOnlyBuilder.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5536 - [numpy] Fix users of NumPy APIs that are removed in NumPy 2.0.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
#5535 - Checksum not matching when building the300w_lp dataset
Issue -
State: closed - Opened by albertxcastro 2 months ago
Labels: bug
#5534 - Better error message to pinpoint already prepared datasets in the wrong format.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
#5533 - Restore 775 as default access mode
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
#5532 - Update GitHub actions to use mlcroissant==1.0.7
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5531 - Leverage mlcroissant's filters in TFDS CroissantBuilder's `_generate_example`.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5530 - Add a heuristic to check that the user may have forgotten the config.
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
#5529 - Dealing with Splits in CroissantBuilder
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5528 - Update GitHub actions to use mlcroissant==1.0.6
Pull Request -
State: closed - Opened by copybara-service[bot] 2 months ago
- 1 comment
#5527 - Add incomplete_files method for when multiple files with the same incomplete prefix are written
Pull Request -
State: closed - Opened by copybara-service[bot] 3 months ago
- 1 comment
#5526 - Update Google Drive URL to in `the300w_lp_dataset_builder.py` with confirmation
Pull Request -
State: closed - Opened by Inokinoki 3 months ago
- 2 comments
Labels: copybara-import
#5525 - Failed to download and load `the300w_lp` dataset through the current Google Drive URL
Issue -
State: open - Opened by Inokinoki 3 months ago
- 1 comment
Labels: bug
#5524 - Refactor conversion functions from huggingface_utils to conversion_utils.
Pull Request -
State: closed - Opened by copybara-service[bot] 3 months ago
- 1 comment
#5523 - Update copy_dataset_info_files.py docstring.
Pull Request -
State: closed - Opened by copybara-service[bot] 3 months ago
- 1 comment