Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / mlfoundations/datacomp issues and pull requests

#90 - List of uids for the main filter baselines

Issue - State: open - Opened by lluisgomez about 2 months ago

#88 - Label Errors in ImageNet-O Eval Set

Issue - State: open - Opened by vishaal27 4 months ago

#87 - Redundant labels in iWILDCAM eval data

Issue - State: open - Opened by vishaal27 4 months ago

#85 - Problems in run train.py

Issue - State: open - Opened by JianchengZ 5 months ago - 3 comments

#84 - Invalid files for Datacomp1B

Issue - State: open - Opened by borisdayma 6 months ago

#83 - ImageNet 21k based filtered dataset

Issue - State: open - Opened by isidentical 7 months ago - 1 comment

#82 - Downloading Commonpool XLarge

Issue - State: open - Opened by zanussbaum 7 months ago

#81 - Average caption length for CommonPool

Issue - State: closed - Opened by BIGBALLON 8 months ago - 1 comment

#80 - Availability of npy indices for large pool

Issue - State: open - Opened by 88cf6a5ff 8 months ago

#79 - ModuleNotFoundError: No module named 'training'

Issue - State: closed - Opened by adymaharana 9 months ago - 2 comments

#77 - About update metadata with the corresponding image sample in shards

Issue - State: open - Opened by ypwang61 10 months ago - 2 comments

#76 - Frequency of Leaderboard Updates

Issue - State: closed - Opened by brunnedu 11 months ago - 1 comment

#75 - Training log

Issue - State: closed - Opened by mactavish91 11 months ago - 1 comment

#74 - metadata readme

Pull Request - State: open - Opened by sagadre 11 months ago

#73 - Pretraining dataset

Issue - State: open - Opened by mactavish91 11 months ago - 1 comment

#72 - Metadata for datacomp-large text-based filter

Issue - State: closed - Opened by aknvictor 11 months ago - 1 comment

#71 - Remove CSAM, if present

Issue - State: open - Opened by ahundt 11 months ago - 5 comments

#70 - Deduplication against evaluation sets

Issue - State: closed - Opened by nopperl about 1 year ago - 1 comment

#69 - `zeroshot_templates` split error for FairFace / UTKFace

Issue - State: closed - Opened by EIFY about 1 year ago - 9 comments

#68 - the normal success rate and downloading speed?

Issue - State: open - Opened by Tycho-Xue about 1 year ago - 1 comment

#67 - 14% of SHA256 hashes not matching

Issue - State: open - Opened by pfischer-nvidia about 1 year ago - 32 comments

#66 - Conda environment build issue

Issue - State: closed - Opened by brunnedu about 1 year ago - 3 comments

#62 - Dataset Size on Leaderboard

Issue - State: closed - Opened by brunnedu about 1 year ago - 1 comment

#61 - FMoW dataset and results variance

Issue - State: closed - Opened by teasgen about 1 year ago - 1 comment

#60 - Add AWS S3 dependencies to environment.yml

Pull Request - State: closed - Opened by 0x2b3bfa0 about 1 year ago - 2 comments

#59 - Usage with AWS S3 and Ray

Issue - State: open - Opened by 0x2b3bfa0 about 1 year ago - 5 comments

#58 - Expose img2dataset distributor

Pull Request - State: open - Opened by 0x2b3bfa0 about 1 year ago - 3 comments

#57 - Tried evaluate the model on a local network only machine

Issue - State: open - Opened by zwsjink about 1 year ago - 4 comments

#56 - Not able to push data to google cloud storage

Issue - State: open - Opened by krmayankb about 1 year ago - 1 comment

#54 - Appendix in the workshop paper submission

Issue - State: closed - Opened by zihengh1 about 1 year ago - 2 comments

#53 - Workshop submission deadline

Issue - State: closed - Opened by mamdouhJ about 1 year ago - 3 comments

#52 - How to precompute and save model-based metric during download?

Issue - State: closed - Opened by bram-w about 1 year ago - 2 comments

#48 - How to achieve exact same # of samples seen?

Issue - State: closed - Opened by zwsjink over 1 year ago - 12 comments

#47 - default to amp_bfloat16 for xlarge

Pull Request - State: closed - Opened by sagadre over 1 year ago

#46 - Text search over CommonPool

Issue - State: closed - Opened by sedol1339 over 1 year ago - 1 comment

#45 - Connection error while half-downloading metadata

Issue - State: open - Opened by sedol1339 over 1 year ago - 3 comments

#44 - [download_upstream] Add flags

Pull Request - State: closed - Opened by NielsRogge over 1 year ago - 1 comment

#42 - Result submission deadline

Issue - State: closed - Opened by bluer555 over 1 year ago - 1 comment

#41 - Downloading DataComp-1B

Issue - State: open - Opened by linzhiqiu over 1 year ago - 1 comment

#40 - Workshop paper submission

Issue - State: closed - Opened by mingtan2 over 1 year ago - 1 comment

#38 - download data

Issue - State: closed - Opened by KylinC over 1 year ago - 1 comment

#37 - train/test splits for downstream tasks

Issue - State: closed - Opened by bluer555 over 1 year ago - 1 comment

#36 - Update environment_osx.yml

Pull Request - State: closed - Opened by zwcolin over 1 year ago

#35 - Style fix

Pull Request - State: closed - Opened by Vaishaal over 1 year ago

#34 - --output_dir does not do correct thing if --output_dir is a cloud path

Issue - State: open - Opened by Vaishaal over 1 year ago - 1 comment

#33 - Metadata download error - OSError: Consistency check failed

Issue - State: closed - Opened by ch-shin over 1 year ago - 7 comments

#32 - Missing training file?

Issue - State: closed - Opened by meghbhalerao over 1 year ago - 1 comment

#30 - Deduplication of eval datasets

Issue - State: open - Opened by borisdayma over 1 year ago - 1 comment

#29 - Updated img2dataset to pull from the Spawning-Inc fork

Pull Request - State: open - Opened by Padge91 over 1 year ago

#28 - Update README on download optimizations.

Pull Request - State: closed - Opened by GeorgiosSmyrnis over 1 year ago

#27 - FileNotFoundError while downloading DataComp-1B

Issue - State: open - Opened by xfgao over 1 year ago - 7 comments

#26 - Adding the detected_language to metadata

Issue - State: closed - Opened by fabiozappo over 1 year ago - 1 comment

#25 - Leaderboard update

Issue - State: closed - Opened by haichaoyu over 1 year ago - 7 comments

#24 - Problems evaluating trained model

Issue - State: closed - Opened by mamdouhJ over 1 year ago - 1 comment

#23 - Instructions to download DataComp-1B.

Pull Request - State: closed - Opened by GeorgiosSmyrnis over 1 year ago

#22 - Consistency between Table 23 and Fig 3

Issue - State: closed - Opened by mingtan2 over 1 year ago - 5 comments

#21 - Update HF evalset cache dir and download script

Pull Request - State: closed - Opened by djghosh13 over 1 year ago

#20 - Can you share the CLIP score calculation script?

Issue - State: closed - Opened by mingtan2 over 1 year ago - 10 comments

#19 - Is there overlap between common-pool and laion-5B?

Issue - State: closed - Opened by zzzzzero over 1 year ago - 1 comment

#18 - Is there any evaluation randomness?

Issue - State: closed - Opened by mingtan2 over 1 year ago - 1 comment

#17 - add clustering code and update main readme.md (#16)

Pull Request - State: closed - Opened by gabrielilharco over 1 year ago

#16 - add clustering code and update main readme.md

Pull Request - State: closed - Opened by JieyuZ2 over 1 year ago

#15 - Any plan to release the baseline checkpoints in the paper?

Issue - State: closed - Opened by mingtan2 over 1 year ago - 1 comment

#14 - Error when creating the environment

Issue - State: closed - Opened by mamdouhJ over 1 year ago - 5 comments

#13 - reading images from within filtering script

Issue - State: closed - Opened by nazMahmoud over 1 year ago - 2 comments

#12 - Update with correct Flickr30k test set

Pull Request - State: closed - Opened by djghosh13 over 1 year ago

#11 - Is it possible to implement data filtering in training script?

Issue - State: closed - Opened by vtddggg over 1 year ago - 7 comments

#10 - baselines code from paper table 3

Pull Request - State: closed - Opened by sagadre over 1 year ago

#9 - Remove symlinks from download_upstream

Pull Request - State: closed - Opened by yaircarmon over 1 year ago

#8 - How can I split the pool? I don't have a large enough storage for all

Issue - State: closed - Opened by JianbangZ over 1 year ago - 2 comments

#7 - add get_cluster_labels_gpu

Pull Request - State: closed - Opened by JieyuZ2 over 1 year ago

#6 - Baselines

Pull Request - State: closed - Opened by sagadre over 1 year ago

#5 - can you publish DataComp-1B directly due to my small storage

Issue - State: closed - Opened by AItechnology over 1 year ago - 1 comment

#4 - add helper func for clustering

Pull Request - State: closed - Opened by JieyuZ2 over 1 year ago

#3 - How to deal with images that cannot be downloaded?

Issue - State: closed - Opened by vtddggg over 1 year ago - 20 comments

#2 - feat: allow custom download options

Pull Request - State: closed - Opened by borisdayma over 1 year ago - 9 comments

#1 - Update README.md

Pull Request - State: closed - Opened by eltociear over 1 year ago