Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / luyug/GradCache issues and pull requests

#33 - how to deal with same encoder

Issue - State: open - Opened by CSWellesSun 2 months ago

#31 - Implement Grokfast into GradCache

Issue - State: open - Opened by ben-walczak 3 months ago - 2 comments

#30 - traning speed is very slow

Issue - State: open - Opened by liuweie 3 months ago - 4 comments

#28 - Questions about training

Issue - State: open - Opened by MikeDean2367 7 months ago

#26 - [jax] single decorator grad cache

Pull Request - State: closed - Opened by luyug 9 months ago

#25 - distributed loss for multiple GPUs

Issue - State: closed - Opened by x-zb 9 months ago - 4 comments

#24 - Multiple outputs implementation

Issue - State: open - Opened by Soumya-dutta 9 months ago - 1 comment

#23 - Gradient update is extremely slow

Issue - State: open - Opened by AshStuff 9 months ago - 1 comment

#22 - How to use GradCache in non-single input function?

Issue - State: open - Opened by lxx909546478 over 1 year ago

#19 - Surprising OOM error

Issue - State: open - Opened by kawshik8 over 1 year ago - 1 comment

#18 - Thanks to your work! I train CLIP with this project. I have some problems.

Issue - State: closed - Opened by zzk2021 over 1 year ago - 1 comment

#17 - Documentation about autocast

Issue - State: open - Opened by jxmorris12 over 1 year ago

#16 - Tiny numerical differences, Weight updates not perfectly matching

Issue - State: open - Opened by Ar-Kareem almost 2 years ago - 2 comments

#15 - How to handle BatchNorm ?

Issue - State: open - Opened by heleifz about 2 years ago - 1 comment

#14 - Can you please publish this to pypi please

Issue - State: open - Opened by shaileshj2803 over 2 years ago - 2 comments

#13 - the batchsize with the gradcache

Issue - State: open - Opened by here101 over 2 years ago - 8 comments

#12 - TypeError at grad_cache/functional.py:39

Issue - State: closed - Opened by syoungbaak over 2 years ago - 4 comments

#11 - AttributeError: 'GCTrainer' object has no attribute 'scaler'

Issue - State: closed - Opened by ToluClassics over 2 years ago - 5 comments

#10 - Great work! Helped creating sota embeddings

Issue - State: closed - Opened by Muennighoff over 2 years ago

#9 - effective batch size with multiple GPUs

Issue - State: closed - Opened by shaileshj2803 over 2 years ago - 2 comments

#8 - Example with pytorch lightning

Issue - State: open - Opened by shaileshj2803 over 2 years ago - 3 comments

#7 - How does this provide the same gradient as a larger batch size?

Issue - State: open - Opened by sameerkhanna786 over 2 years ago - 6 comments

#6 - Add argument Tensor all gather decorator for Pytorch functional

Pull Request - State: closed - Opened by luyug over 2 years ago

#5 - functional approach with distributed training

Issue - State: open - Opened by kevinlin311tw over 2 years ago - 3 comments

#4 - Requirements of the python env?

Issue - State: closed - Opened by MicPie almost 3 years ago - 1 comment

#3 - Add Jax Support

Pull Request - State: closed - Opened by luyug almost 3 years ago

#2 - Compatibility with Huggingface Trainer

Issue - State: closed - Opened by sh0416 about 3 years ago - 2 comments

#1 - Nice Job

Issue - State: closed - Opened by menghuanlater about 3 years ago - 1 comment