Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / openlmlab/lomo issues and pull requests
#37 - Question about Equation 4
Issue -
State: closed - Opened by yaorong1996 over 1 year ago
- 1 comment
#36 - How to calculate the used GPU memory for each part as in the paper?
Issue -
State: open - Opened by liming-ai over 1 year ago
- 2 comments
#35 - Error after a simple modification for LOMO+QLoRA
Issue -
State: closed - Opened by 00drdelius over 1 year ago
- 7 comments
#34 - A question: LLM training can require accumulating gradients across micro-batches; is that scenario optimized as well?
Issue -
State: closed - Opened by nullnonenilNULL over 1 year ago
- 1 comment
#33 - LORA+LOMO distributed learning
Issue -
State: closed - Opened by JiaxiangRen over 1 year ago
- 2 comments
#32 - type object 'torch._C._distributed_c10d.ReduceOp' has no attribute 'AVG'
Issue -
State: closed - Opened by season1blue over 1 year ago
- 4 comments
#31 - Key Error: LOCAL_RANK
Issue -
State: closed - Opened by snykral over 1 year ago
- 1 comment
#30 - about torch.stack(self.grad_norms)
Issue -
State: open - Opened by jinzitian over 1 year ago
- 3 comments
#29 - I used Resnet50 with the LOMO optimizer on CPU, and system memory did not change at all compared with SGD; is that reasonable?
Issue -
State: closed - Opened by yaocy over 1 year ago
#28 - llama-33B/llama-65B both report OOM; why won't they run on 8*V100?
Issue -
State: open - Opened by alisyzhu over 1 year ago
- 7 comments
#27 - Some confusion about the method of the paper
Issue -
State: open - Opened by JorunoJobana over 1 year ago
- 3 comments
#26 - Memory consumption first grows up then falls down.
Issue -
State: open - Opened by zhenqin96 over 1 year ago
- 3 comments
#25 - Performance Model after Full Fine-tuning by LOMOTrainer
Issue -
State: open - Opened by dat-browny over 1 year ago
- 9 comments
#24 - ModuleNotFoundError: No module named 'rich' after ' python -m pip install rich'
Issue -
State: closed - Opened by SeekPoint over 1 year ago
- 1 comment
#23 - wandb permission
Issue -
State: closed - Opened by season1blue over 1 year ago
- 4 comments
#22 - After reading the lomo code implementation: will training be very slow?
Issue -
State: closed - Opened by egptee over 1 year ago
- 1 comment
#21 - More thorough experiments comparing results with Adam
Issue -
State: open - Opened by yangjianxin1 over 1 year ago
- 6 comments
#20 - Is LOMO capable of pre-training a LLM from scratch as well?
Issue -
State: open - Opened by YuxingLu613 over 1 year ago
- 2 comments
#19 - My understanding is that data is moved into GPU memory in batches before computing, so how is there no slowdown? Impressive
Issue -
State: closed - Opened by guotong1988 over 1 year ago
- 6 comments
#18 - Dataset question
Issue -
State: open - Opened by wanghao-007 over 1 year ago
- 14 comments
#17 - Is there any chance of training on a 4070ti?
Issue -
State: closed - Opened by EveningLin over 1 year ago
- 1 comment
#16 - Question about Memory usage (GB) when training LLaMA-7B under different settings.
Issue -
State: open - Opened by kiseliu over 1 year ago
- 3 comments
#15 - I can not find the weights after training
Issue -
State: closed - Opened by LeeJodie over 1 year ago
- 1 comment
#14 - the model weight seems not been updated
Issue -
State: closed - Opened by henryxiao1997 over 1 year ago
- 4 comments
#13 - Testing with P100 on Kaggle
Issue -
State: closed - Opened by Iambestfeed over 1 year ago
- 7 comments
#12 - Are quantized models supported?
Issue -
State: closed - Opened by laoda513 over 1 year ago
- 4 comments
#11 - Is ChatGLM currently supported? I keep getting the error: NotImplementedError: Cannot copy out of meta tensor; no data!
Issue -
State: closed - Opened by doubleguy over 1 year ago
- 2 comments
#10 - Can four 3090s train llama13B? I tried but failed
Issue -
State: closed - Opened by cc2017111 over 1 year ago
- 4 comments
#9 - What is the difference from official PyTorch DDP hooks?
Issue -
State: open - Opened by wangkuiyi over 1 year ago
- 1 comment
#8 - Train with other datasets collator/loader
Issue -
State: closed - Opened by CamaradaLares over 1 year ago
- 1 comment
#7 - can you provide the running config of 65b models?
Issue -
State: closed - Opened by cyz14 over 1 year ago
- 7 comments
#6 - Doesn't backward on a pytorch loss compute the grads of all related parameters and store them in .grad?
Issue -
State: closed - Opened by egptee over 1 year ago
- 2 comments
#5 - [DOC] add comments in lomo.py and add dependencies in the readme
Pull Request -
State: closed - Opened by QipengGuo over 1 year ago
- 1 comment
#4 - Gradient accumulation
Issue -
State: closed - Opened by EladDv over 1 year ago
- 2 comments
#3 - The implementation of LOMO is not released?
Issue -
State: closed - Opened by Amshaker over 1 year ago
#2 - time cost of 7b model training compared to AdamW
Issue -
State: closed - Opened by dawnranger over 1 year ago
- 2 comments
#1 - add downstream experiments
Pull Request -
State: closed - Opened by ayyyq over 1 year ago