openlmlab/lomo issues and pull requests

#37 - 公式4疑问

Issue - State: closed - Opened by yaorong1996 over 1 year ago - 1 comment

#37 - 公式4疑问

Issue - State: closed - Opened by yaorong1996 over 1 year ago - 1 comment

#37 - 公式4疑问

Issue - State: closed - Opened by yaorong1996 over 1 year ago - 1 comment

#36 - How to calculate the used GPU memory for each part as in the paper?

Issue - State: open - Opened by liming-ai over 1 year ago - 2 comments

#36 - How to calculate the used GPU memory for each part as in the paper?

Issue - State: open - Opened by liming-ai over 1 year ago - 2 comments

#36 - How to calculate the used GPU memory for each part as in the paper?

Issue - State: open - Opened by liming-ai over 1 year ago - 2 comments

#35 - LOMO+QLoRA简单更改后的报错

Issue - State: closed - Opened by 00drdelius over 1 year ago - 7 comments

#35 - LOMO+QLoRA简单更改后的报错

Issue - State: closed - Opened by 00drdelius over 1 year ago - 7 comments

#35 - LOMO+QLoRA简单更改后的报错

Issue - State: closed - Opened by 00drdelius over 1 year ago - 7 comments

#34 - 请教个问题，LLM 训练会存在 micro-batch 之间需要累积梯度的场景，这种场景也会有优化吗？

Issue - State: closed - Opened by nullnonenilNULL over 1 year ago - 1 comment

#34 - 请教个问题，LLM 训练会存在 micro-batch 之间需要累积梯度的场景，这种场景也会有优化吗？

Issue - State: closed - Opened by nullnonenilNULL over 1 year ago - 1 comment

#34 - 请教个问题，LLM 训练会存在 micro-batch 之间需要累积梯度的场景，这种场景也会有优化吗？

Issue - State: closed - Opened by nullnonenilNULL over 1 year ago - 1 comment

#33 - LORA+LOMO distributed learning

Issue - State: closed - Opened by JiaxiangRen over 1 year ago - 2 comments

#32 - type object 'torch._C._distributed_c10d.ReduceOp' has no attribute 'AVG'

Issue - State: closed - Opened by season1blue over 1 year ago - 4 comments

#31 - Key Error: LOCAL_RANK

Issue - State: closed - Opened by snykral over 1 year ago - 1 comment

#31 - Key Error: LOCAL_RANK

Issue - State: closed - Opened by snykral over 1 year ago - 1 comment

#31 - Key Error: LOCAL_RANK

Issue - State: closed - Opened by snykral over 1 year ago - 1 comment

#30 - about torch.stack(self.grad_norms)

Issue - State: open - Opened by jinzitian over 1 year ago - 3 comments

#30 - about torch.stack(self.grad_norms)

Issue - State: open - Opened by jinzitian over 1 year ago - 3 comments

#30 - about torch.stack(self.grad_norms)

Issue - State: open - Opened by jinzitian over 1 year ago - 3 comments

#29 - 我使用了Resnet50+LOMO优化器，使用cpu去跑，系统内存相比sgd 没有任何变化，请问合理吗

Issue - State: closed - Opened by yaocy over 1 year ago

#29 - 我使用了Resnet50+LOMO优化器，使用cpu去跑，系统内存相比sgd 没有任何变化，请问合理吗

Issue - State: closed - Opened by yaocy over 1 year ago

#29 - 我使用了Resnet50+LOMO优化器，使用cpu去跑，系统内存相比sgd 没有任何变化，请问合理吗

Issue - State: closed - Opened by yaocy over 1 year ago

#28 - llama-33B/llama-65B均报OOM，8*V100跑不起来怎么回事呢？

Issue - State: open - Opened by alisyzhu over 1 year ago - 7 comments

#28 - llama-33B/llama-65B均报OOM，8*V100跑不起来怎么回事呢？

Issue - State: open - Opened by alisyzhu over 1 year ago - 7 comments

#28 - llama-33B/llama-65B均报OOM，8*V100跑不起来怎么回事呢？

Issue - State: open - Opened by alisyzhu over 1 year ago - 7 comments

#27 - Some confusion about the method of the paper

Issue - State: open - Opened by JorunoJobana over 1 year ago - 3 comments

#27 - Some confusion about the method of the paper

Issue - State: open - Opened by JorunoJobana over 1 year ago - 3 comments

#27 - Some confusion about the method of the paper

Issue - State: open - Opened by JorunoJobana over 1 year ago - 3 comments

#26 - Memory consumption first grows up then falls down.

Issue - State: open - Opened by zhenqin96 over 1 year ago - 3 comments

#26 - Memory consumption first grows up then falls down.

Issue - State: open - Opened by zhenqin96 over 1 year ago - 3 comments

#26 - Memory consumption first grows up then falls down.

Issue - State: open - Opened by zhenqin96 over 1 year ago - 3 comments

#25 - Performance Model after Full Fine-tuning by LOMOTrainer

Issue - State: open - Opened by dat-browny over 1 year ago - 9 comments

#25 - Performance Model after Full Fine-tuning by LOMOTrainer

Issue - State: open - Opened by dat-browny over 1 year ago - 9 comments

#25 - Performance Model after Full Fine-tuning by LOMOTrainer

Issue - State: open - Opened by dat-browny over 1 year ago - 9 comments

#24 - ModuleNotFoundError: No module named 'rich' after ' python -m pip install rich'

Issue - State: closed - Opened by SeekPoint over 1 year ago - 1 comment

#24 - ModuleNotFoundError: No module named 'rich' after ' python -m pip install rich'

Issue - State: closed - Opened by SeekPoint over 1 year ago - 1 comment

#23 - wandb permission

Issue - State: closed - Opened by season1blue over 1 year ago - 4 comments

#23 - wandb permission

Issue - State: closed - Opened by season1blue over 1 year ago - 4 comments

#22 - 看了下lomo的代码实现，训练的速度会很慢吗？

Issue - State: closed - Opened by egptee over 1 year ago - 1 comment

#22 - 看了下lomo的代码实现，训练的速度会很慢吗？

Issue - State: closed - Opened by egptee over 1 year ago - 1 comment

#21 - 更充分实验，与Adam的实验效果进行比较

Issue - State: open - Opened by yangjianxin1 over 1 year ago - 6 comments

#21 - 更充分实验，与Adam的实验效果进行比较

Issue - State: open - Opened by yangjianxin1 over 1 year ago - 6 comments

#20 - Is LOMO capable of pre-training a LLM from scratch as well?

Issue - State: open - Opened by YuxingLu613 over 1 year ago - 2 comments

#20 - Is LOMO capable of pre-training a LLM from scratch as well?

Issue - State: open - Opened by YuxingLu613 over 1 year ago - 2 comments

#19 - 我理解是分批次进GPU内存再计算，而速度怎么做到没有下降的？太强了

Issue - State: closed - Opened by guotong1988 over 1 year ago - 6 comments

#19 - 我理解是分批次进GPU内存再计算，而速度怎么做到没有下降的？太强了

Issue - State: closed - Opened by guotong1988 over 1 year ago - 6 comments

#18 - 数据集问题

Issue - State: open - Opened by wanghao-007 over 1 year ago - 14 comments

#18 - 数据集问题

Issue - State: open - Opened by wanghao-007 over 1 year ago - 14 comments

#17 - 4070ti有机会训练一下吗

Issue - State: closed - Opened by EveningLin over 1 year ago - 1 comment

#17 - 4070ti有机会训练一下吗

Issue - State: closed - Opened by EveningLin over 1 year ago - 1 comment

#16 - Question about Memory usage (GB) when training LLaMA-7B under different settings.

Issue - State: open - Opened by kiseliu over 1 year ago - 3 comments

#16 - Question about Memory usage (GB) when training LLaMA-7B under different settings.

Issue - State: open - Opened by kiseliu over 1 year ago - 3 comments

#15 - I can not find the weights after training

Issue - State: closed - Opened by LeeJodie over 1 year ago - 1 comment

#15 - I can not find the weights after training

Issue - State: closed - Opened by LeeJodie over 1 year ago - 1 comment

#14 - the model weight seems not been updated

Issue - State: closed - Opened by henryxiao1997 over 1 year ago - 4 comments

#14 - the model weight seems not been updated

Issue - State: closed - Opened by henryxiao1997 over 1 year ago - 4 comments

#13 - Testing with P100 on Kaggle

Issue - State: closed - Opened by Iambestfeed over 1 year ago - 7 comments

#13 - Testing with P100 on Kaggle

Issue - State: closed - Opened by Iambestfeed over 1 year ago - 7 comments

#12 - 是否支持量化的模型呀？

Issue - State: closed - Opened by laoda513 over 1 year ago - 4 comments

#12 - 是否支持量化的模型呀？

Issue - State: closed - Opened by laoda513 over 1 year ago - 4 comments

#11 - 目前支持ChatGLM吗？我一直报错：NotImplementedError: Cannot copy out of meta tensor; no data!

Issue - State: closed - Opened by doubleguy over 1 year ago - 2 comments

#11 - 目前支持ChatGLM吗？我一直报错：NotImplementedError: Cannot copy out of meta tensor; no data!

Issue - State: closed - Opened by doubleguy over 1 year ago - 2 comments

#10 - 4张3090能训练llama13B么，我做了尝试但是失败了

Issue - State: closed - Opened by cc2017111 over 1 year ago - 4 comments

#9 - What is the difference from official PyTorch DDP hooks?

Issue - State: open - Opened by wangkuiyi over 1 year ago - 1 comment

#8 - Train with other datasets collator/loader

Issue - State: closed - Opened by CamaradaLares over 1 year ago - 1 comment

#7 - can you provide the running config of 65b models?

Issue - State: closed - Opened by cyz14 over 1 year ago - 7 comments

#6 - pytorch的loss 的backward不是会把所有相关参数的grads算好并存在.grad中吗？

Issue - State: closed - Opened by egptee over 1 year ago - 2 comments

#5 - [DOC] add comments in lomo.py and add dependencies in the readme

Pull Request - State: closed - Opened by QipengGuo over 1 year ago - 1 comment

#4 - Gradient accumulation

Issue - State: closed - Opened by EladDv over 1 year ago - 2 comments

#3 - The implementation of LOMO is not released?

Issue - State: closed - Opened by Amshaker over 1 year ago

#2 - time cost of 7b model training compared to AdamW

Issue - State: closed - Opened by dawnranger over 1 year ago - 2 comments

#1 - add downstream experiments

Pull Request - State: closed - Opened by ayyyq over 1 year ago

GitHub / openlmlab/lomo issues and pull requests