facebookresearch/LLM-QAT issues and pull requests

#32 - Does LLM-QAT support group-wise quantization?

Issue - State: open - Opened by mxjmtxrm 6 months ago

#31 - Runtime error at elif self.deepspeed:

Issue - State: open - Opened by LSB0798 7 months ago

#30 - How to run inference

Issue - State: open - Opened by wangkuiyi 7 months ago

#29 - Looks like an Incorrect READ ME file

Issue - State: closed - Opened by gdsaikrishna 8 months ago - 1 comment

#28 - Question about the training cost

Issue - State: closed - Opened by KimythAnly 8 months ago - 1 comment

#27 - Does this method support chat models as well as Llama-2 models?

Issue - State: open - Opened by Saoyu99 11 months ago - 1 comment

#26 - How long will it take to train

Issue - State: open - Opened by XA23i about 1 year ago - 1 comment

#25 - Accuracy

Issue - State: open - Opened by yileijin over 1 year ago

#24 - Suggest change the README

Issue - State: closed - Opened by jingyao-zhang over 1 year ago - 1 comment

#23 - FileNotFoundError: [Errno 2] No such file or directory: 'wiki2.jsonl'

Issue - State: open - Opened by StiphyJay over 1 year ago - 1 comment

#22 - Training is not working.

Issue - State: open - Opened by XinnuoXu over 1 year ago

#21 - Hi, could you open-source the training data you generated? Thanks!

Issue - State: open - Opened by Xingrun-Xing over 1 year ago - 1 comment

#20 - docs: blockquote cite article format README

Pull Request - State: closed - Opened by guspan-tanadi over 1 year ago
Labels: cla signed

#19 - Inconsistent results with LLM.int8() and SmoothQuant papers

Issue - State: closed - Opened by fxmarty over 1 year ago - 1 comment

#18 - Questions about the valid_dataset format

Issue - State: closed - Opened by TravisL24 over 1 year ago - 1 comment

#17 - Questions about the valid_dataset format

Issue - State: closed - Opened by TravisL24 over 1 year ago

#16 - run run_train.sh, CUDA out of memory

Issue - State: closed - Opened by priscilla-pan over 1 year ago

#15 - no smoothquant in QuantizeLinear

Issue - State: closed - Opened by priscilla-pan over 1 year ago - 5 comments

#14 - Is there an efficient way to generate data?

Issue - State: closed - Opened by benyang0506 over 1 year ago - 3 comments

#13 - The choice of kd_loss_scale

Issue - State: closed - Opened by zhanlaoban over 1 year ago - 1 comment

#12 - can you provide inference example for QuantizeLinear in 8 8 8

Issue - State: closed - Opened by jackzhou121 over 1 year ago - 2 comments

#11 - How should I save the 8 bit model？

Issue - State: closed - Opened by liguodongiot over 1 year ago - 2 comments

#10 - If 4-8-8 is used to do QAT, how to process weight in inference?

Issue - State: closed - Opened by jackzhou121 over 1 year ago - 1 comment

#9 - No randomization operation for the first token in data generation phrase.

Issue - State: closed - Opened by xingyueye over 1 year ago - 1 comment

#8 - Why use clip_tensor[-2.0, 2.0] in the backward?

Issue - State: closed - Opened by jackzhou121 over 1 year ago - 1 comment

#7 - APEX and FSDP can not run

Issue - State: closed - Opened by jackzhou121 over 1 year ago - 1 comment

#6 - why generated data was not used

Issue - State: closed - Opened by jackzhou121 over 1 year ago - 1 comment

#5 - Does this method support Bloom？

Issue - State: closed - Opened by 18140663659 over 1 year ago - 1 comment

#4 - Expects full precision but got torch.bfloat16 error

Issue - State: open - Opened by liguodongiot over 1 year ago - 1 comment

#3 - Hardcoded train paths and configuration for table in readme

Issue - State: closed - Opened by aitorormazabal over 1 year ago - 1 comment

#2 - Why do smoothquant dynamically in the forward() function of QuantizeLinear layer

Issue - State: closed - Opened by Starmys over 1 year ago - 2 comments

#1 - Change teaser image to relative path to properly display on Github

Pull Request - State: closed - Opened by Lyken17 over 1 year ago
Labels: cla signed
