Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / arcee-ai/distillkit issues and pull requests

#19 - Setup broken with PyTorch 2.5.x

Issue - State: open - Opened by juliensimon 10 days ago - 1 comment
Labels: bug

#18 - Offline Distillation using Top K Logits

Issue - State: closed - Opened by raghavgarg97 20 days ago - 2 comments

#17 - Models with same architecture but different tokenizer

Issue - State: closed - Opened by bil-ash 23 days ago - 2 comments

#15 - Support Llama 3.2 1B model ?

Issue - State: closed - Opened by JinYu1998 about 1 month ago - 1 comment

#15 - Support Llama 3.2 1B model ?

Issue - State: closed - Opened by JinYu1998 about 1 month ago - 1 comment

#14 - Is Dense to MoE, MoE to Dense or MoE to MoE Distillation supported?

Issue - State: closed - Opened by linux-leo about 2 months ago - 1 comment

#13 - [News] GKD method

Issue - State: closed - Opened by kashif about 2 months ago - 1 comment

#11 - Can it support multi-node training?

Issue - State: open - Opened by zidong-onepiece1 2 months ago

#10 - Deprecated positional argument(s) used in SFTTrainer

Issue - State: closed - Opened by JinYu1998 3 months ago - 7 comments

#9 - After training, the model output cannot stop

Issue - State: closed - Opened by blackblue9 3 months ago - 5 comments

#8 - Update distil_hidden.py for teacher tokenizer

Pull Request - State: closed - Opened by fernando-neto-ai 3 months ago

#7 - Teacher uses student tokenizer in distill_hidden.py

Issue - State: closed - Opened by HeegonJin 3 months ago - 1 comment

#6 - Plan for larger model?

Issue - State: closed - Opened by YixinSong-e 3 months ago - 2 comments

#5 - encoder model distillation?

Issue - State: closed - Opened by riyajatar37003 3 months ago - 1 comment

#4 - CUDA Out of memory issue

Issue - State: closed - Opened by avemio-digital 3 months ago - 8 comments

#3 - RuntimeError: 'weight' must be 2-D

Issue - State: open - Opened by Hasan-Syed25 3 months ago - 6 comments

#1 - added some initial logic to load the teacher logits

Pull Request - State: open - Opened by shamanez 4 months ago - 3 comments