microsoft/DeepSpeed-MII issues and pull requests

#546 - How to use data parallelism in multi gpus inference

Issue - State: open - Opened by hhf-hu 4 days ago

#545 - Issue: Multi-node and Multi-GPU Inference Problems with DeepSpeed MII

Issue - State: open - Opened by lcnmzz00 6 days ago

#544 - Please clarify structured output support

Issue - State: open - Opened by MRYingLEE 7 days ago

#543 - Bug: Removal of mii.pydantic_v1 broke entrypoint scripts

Issue - State: open - Opened by KMouratidis 15 days ago - 3 comments

#542 - Update transformers

Pull Request - State: open - Opened by loadams 18 days ago

#541 - Updating transformers issue with bloom models

Issue - State: open - Opened by loadams 25 days ago

#540 - Updating transformers issue with zero-shot-image-classification

Issue - State: open - Opened by loadams 25 days ago

#539 - Update version.txt

Pull Request - State: closed - Opened by loadams 27 days ago

#538 - Update clang-format version to match DeepSpeed

Pull Request - State: closed - Opened by loadams 27 days ago

#537 - Update path triggers that were incorrect before

Pull Request - State: closed - Opened by loadams 27 days ago

#536 - Non-persistent example fails with KeyError

Issue - State: closed - Opened by jjaymick001 28 days ago - 1 comment

#535 - Update CODEOWNERS

Pull Request - State: closed - Opened by loadams 28 days ago

#534 - Update labels to acquire new runners

Pull Request - State: closed - Opened by loadams 28 days ago

#533 - Update docker container version

Pull Request - State: closed - Opened by loadams 29 days ago

#532 - Logits Processors

Issue - State: open - Opened by psitronic about 1 month ago

#531 - need help understanding profiler in deespeed mio

Issue - State: open - Opened by krishnanpooja about 2 months ago

#530 - Deepspeed mii library issues

Issue - State: closed - Opened by gayatripadmani about 2 months ago - 2 comments

#529 - DeepSpeed with Phi-3-mini-128K-instruct does not generate `<|endoftext|>` token

Issue - State: open - Opened by shubhanshu786 2 months ago - 1 comment

#528 - Repeated token generation with Phi-3-mini for longer context

Issue - State: open - Opened by shubhanshu786 2 months ago

#527 - LoRA Support

Issue - State: open - Opened by bagelbig 2 months ago

#526 - deepspeed MoE all_to_all communication

Issue - State: open - Opened by miaomiaoma0703 2 months ago

#525 - multi model deployment

Issue - State: open - Opened by whcjb 2 months ago - 1 comment

#524 - Fix missing pydantic updates in legacy mii code

Pull Request - State: closed - Opened by loadams 3 months ago

#523 - Question About Offloading and Recomputation

Issue - State: open - Opened by lxnlxnlxnlxnlxn 3 months ago

#522 - Configuration setting to pass parameters to tokenizer while encoding and decoding

Issue - State: open - Opened by krishnanpooja 3 months ago

#521 - OpenAI server fails

Issue - State: open - Opened by nivibilla 3 months ago - 1 comment

#520 - Update version.txt after 0.3.0 release

Pull Request - State: closed - Opened by loadams 3 months ago

#519 - Update supported model list

Pull Request - State: closed - Opened by tohtana 3 months ago

#518 - By default does deepspeed mii use bf16 dtype or fp16?

Issue - State: open - Opened by krishnanpooja 3 months ago

#517 - Confirm PyDantic v2 update passes DS tests

Pull Request - State: closed - Opened by loadams 3 months ago

#516 - FileExistsError: [Errno 17] File exists: '/tmp/mii_cache' ` on generate function call

Issue - State: open - Opened by krishnanpooja 4 months ago

#515 - Fix scheduling for non-persistent pipeline

Pull Request - State: closed - Opened by tohtana 4 months ago

#514 - Can't use Llama 3.1 with MII, ImportError: cannot import name 'Conversation' from 'transformers'

Issue - State: closed - Opened by chuyuanli 4 months ago - 1 comment

#513 - non-persistent example doesn't work on Mixtral-8*7B-v0.1

Issue - State: open - Opened by tang-t21 4 months ago