Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / microsoft/DeepSpeedExamples issues and pull requests

#307 - My model Performs Badly...Is GPU memory to small?

Issue - State: open - Opened by Trace2333 over 1 year ago - 12 comments
Labels: deespeed chat, modeling

#305 - New training: Alpaca-lora-zero3 on 2080Ti

Pull Request - State: closed - Opened by bigeagle over 1 year ago - 7 comments

#305 - New training: Alpaca-lora-zero3 on 2080Ti

Pull Request - State: closed - Opened by bigeagle over 1 year ago - 7 comments

#304 - If I use a self-improved transformer architecture, can it support?

Issue - State: open - Opened by liujuncn over 1 year ago
Labels: deespeed chat

#304 - If I use a self-improved transformer architecture, can it support?

Issue - State: open - Opened by liujuncn over 1 year ago
Labels: deespeed chat

#297 - The step2 scoring looks correct but the step3 model is talking gibberish

Issue - State: closed - Opened by panganqi over 1 year ago - 12 comments
Labels: bug, deespeed chat

#297 - The step2 scoring looks correct but the step3 model is talking gibberish

Issue - State: closed - Opened by panganqi over 1 year ago - 12 comments
Labels: bug, deespeed chat

#283 - Does it support lora and pipeline parallel now?

Issue - State: closed - Opened by blldd over 1 year ago - 7 comments
Labels: question, deespeed chat

#279 - RuntimeError: Step 1 exited with non-zero status 1

Issue - State: closed - Opened by yudonglee over 1 year ago - 30 comments
Labels: bug, deespeed chat

#279 - RuntimeError: Step 1 exited with non-zero status 1

Issue - State: closed - Opened by yudonglee over 1 year ago - 30 comments
Labels: bug, deespeed chat

#271 - [Deepspeed-Chat] OOM issue on opt-1.3B on a 8xV100 machine (8x16GB)

Issue - State: closed - Opened by kouroshHakha over 1 year ago - 14 comments

#271 - [Deepspeed-Chat] OOM issue on opt-1.3B on a 8xV100 machine (8x16GB)

Issue - State: closed - Opened by kouroshHakha over 1 year ago - 14 comments

#172 - My deepspeed code is very slow

Issue - State: open - Opened by zhaowei-wang-nlp over 2 years ago - 25 comments

#162 - Is there any example with recent version of Megatron-LM?

Issue - State: open - Opened by cryoco almost 3 years ago - 2 comments

#117 - Fixed dataset bug in bing_bert.

Pull Request - State: closed - Opened by wenting-zhao over 3 years ago - 17 comments

#117 - Fixed dataset bug in bing_bert.

Pull Request - State: closed - Opened by wenting-zhao over 3 years ago - 17 comments

#8 - Bing BERT

Issue - State: open - Opened by tomekrut over 4 years ago - 28 comments