volcengine/verl issues and pull requests

#83 - [misc] feat: add several useful functions in protocol

Pull Request - State: closed - Opened by PeterSH6 about 1 month ago - 2 comments

#82 - No module named 'megatron.optimizer'

Issue - State: closed - Opened by Wraythh about 1 month ago - 1 comment

#81 - Fix repeat

Pull Request - State: closed - Opened by caoshiyi about 1 month ago - 1 comment

#80 - [rollout] feat: support best-of-n generation in vLLM

Pull Request - State: closed - Opened by PeterSH6 about 1 month ago

#79 - [Pending SGLang] Support torch.compile and allow disabling FSDP

Pull Request - State: open - Opened by fzyzcjy about 1 month ago

#78 - Validation dataset silently drops last batch

Issue - State: closed - Opened by fzyzcjy about 1 month ago - 3 comments

#77 - RayTaskError

Issue - State: closed - Opened by Raf-Chen about 1 month ago - 2 comments
Labels: bug, vllm related

#76 - Resume from checkpoints

Issue - State: open - Opened by fzyzcjy about 1 month ago - 2 comments

#75 - Tiny refactor and cleanup

Pull Request - State: open - Opened by fzyzcjy about 1 month ago - 2 comments

#74 - Support Mlflow, allow forward to have different batch size, compute more metrics

Pull Request - State: closed - Opened by fzyzcjy about 1 month ago - 1 comment

#73 - Question about actor training-rollout resharding

Issue - State: open - Opened by 0oshowero0 about 1 month ago - 2 comments

#72 - [community] docs: fix WeChat link

Pull Request - State: closed - Opened by eric-haibin-lin about 1 month ago

#71 - [community] docs: fix WeChat link

Pull Request - State: closed - Opened by eric-haibin-lin about 1 month ago

#70 - [doc] fix: experiment section url

Pull Request - State: closed - Opened by PeterSH6 about 1 month ago

#69 - does this framework support long-generation such 8k-16k

Issue - State: closed - Opened by yyht about 1 month ago - 3 comments

#68 - Support RLOO/GRPO/REINFORCE?

Issue - State: open - Opened by fzyzcjy about 1 month ago - 24 comments

#67 - Super tiny fix typo in title

Pull Request - State: closed - Opened by fzyzcjy about 1 month ago - 2 comments

#66 - Missing doc about "Algorithm Baselines"

Issue - State: closed - Opened by fzyzcjy about 1 month ago - 6 comments

#65 - [misc] fix: weak reference of WorkerDict in RayTrainer

Pull Request - State: closed - Opened by PeterSH6 about 1 month ago

#64 - Actor model didn't update correctly when upgrade megatron to core-r0.6.0

Issue - State: open - Opened by Wodswos about 2 months ago - 7 comments

#63 - [install] chore: add pyproject.toml. make vllm default dependency

Pull Request - State: closed - Opened by eric-haibin-lin about 2 months ago

#62 - package confilct

Issue - State: closed - Opened by hljjjmssyh about 2 months ago - 1 comment

#61 - [misc] feat: remove @ray.remote on workers to allow inheritance

Pull Request - State: closed - Opened by PeterSH6 about 2 months ago

#60 - fix response_mask index

Pull Request - State: closed - Opened by huiyeruzhou about 2 months ago - 2 comments

#59 - [install] fix: revert pyproj.toml and fix tensordict req

Pull Request - State: closed - Opened by eric-haibin-lin about 2 months ago

#58 - FIRE sampling added.

Pull Request - State: open - Opened by laonahongchen about 2 months ago

#57 - Add pyproject.toml

Pull Request - State: closed - Opened by pcmoritz about 2 months ago - 1 comment

#56 - [example] docs: improve the quickstart documentation

Pull Request - State: closed - Opened by eric-haibin-lin about 2 months ago

#55 - [ppo] chore: remove unused flash_attn dependency, and add docs for GSM8k reward

Pull Request - State: closed - Opened by eric-haibin-lin about 2 months ago - 1 comment

#54 - [algorithm] docs: add steps to reproduce PPO algorithm results

Pull Request - State: closed - Opened by eric-haibin-lin about 2 months ago

#53 - Do we have plans for data packing?

Issue - State: closed - Opened by YixinSong-e about 2 months ago - 7 comments

#52 - [BREAKING][refact]: move actor/critic/hybrid_engine/reward_model/rollout/workers out of ppo directory for reuse

Pull Request - State: closed - Opened by PeterSH6 about 2 months ago

#51 - Question about recomputation in actor module

Issue - State: closed - Opened by 0oshowero0 about 2 months ago - 3 comments

#50 - (fix): fix values response mask in dp critic.

Pull Request - State: closed - Opened by PanAndy about 2 months ago - 4 comments

#49 - [sft] feat: fix sft dataset with latest preprocess code

Pull Request - State: closed - Opened by eric-haibin-lin about 2 months ago

#48 - Calling for Improving Robustness of FSDP-vLLM Rollout

Issue - State: closed - Opened by nwiad about 2 months ago - 4 comments

#47 - api: rename tracking logger to wandb logger type

Pull Request - State: closed - Opened by eric-haibin-lin about 2 months ago

#46 - [ray] latest ray compatibility

Issue - State: closed - Opened by eric-haibin-lin about 2 months ago - 1 comment

#45 - [BREAKING][core] move single_controller into verl directory

Pull Request - State: closed - Opened by PeterSH6 about 2 months ago

#44 - [doc] add a new quickstart section

Pull Request - State: closed - Opened by PeterSH6 about 2 months ago

#43 - [example] add a split placement tutorial

Pull Request - State: closed - Opened by PeterSH6 about 2 months ago - 1 comment