Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / awslabs/slapo issues and pull requests

#113 - fix a file path

Pull Request - State: closed - Opened by eric-haibin-lin 5 months ago

#112 - [Doc] Take automatic version updates in requirements.txt

Pull Request - State: closed - Opened by liangfu 6 months ago

#111 - Add ASPLOS'24 paper

Pull Request - State: closed - Opened by chhzh123 8 months ago

#110 - Bump transformers from 4.28.1 to 4.36.0 in /docs

Pull Request - State: open - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#109 - Bump pygments from 2.13.0 to 2.15.0 in /docs

Pull Request - State: open - Opened by dependabot[bot] over 1 year ago
Labels: dependencies

#107 - [Feature] Remove restriction on sequence length

Pull Request - State: closed - Opened by zarzen over 1 year ago - 1 comment

#106 - Bump transformers from 4.28.1 to 4.30.0 in /docs

Pull Request - State: closed - Opened by dependabot[bot] over 1 year ago - 1 comment
Labels: dependencies

#105 - [Verification] Add pipeline and 3D parallelism (DP/TP/PP) support

Pull Request - State: open - Opened by chhzh123 over 1 year ago

#104 - [Pipeline] [Bugfix] Make 3D compatible with dmlc/DeepSpeed

Pull Request - State: closed - Opened by zarzen over 1 year ago

#103 - [Pipeline] Fix incorrect argument order in the pipeline module

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 4 comments

#102 - [Trace] Enhance HuggingFace tracer to support tracing at arbitrary levels

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 1 comment

#100 - [CI][Docker] Update PyTorch version to 2.0.1

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#99 - [Primitive] .find() & .replace() API enhancement

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#98 - Enquiries about Parameter Sharding

Issue - State: open - Opened by keneoneth over 1 year ago - 3 comments

#97 - [Tracer] Add TorchDynamo support

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 2 comments

#96 - [Autoshard] Auto-parallelism solver

Pull Request - State: open - Opened by chhzh123 over 1 year ago - 1 comment

#95 - [Autoshard] Add resharding support

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#94 - [Verification] Enhanced verifier for end-to-end HF model testing

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#93 - [README] Update README for third-part library installation

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#92 - [Bugfix] Fix QKV matching

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#91 - [Feature] Type Inference of primitives and module selection API change

Issue - State: open - Opened by zarzen over 1 year ago - 7 comments

#90 - [Fix] Silent tracing error

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 3 comments

#89 - Compatibility issues with DeepSpeed main branch

Issue - State: closed - Opened by zarzen over 1 year ago - 3 comments

#88 - [Primitive] Add kwargs to .replace_all()

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 1 comment

#87 - [Verification] Add module verification support

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 3 comments

#86 - [API] Add .named_schedules()

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 1 comment

#85 - [Primitive] Add .replace_all()

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 1 comment

#84 - [Version] Update version to v0.0.3

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#83 - [Feature] End-to-end verification for module correctness

Issue - State: closed - Opened by chhzh123 over 1 year ago - 1 comment

#82 - [Version] Refactor version updating logic

Pull Request - State: closed - Opened by comaniac over 1 year ago - 1 comment

#81 - [Op] Print

Pull Request - State: closed - Opened by comaniac over 1 year ago - 6 comments

#80 - [Bugfix] Shard embedding hooks

Pull Request - State: closed - Opened by comaniac over 1 year ago - 1 comment

#79 - [examples] Refactor dataloader to support BERT

Pull Request - State: closed - Opened by chhzh123 over 1 year ago

#78 - [Primitive] Add fallback fusion

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 4 comments

#77 - [Bugfix] Include other custom LinearWithXX

Pull Request - State: closed - Opened by comaniac over 1 year ago - 1 comment

#76 - [Primitive][fork_rng] Do not replace module

Pull Request - State: closed - Opened by comaniac over 1 year ago - 1 comment

#75 - [CI] Quick fix

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 1 comment

#74 - [Refactor] Modulize sharding methods

Pull Request - State: closed - Opened by comaniac over 1 year ago - 1 comment

#73 - [Op] Fuse bias+dropout in FusedMLP

Pull Request - State: closed - Opened by comaniac over 1 year ago - 2 comments

#72 - [CI] Update CI rules for docs

Pull Request - State: closed - Opened by chhzh123 over 1 year ago - 1 comment

#71 - [Primitive] .annotate() and .trace_until()

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#70 - [Primitive] .fork_rng()

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#69 - [Action] Fix release flow

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#68 - [Refactor] Schedule primitives

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#67 - [Feature] A primitive to set `get_cuda_rng_tracker()`

Issue - State: closed - Opened by comaniac almost 2 years ago

#66 - [Action] Failed to upload to PYPI

Issue - State: closed - Opened by comaniac almost 2 years ago

#65 - [Examples] Enable launch with torchrun

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#64 - Enable launch training with torchrun

Pull Request - State: closed - Opened by zarzen almost 2 years ago

#63 - [Docs] Add initial documentations

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#62 - Add param_name to shard infer type and fix consolidate

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#61 - [Feature] Layernorm Tag

Pull Request - State: closed - Opened by szhengac almost 2 years ago - 1 comment

#60 - [README] Temporary remove paper info

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#59 - [Bugfix] Consolidate params with orig size

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#58 - [Bugfix] Support tree-like subgraph matching

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#57 - [Bugfix] Fix a small device bug

Pull Request - State: closed - Opened by szhengac almost 2 years ago - 3 comments

#56 - [DeepSpeed] Support TP=nGPU and PP=DP=1

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#55 - [Schedule] Support partial checkpointing

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago

#54 - [Test] Add default initialization test

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#53 - [Examples] Move examples to slapo.model_schedule

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 6 comments

#52 - [Schedule] Create subschedule for subgraph replacement

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago

#51 - [Refactor] model_dialect -> framework_dialect

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#50 - [Bugfix] Fix tensor device

Pull Request - State: closed - Opened by szhengac almost 2 years ago

#49 - [Op] Add flash-attention CUDA kernel

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#48 - Add num_workers to GPT dataloader

Pull Request - State: closed - Opened by szhengac almost 2 years ago - 1 comment

#47 - [Release] v0.0.2

Issue - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#46 - [Op] Refactor qkv processing

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#45 - [Model] Add HuggingFace GPT-2

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#44 - [Tracer] Remove SelfAttention renaming

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#42 - [Example] Use .fuse() primitive when possible

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 6 comments

#41 - [Op] Add attention and bias_gelu ops

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#40 - [GPT] Use flash-attention and enable dropout

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#39 - [Setup] Fix dependency

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#38 - [Random] Random state management

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#37 - [Feature] A systematic tensor parallelism verifier

Issue - State: open - Opened by comaniac almost 2 years ago

#36 - [Bugfix] Fix GPT script

Pull Request - State: closed - Opened by szhengac almost 2 years ago

#35 - [Schedule] Refactor subgraph matching

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 2 comments

#34 - [Bugfix] Using None for mpu when PP > 1

Pull Request - State: closed - Opened by zarzen almost 2 years ago

#33 - [Primitive][shard] Use autograd function for all sync ops

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#32 - [Bugfix] Fix for sharding TP only

Pull Request - State: closed - Opened by zarzen almost 2 years ago - 2 comments

#31 - [Benchmark] Fix ZeRO-3 step log

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#30 - [Tracer] PyTorch tracer does not rename SelfAttention module

Issue - State: closed - Opened by chhzh123 almost 2 years ago - 2 comments

#29 - [Tracer] Add `flatten` argument to .trace()

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 4 comments

#28 - [Bugfix] Transfer hooks in pipeline modules

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#27 - [Schedule][replace] Transfer hooks when replacing modules

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#26 - [Bugfix] Fix GPT script

Pull Request - State: closed - Opened by szhengac almost 2 years ago - 1 comment

#25 - [Schedule] Add .fuse() primitive

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 4 comments

#24 - [Bug] Original hooks are discarded in pipeline module

Issue - State: closed - Opened by chhzh123 almost 2 years ago

#23 - [Schedule] Fix linear bias after row sharding weight (#10)

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 5 comments

#22 - [Examples] Add disable_flash_attn

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#21 - [Feature] Random seed management for dropout layers in distributed environment

Issue - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#20 - [Bugfix] Fix sequence parallelism

Pull Request - State: closed - Opened by szhengac almost 2 years ago - 1 comment

#19 - [Pipeline] Drop last batch in DeepSpeed scripts

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#18 - [Release] Setup wheel and release scripts

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#17 - [Bugfix] Fix schedule and dockerfile

Pull Request - State: closed - Opened by comaniac almost 2 years ago

#16 - [Test] Add tracer unit tests

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 1 comment

#15 - [Pipeline] Register tie weights

Pull Request - State: closed - Opened by comaniac almost 2 years ago - 1 comment

#14 - [Test] Add end-to-end tests

Pull Request - State: closed - Opened by chhzh123 almost 2 years ago - 3 comments