pytorch-labs/float8_experimental issues and pull requests

#353 - Update README.md

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#352 - [wip] add axiswise granularity to Float8Tensor

Pull Request - State: open - Opened by vkuzo about 1 year ago
Labels: CLA Signed

#351 - [wip] add axiswise granularity to Float8Tensor

Pull Request - State: closed - Opened by vkuzo about 1 year ago
Labels: CLA Signed

#350 - [6/x] clean up casting: rename delayed and dynamic casting functions

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#349 - [5/x] clean up casting: cast_to_float8_e4m3_dynamic -> cast_to_float8_dynamic

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#348 - [4/x] clean up casting: ToFloat8ConstrFunc -> hp_tensor_and_scale_to_float8

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#347 - [3/x] clean up casting functions: delete to_fp8_no_autograd

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 3 comments
Labels: CLA Signed, Merged

#346 - [2/x] clean up casting functions: delayed scaling

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 4 comments
Labels: CLA Signed, Merged

#345 - [1/x] clean up casting functions

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 4 comments
Labels: CLA Signed, Merged

#344 - bring back torch.autograd.Function for float8 matmul

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 4 comments
Labels: CLA Signed, Merged

#343 - [2/x] clean up casting functions: delayed scaling

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 1 comment
Labels: CLA Signed

#342 - [1/x] clean up casting functions

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 1 comment
Labels: CLA Signed

#341 - bring back torch.autograd.Function for float8 matmul

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed

#340 - [2/x] clean up casting functions: delayed scaling

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 1 comment
Labels: CLA Signed

#339 - [1/x] clean up casting functions

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 1 comment
Labels: CLA Signed

#338 - [wip] add scaling granularity

Pull Request - State: open - Opened by vkuzo about 1 year ago
Labels: CLA Signed

#337 - [bc-breaking] rename TensorScalingType->ScalingType, Float8TensorCastConfig->CastConfig

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#336 - bring back torch.autograd.Function for float8 matmul

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed

#335 - rename all variables to use input/weight/grad_output notation

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#334 - add per-gemm config to `Float8LinearConfig`

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#333 - [bc-breaking] rename `DelayedScalingRecipe` to `DelayedScalingConfig`

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#332 - [bc-breaking] rename `config.enable_fsdp_fp8_all_gather` to use `float8`

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#331 - Reduced CPU overhead in `precompute_float8_dynamic_scale_for_fsdp`

Pull Request - State: closed - Opened by awgu about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#330 - [bc-breaking] clarify public API of float8_experimental

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#329 - [bc-breaking] rename top level UX to `convert_to_float8_training`

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#328 - [bc-breaking] switch argument order to `module_filter_fn`

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#327 - [bc-breaking] move `emulate` flag to `Float8LinearConfig`

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#326 - [DISCUSSION] fix float8 all-gather in FSDP2 + TP: DTensor(WeightWithDynamicFloat8CastTensor)

Pull Request - State: open - Opened by weifengpy about 1 year ago
Labels: CLA Signed

#325 - [bc-breaking] move tensor scaling configuration to Float8LinearConfig

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#324 - [bc breaking] move most previously-global configs to Float8LinearConfig

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#323 - [bc breaking] change x, w, dL_dY variable names to input, weight, grad_output

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#322 - [bc breaking] unify filtering functions

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#321 - add unit tests for FSDP2 + torch.compile(transformer block)

Pull Request - State: closed - Opened by weifengpy about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#320 - fixes to matmul and linear benchmarks

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#319 - fix README.md description of `swap_linear_with_float8_linear`

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#318 - Any quality benchmarks?

Issue - State: closed - Opened by tsengalb99 about 1 year ago - 1 comment

#317 - update readme

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#316 - bring back torch.autograd.Function

Pull Request - State: open - Opened by vkuzo about 1 year ago
Labels: CLA Signed

#315 - make all 3 gemms in Float8Linear support configurability, not user facing

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#314 - [RFC] Float8 Inference

Issue - State: closed - Opened by drisspg about 1 year ago - 8 comments

#313 - fix linter error in CI

Pull Request - State: closed - Opened by weifengpy about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#312 - support delayed scaling of weight in float8 all-gather

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 4 comments
Labels: CLA Signed, Merged

#311 - delete `swap_linear_with_dynamic` from fsdp2 eager test case

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#310 - move WeightWithDynamicFloat8CastTensor to fsdp_utils.py

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#309 - one more delayed -> dynamic default update

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#308 - fix nits from deletion of Float8DynamicLinear

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#307 - unify linear test cases

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#306 - static scaling support for training

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 1 comment
Labels: CLA Signed

#305 - Add rowwise scaling to Float8Inference module

Pull Request - State: open - Opened by drisspg about 1 year ago
Labels: CLA Signed

#304 - delete Float8DynamicLinear

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#303 - Add utility for filtering out skipped tests in large cross-product groups

Pull Request - State: open - Opened by drisspg about 1 year ago
Labels: CLA Signed

#302 - Add sanity checks to dtensor tests

Pull Request - State: closed - Opened by drisspg about 1 year ago - 1 comment
Labels: CLA Signed

#301 - Thread the scaling type argument throughout fp8

Pull Request - State: open - Opened by drisspg about 1 year ago
Labels: CLA Signed

#300 - [9/x]: make dynamic scaling default in Float8Linear

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#299 - [8/x] make single linear profiling script work with Float8 scaling type

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#298 - [7/x] make profiling script support Float8Linear dynamic scaling

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#297 - [6/x] switch inference tests to use Float8Linear

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#296 - [5/x] make FSDP2 with float8 all-gather work for Float8Linear

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#295 - Adds a test comparing the output of torch.compile and export

Pull Request - State: open - Opened by drisspg about 1 year ago
Labels: CLA Signed

#294 - [4/x] add tests for DTensor TP/SP + Float8Linear

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#293 - [3/x]: simplify FSDP1 test and add coverage for dynamic scaling

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#292 - Float8Tensor.to_original_precision() returns wrong dtype

Issue - State: closed - Opened by ani300 about 1 year ago - 1 comment

#291 - [2/x]: fix numerics integration test and test delayed vs dynamic

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#290 - [1/x]: Make Float8Linear support dynamic scaling

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#287 - Add a Float8LinearInference module to support static, dynamic, and wo quant

Pull Request - State: closed - Opened by drisspg about 1 year ago - 5 comments
Labels: CLA Signed, Merged

#283 - Adding Float8 Linear variants supporting inference-only with lower overhead

Pull Request - State: closed - Opened by cyang49 about 1 year ago - 2 comments
Labels: CLA Signed

#282 - add norm_ffn_norm to profile script

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#281 - QOL improvements to benchmarks/profile_linear_float8.py

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 2 comments
Labels: CLA Signed, Merged

#280 - Docs should say what's the smallest model users will see a benefit for

Issue - State: closed - Opened by msaroufim about 1 year ago - 2 comments
Labels: documentation

#279 - Expected trailing dimension of mat1 to be divisible by 16 but got mat1 shape

Issue - State: closed - Opened by msaroufim about 1 year ago - 4 comments
Labels: documentation

#274 - [QST] Dynamic Scaling

Issue - State: closed - Opened by jeromeku about 1 year ago - 3 comments

#267 - delayed scaling safety logic currently doesn't work with activation checkpointing

Issue - State: closed - Opened by vkuzo about 1 year ago - 1 comment

#266 - [FSDP2] precompute scale after optimizer.step for dynamic scaling

Pull Request - State: closed - Opened by weifengpy about 1 year ago - 7 comments
Labels: CLA Signed, Merged

#259 - memory alignment issue in torch.compile mode

Issue - State: closed - Opened by czmrand about 1 year ago - 1 comment

#258 - [wip] make all 3 gemms in float8 linear configurable

Pull Request - State: closed - Opened by vkuzo about 1 year ago - 1 comment
Labels: CLA Signed

#257 - Float8Linear does not support autocast

Issue - State: closed - Opened by yitzhaklevi about 1 year ago - 2 comments

#254 - Better default for DelayedScalingRecipe.history_len

Issue - State: closed - Opened by vkuzo about 1 year ago

#252 - Added func for precomputing dynamic fp8 casts

Pull Request - State: closed - Opened by awgu over 1 year ago
Labels: CLA Signed

#249 - investigate cuda graphs + dynamic scaling leading to memory fragmentations

Issue - State: closed - Opened by vkuzo over 1 year ago - 1 comment

#246 - add configuration of precision for all 3 gemms

Issue - State: closed - Opened by vkuzo over 1 year ago - 1 comment

#245 - add option to use fast accumulation in the float8 matmul

Issue - State: closed - Opened by vkuzo over 1 year ago - 2 comments

#244 - clean up and document skipping layers logic

Issue - State: closed - Opened by vkuzo over 1 year ago - 1 comment

#243 - addmm implemented incorrectly

Issue - State: closed - Opened by lw over 1 year ago - 1 comment

#242 - Investigate Sam test tolerances

Issue - State: closed - Opened by drisspg over 1 year ago - 1 comment

#238 - torch.inference_mode switches `aten.linear.default, this is not supported`

Issue - State: closed - Opened by michaelfeil over 1 year ago - 4 comments

#237 - Graph breaks in Float8Linear code

Issue - State: closed - Opened by y-sq over 1 year ago - 1 comment

#226 - Issue with running isnan and _is_any_true with Float8Tensor

Issue - State: closed - Opened by snarayan21 over 1 year ago - 5 comments

#223 - Use Activation Hooks failing with AotAutograd for dynamic linear

Issue - State: closed - Opened by drisspg over 1 year ago - 4 comments
Labels: Compile

#207 - torch.compile() not working with Float8Linear

Issue - State: closed - Opened by yitzhaklevi over 1 year ago - 2 comments

#202 - Float8Tensor + DTensor composability

Issue - State: closed - Opened by wanchaol over 1 year ago - 1 comment

#201 - Float8 + Per-Parameter FSDP

Issue - State: closed - Opened by drisspg over 1 year ago - 1 comment
Labels: Tracking

#199 - Hugging Face transformers integration

Issue - State: closed - Opened by younesbelkada over 1 year ago - 1 comment

#195 - Top Level Torch Compile Issue Tracker

Issue - State: closed - Opened by drisspg over 1 year ago - 1 comment
Labels: Tracking

#187 - upcoming feature tracker

Issue - State: closed - Opened by vkuzo over 1 year ago - 3 comments
Labels: Tracking

#185 - Enable fp8_weight recomputation during backwards pass

Issue - State: closed - Opened by drisspg over 1 year ago - 2 comments

#174 - [not for land, yet] example of FSDP with dynamic float8 linear

Pull Request - State: closed - Opened by vkuzo over 1 year ago
Labels: CLA Signed

#156 - Revisit approach to keeping amax/scale buffers in fp32

Issue - State: closed - Opened by awgu over 1 year ago - 1 comment

#119 - Speedup sync_float8_amax_and_scale_history

Issue - State: closed - Opened by drisspg almost 2 years ago - 3 comments
Labels: Perf

#111 - Support for Fused Attention + FP8

Issue - State: closed - Opened by drisspg almost 2 years ago - 1 comment
Labels: Perf

#100 - make benchmarking script output nicer

Pull Request - State: closed - Opened by vkuzo almost 2 years ago
Labels: CLA Signed
