linkedin/Liger-Kernel issues and pull requests

#329 - Cross entropy for packing

Issue - State: open - Opened by fzyzcjy 12 days ago

#328 - Logits seems to be missing

Issue - State: open - Opened by fzyzcjy 13 days ago - 4 comments

#327 - Creation of Python autodoc using Sphinx

Pull Request - State: open - Opened by ParagEkbote 13 days ago

#326 - [AMD] [ROCm] Pick `num_warps` based on platform

Pull Request - State: open - Opened by tjtanaa 14 days ago

#325 - Fix incorrect training of first and last Medusa heads

Pull Request - State: open - Opened by chiwanpark 15 days ago

#324 - Add TVD Loss Kernel

Pull Request - State: open - Opened by saurabhkoshatwar 15 days ago - 1 comment

#323 - Liger kernel does not speed up, and even slow down in many scenarios

Issue - State: closed - Opened by fzyzcjy 16 days ago - 3 comments

#322 - handle updated gradient accumulation fixes from transformers

Pull Request - State: open - Opened by winglian 17 days ago - 1 comment

#321 - added batch norm

Pull Request - State: open - Opened by vulkomilev 18 days ago

#320 - Support FusedLinearCrossEntropy for Gemma2

Pull Request - State: open - Opened by Tcc0403 19 days ago

#319 - Training LLaVA with the Liger kernel results in degraded performance.

Issue - State: open - Opened by y-rok 19 days ago - 1 comment

#318 - fix FLCE AMP issue

Pull Request - State: closed - Opened by yundai424 19 days ago - 5 comments

#317 - Update citation and add tech report

Pull Request - State: closed - Opened by ByronHsu 19 days ago

#316 - Introducing Liger Kernel Guru on Gurubase.io

Pull Request - State: closed - Opened by kursataktas 19 days ago - 2 comments

#315 - mllama patch modifies nn.LayerNorm globally

Issue - State: open - Opened by tyler-romero 21 days ago - 3 comments

#314 - why logits of model outputs is None

Issue - State: closed - Opened by Yangruipis 22 days ago - 2 comments

#313 - AutoLigerKernelForCausalLM is incompatible with AutoModelForCausalLM

Issue - State: closed - Opened by helloworld1 24 days ago

#312 - Memory usage is different between GPUs when using FSDP1 and meta device.

Issue - State: closed - Opened by cailun01 25 days ago - 2 comments

#311 - OOM issue occurs when training Llama-3-8B

Issue - State: closed - Opened by dongwook92 25 days ago - 2 comments

#310 - Add missing ignore_index tests

Pull Request - State: open - Opened by Tcc0403 25 days ago

#309 - Empty Medusa head tensors

Issue - State: open - Opened by vkc1vk 26 days ago - 1 comment

#308 - Move `logits.float()` call

Pull Request - State: closed - Opened by ringohoffman 27 days ago - 2 comments

#307 - Fix dtype mismatch in fused_linear_cross_entropy_forward

Pull Request - State: closed - Opened by kostum123 29 days ago - 1 comment

#306 - Add ignore_index and label to jsd and fl-jsd

Pull Request - State: closed - Opened by Tcc0403 29 days ago

#305 - RuntimeError due to dtype mismatch in fused_linear_cross_entropy_forward

Issue - State: closed - Opened by kostum123 29 days ago - 5 comments
Labels: bug

#304 - Added contributors and back to top

Pull Request - State: closed - Opened by barbarian360 29 days ago

#303 - how to use with peft lora?

Issue - State: closed - Opened by YooSungHyun 30 days ago - 1 comment

#302 - Monkey patch layer norm in mllama

Pull Request - State: closed - Opened by shivam15s 30 days ago

#301 - Add end-to-end example of mllama

Issue - State: open - Opened by shivam15s about 1 month ago

#300 - Add FusedLinearJSD

Pull Request - State: closed - Opened by Tcc0403 about 1 month ago

#299 - Add Fused Linear JSD

Pull Request - State: closed - Opened by Tcc0403 about 1 month ago

#298 - Add fused_linear_jsd

Pull Request - State: closed - Opened by Tcc0403 about 1 month ago

#297 - Add TensorFlow Check and Improve Package Listing in Environment Report

Pull Request - State: closed - Opened by Vanshika110 about 1 month ago

#296 - fix typos and improve grammar in README.md

Pull Request - State: closed - Opened by amantyagiprojects about 1 month ago

#295 - Update rope.py

Pull Request - State: closed - Opened by Yash-2707 about 1 month ago - 1 comment

#294 - Apache and MIT license reference

Pull Request - State: closed - Opened by momochen about 1 month ago

#293 - chore: update cross_entropy.py

Pull Request - State: closed - Opened by eltociear about 1 month ago

#292 - [DO NOT REVIEW] Add sleep in tests

Pull Request - State: closed - Opened by helloworld1 about 1 month ago

#291 - WIP Set gpu ci env to have HF hub read token

Pull Request - State: closed - Opened by shivam15s about 1 month ago

#290 - Add beta support for jsd

Pull Request - State: closed - Opened by Tcc0403 about 1 month ago - 4 comments

#289 - Cancel in-progress but out-of-date GPU actions

Pull Request - State: closed - Opened by tyler-romero about 1 month ago - 1 comment

#288 - Disable gemma2 and qwen2_vl tests

Pull Request - State: closed - Opened by shimizust about 1 month ago

#287 - Acknowledgement in NOTICE file

Pull Request - State: closed - Opened by momochen about 1 month ago

#286 - Release version 0.3.1

Pull Request - State: closed - Opened by shimizust about 1 month ago - 1 comment

#285 - 2024 Q4 Roadmap

Issue - State: open - Opened by ByronHsu about 1 month ago

#285 - 2024 Q4 Roadmap

Issue - State: open - Opened by ByronHsu about 1 month ago

#284 - poke tests

Pull Request - State: closed - Opened by tyler-romero about 1 month ago

#283 - Add missing Qwen2-VL monkey patch test

Pull Request - State: closed - Opened by tyler-romero about 1 month ago - 1 comment

#282 - Monkeypatch for Llama 3.2-Vision

Pull Request - State: closed - Opened by tyler-romero about 1 month ago - 13 comments

#281 - Add TVD (Total variation distance) Kernel

Issue - State: open - Opened by qingquansong about 1 month ago - 1 comment
Labels: feature

#281 - Add TVD (Total variation distance) Kernel

Issue - State: open - Opened by qingquansong about 1 month ago - 3 comments
Labels: feature

#280 - Post-init model patching fix

Pull Request - State: closed - Opened by shimizust about 1 month ago - 1 comment

#280 - Post-init model patching fix

Pull Request - State: closed - Opened by shimizust about 1 month ago - 1 comment

#279 - Weights are not copied when model is already instantiated

Issue - State: closed - Opened by ByronHsu about 1 month ago - 1 comment

#278 - Add general JSD (w/ beta) support

Issue - State: closed - Opened by qingquansong about 1 month ago - 4 comments
Labels: feature

#277 - Adding ignore index support for divergence losses

Issue - State: closed - Opened by qingquansong about 1 month ago
Labels: feature

#277 - Adding ignore index support for divergence losses

Issue - State: open - Opened by qingquansong about 1 month ago
Labels: feature

#276 - fix qwen2-vl: create correct rope position_ids when position_ids is None

Pull Request - State: closed - Opened by Sanster about 1 month ago

#275 - [Kernel] Flash attention 2

Pull Request - State: open - Opened by remi-or about 1 month ago - 2 comments

#274 - FIX: tl.program_id() does indeed not have a cast method in triton2.3.1

Pull Request - State: closed - Opened by wizyoung about 1 month ago - 1 comment

#273 - Ensure In-place correctness checks work properly

Pull Request - State: open - Opened by Tcc0403 about 2 months ago - 4 comments

#272 - In-place operations in triton kernel might result in incorrect gradient calculations

Issue - State: open - Opened by Tcc0403 about 2 months ago - 3 comments
Labels: bug

#271 - Encountered errors when reproducing lightning training example

Issue - State: open - Opened by ReginaZh about 2 months ago - 3 comments

#270 - Relaxed transformers dependency

Pull Request - State: closed - Opened by shimizust about 2 months ago

#269 - Fix sharing a ResBlock layer for each head in Medusa example

Pull Request - State: closed - Opened by chiwanpark about 2 months ago - 2 comments

#268 - inference qwen2 model ,The reasoning is garbled and ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)

Issue - State: open - Opened by Dujianhua1008 about 2 months ago - 1 comment

#268 - inference qwen2 model ,The reasoning is garbled and ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)

Issue - State: closed - Opened by Dujianhua1008 about 2 months ago - 1 comment

#267 - rename cuda mode to gpu mode

Pull Request - State: closed - Opened by msaroufim about 2 months ago

#266 - Choice of num_warps

Issue - State: open - Opened by Edenzzzz about 2 months ago - 7 comments

#265 - chore: Add Qwen2.5 and Phi3.5 to Readme

Pull Request - State: closed - Opened by tyler-romero about 2 months ago

#264 - Add JSD kernel

Pull Request - State: closed - Opened by Tcc0403 about 2 months ago - 6 comments

#263 - Fix AutoLigerKernelForCausalLM to pass through original kwargs

Pull Request - State: closed - Opened by shimizust about 2 months ago - 1 comment

#262 - Fix/kldiv

Pull Request - State: closed - Opened by S1ro1 about 2 months ago - 13 comments

#261 - Fix assert_verbose_allclose bugs

Pull Request - State: closed - Opened by Tcc0403 about 2 months ago

#260 - Update contributing guide for adding a new model

Pull Request - State: closed - Opened by shivam15s about 2 months ago

#259 - test.utils.assert_verbose_allclose has multiple bugs

Issue - State: closed - Opened by Tcc0403 about 2 months ago - 4 comments

#258 - Optional dependency on transformers

Issue - State: closed - Opened by DuarteMRAlves about 2 months ago - 6 comments

#257 - Loss does not drop when using Liger Kernel at Qwen2.5

Issue - State: open - Opened by Se-Hun about 2 months ago - 9 comments

#256 - Fix a comment typo in flce

Pull Request - State: closed - Opened by Tcc0403 about 2 months ago

#255 - RMSNorm aggregation

Pull Request - State: closed - Opened by Tcc0403 about 2 months ago - 3 comments

#254 - Are you even allowed to do these ops inplace?

Issue - State: closed - Opened by mayank31398 about 2 months ago - 4 comments

#253 - [Model] Pixtral Support

Pull Request - State: open - Opened by AndreSlavescu about 2 months ago - 3 comments

#252 - reverse KL and JSD

Issue - State: closed - Opened by yundai424 about 2 months ago - 1 comment
Labels: feature

#251 - [Easy] Cast program_id to int64 in SwiGLU/GeGLU kernels

Pull Request - State: closed - Opened by hansonw about 2 months ago

#250 - AutoLigerKernelForCausalLM.from_pretrained discards hub_kwargs_names

Issue - State: closed - Opened by uris-opti about 2 months ago - 5 comments

#249 - ValueError when Loading Qwen2-VL Model with Liger Kernel

Issue - State: open - Opened by rahatarinasir about 2 months ago - 1 comment

#248 - Support for Cohere models

Issue - State: open - Opened by nyxkrage about 2 months ago - 1 comment

#247 - Remove debug print statement

Pull Request - State: closed - Opened by EdoardoLuciani about 2 months ago - 1 comment

#246 - Release Liger-Kernel version 0.3.0

Pull Request - State: closed - Opened by qingquansong about 2 months ago

#245 - Is not compatible with DoRA?

Issue - State: open - Opened by gotzmann about 2 months ago - 1 comment
Labels: bug

#244 - Add label smoothing to FLCE and unit tests

Pull Request - State: closed - Opened by Tcc0403 about 2 months ago - 1 comment

#243 - Lable smoothing is not applied and tested in flce

Issue - State: closed - Opened by Tcc0403 about 2 months ago

#242 - Patch Application Relies on Global State and `AutoLigerKernelForCausalLM.from_config` doesn't work properly

Issue - State: closed - Opened by lapp0 2 months ago - 7 comments

#241 - Reasons for upcasting the logits dtype outside the kernel

Issue - State: open - Opened by yzhangcs 2 months ago - 7 comments

#240 - SWIFT Trainer Integration

Pull Request - State: closed - Opened by tastelikefeet 2 months ago

#239 - Support Z Loss in CE

Pull Request - State: open - Opened by Tcc0403 2 months ago - 4 comments
Labels: reviewing

#238 - fused_linear_cross_entropy: Move float32 cast into kernel

Pull Request - State: open - Opened by hansonw 2 months ago - 6 comments

#237 - Optimize fused_linear_cross_entropy when weight does not require grads

Pull Request - State: closed - Opened by hansonw 2 months ago

#236 - Benchmarking phi3 on single A100 40gb GPU: unable to reproduce benchmark results

Issue - State: open - Opened by cosmicBboy 2 months ago - 3 comments

#235 - Compatibility Issue: PEFT and BitsAndBytesConfig with Liger Kernel. Seeking Alternatives for Quantization and LoRA Fine-Tuning.

Issue - State: closed - Opened by GianottiGustavo 2 months ago - 17 comments

GitHub / linkedin/Liger-Kernel issues and pull requests