TransformerLensOrg/TransformerLens issues and pull requests

#755 - Upstream update

Pull Request - State: closed - Opened by bryce13950 4 months ago

#754 - [Proposal] Ensure TransformerLens does not load from hugging face when config is passed in

Issue - State: open - Opened by hamind 4 months ago - 2 comments
Labels: complexity-moderate

#753 - Can't load gpt2 or gpt2-medium locally

Issue - State: closed - Opened by hamind 4 months ago

#752 - Demo colab compatibility

Pull Request - State: closed - Opened by bryce13950 4 months ago

#751 - Add support for `Mistral-Nemo-Base-2407` model

Pull Request - State: closed - Opened by ryanhoangt 4 months ago - 8 comments

#750 - [Bug Report] Encountering a possible model format issue with AWQ-INT4 quantized Llama3.1-8B

Issue - State: closed - Opened by TacticalSpoon331 4 months ago - 2 comments

#749 - add transformer diagram

Pull Request - State: closed - Opened by akozlo 4 months ago

#748 - [Bug Report] microsoft/Phi-3.5-mini-instruct could not be loaded into Transformer_Lens Kernel get killed.

Issue - State: closed - Opened by prof-schacht 4 months ago - 3 comments

#747 - [Bug Report] hook_normalized is inconsistent between RMSNorm and LayerNorm

Issue - State: open - Opened by neelnanda-io 4 months ago - 1 comment
Labels: bug, complexity-moderate, breaking-change

#746 - [Proposal] Add example of collecting activations from a single layer.

Issue - State: closed - Opened by adamkarvonen 4 months ago - 6 comments
Labels: demo, complexity-moderate

#745 - release 2.7.1

Pull Request - State: closed - Opened by bryce13950 4 months ago

#744 - Upstream update

Pull Request - State: closed - Opened by bryce13950 4 months ago

#743 - `from_pretrained` has correct return type (i.e. `HookedSAETransformer.from_pretrained` returns `HookedSAETransformer`)

Pull Request - State: closed - Opened by callummcdougall 4 months ago - 1 comment

#742 - Updated broken Slack link

Pull Request - State: closed - Opened by neelnanda-io 4 months ago - 3 comments

#741 - Llama3.1

Pull Request - State: closed - Opened by mylesgoose 4 months ago - 22 comments

#740 - added llama 3.1 models and base for working on mllama

Pull Request - State: closed - Opened by mylesgoose 4 months ago - 5 comments

#739 - Avoid warning in `utils.download_file_from_hf`

Pull Request - State: closed - Opened by albertsgarde 4 months ago

#738 - [Proposal] Usage with openai/transformer-debugger

Issue - State: closed - Opened by fzyzcjy 4 months ago - 4 comments

#737 - [Bug Report] Q cannot be reshaped correctly when model is loaded in 4bit

Issue - State: open - Opened by po13on 4 months ago - 4 comments
Labels: bug, needs-investigation

#736 - Upstream update

Pull Request - State: closed - Opened by bryce13950 5 months ago

#735 - Release 2.7

Pull Request - State: closed - Opened by bryce13950 5 months ago

#734 - Model llama 3.2

Pull Request - State: closed - Opened by bryce13950 5 months ago

#733 - `utils.test_prompt` compares multiple prompts

Pull Request - State: closed - Opened by bryce13950 5 months ago

#732 - Type hooked encoder

Pull Request - State: closed - Opened by bryce13950 5 months ago

#731 - Model llama 3.2

Pull Request - State: closed - Opened by bryce13950 5 months ago

#730 - Fine tune model and using this framework

Issue - State: closed - Opened by nitay16 5 months ago - 3 comments
Labels: question, needs-information

#729 - [Proposal] Guide to adding new models

Issue - State: open - Opened by deven367 5 months ago - 9 comments
Labels: documentation, complexity-moderate

#728 - `utils.test_prompt` compares multiple prompts

Pull Request - State: closed - Opened by callummcdougall 5 months ago - 12 comments

#727 - fixed typo

Pull Request - State: closed - Opened by bryce13950 5 months ago

#726 - [Proposal] Warn people when trying to load t5 into HookedTransformer

Issue - State: closed - Opened by bryce13950 5 months ago - 4 comments
Labels: good first issue, complexity-simple

#725 - Fix the bug that tokenize_and_concatenate function not working for small dataset

Pull Request - State: closed - Opened by xy-z-code 5 months ago - 1 comment

#724 - Load state dict with assign to avoid OOMs

Pull Request - State: closed - Opened by cyber-chris 5 months ago - 6 comments

#723 - 2.6

Pull Request - State: closed - Opened by bryce13950 5 months ago

#722 - Redo of #713

Pull Request - State: closed - Opened by bryce13950 5 months ago

#721 - Release 2.5

Pull Request - State: closed - Opened by bryce13950 5 months ago

#720 - [Bug Report] Review current matmul function usages

Issue - State: open - Opened by bryce13950 5 months ago
Labels: bug, complexity-high

#719 - [Proposal] Add frequency-based RoPE support for Llama 3.1 models

Issue - State: closed - Opened by frances720 5 months ago - 3 comments

#718 - Add `allenai/OLMoE-1B-7B-0924`.

Pull Request - State: closed - Opened by joelburget 5 months ago - 7 comments

#717 - Allow loading only first n layers.

Pull Request - State: closed - Opened by joelburget 5 months ago

#716 - HookedTransformerConfig docs string: `weight_init_mode` => `init_mode`

Pull Request - State: closed - Opened by JasonGross 5 months ago - 1 comment

#715 - Fix typo in bug issue template

Pull Request - State: closed - Opened by JasonGross 5 months ago

#714 - [Bug Report] Torch FutureWarning when calling `utils.download_file_from_hf` with `torch==2.4.1`

Issue - State: closed - Opened by albertsgarde 5 months ago - 6 comments
Labels: good first issue, complexity-simple

#713 - Ungrouping GQA

Pull Request - State: closed - Opened by hannamw 5 months ago - 5 comments

#712 - v2.4.1

Pull Request - State: closed - Opened by bryce13950 5 months ago

#711 - support for the Amber model, including checkpoints

Pull Request - State: closed - Opened by tadakeigo 5 months ago

#710 - [Proposal] Add MVP Support For 1-2 Models Per-Modality

Issue - State: open - Opened by 4gatepylon 5 months ago - 4 comments
Labels: complexity-high, discussion

#709 - [Bug Report] Gemma 2 unsupported?

Issue - State: closed - Opened by jasonlim131 6 months ago

#708 - [Bug Report] Gemma-2-2b not found

Issue - State: closed - Opened by jasonlim131 6 months ago - 1 comment

#707 - [Bug Report] `tokenize_and_concatenate` doesn't work with small datasets.

Issue - State: closed - Opened by yash-srivastava19 6 months ago - 2 comments

#706 - revised loading to recycle state dict

Pull Request - State: closed - Opened by bryce13950 6 months ago

#705 - Updated state loading to copy by reference

Pull Request - State: closed - Opened by bryce13950 6 months ago

#704 - [Proposal] Add support for TracrBench

Issue - State: open - Opened by HannesThurnherr 6 months ago - 3 comments
Labels: new-architecture, complexity-high

#703 - Upstream commit update

Pull Request - State: closed - Opened by bryce13950 6 months ago

#702 - Release 2.4.0

Pull Request - State: closed - Opened by bryce13950 6 months ago

#701 - Release v2.3.1

Pull Request - State: closed - Opened by bryce13950 6 months ago

#700 - Recent release

Pull Request - State: closed - Opened by bryce13950 6 months ago

#699 - Improve attention masking

Pull Request - State: closed - Opened by UFO-101 6 months ago - 1 comment

#698 - Vram not used rtx 4090's

Issue - State: closed - Opened by mylesgoose 6 months ago - 1 comment

#697 - How to get the Activation cache while the LLM is generating new tokens?

Issue - State: open - Opened by Meehaohao 6 months ago - 2 comments
Labels: complexity-moderate

#696 - About the cached layernorm scale factors

Issue - State: open - Opened by Meehaohao 6 months ago - 2 comments

#695 - Changed fold_value_biases method to be able to handle multi-gpu.

Pull Request - State: closed - Opened by Heigke 6 months ago - 4 comments

#694 - Update Gemma2 attention scale

Pull Request - State: closed - Opened by mntss 6 months ago - 4 comments

#693 - [Bug Report] Gemma-2-2b-it output logit doesn't match with huggingface

Issue - State: closed - Opened by yeutong 6 months ago - 7 comments
Labels: complexity-high, implementation-inaccuracy

#692 - add a demo for Patchscopes and Generation with Patching

Pull Request - State: closed - Opened by HenryCai11 6 months ago - 2 comments

#691 - [Proposal] Add Llama 3.1 support

Issue - State: closed - Opened by ssuukk 6 months ago - 20 comments
Labels: new-architecture, complexity-moderate

#690 - Python 3.8 removal

Pull Request - State: closed - Opened by bryce13950 6 months ago

#689 - Added gemma-2 2b (#687)

Pull Request - State: closed - Opened by bryce13950 6 months ago

#688 - 2.3.0

Pull Request - State: closed - Opened by bryce13950 6 months ago

#687 - Added gemma-2 2b

Pull Request - State: closed - Opened by curt-tigges 6 months ago - 1 comment

#686 - OSError: gpt2 does not appear to have a file named config.json. Checkout 'https://huggingface.co/gpt2/None' for available files.

Issue - State: closed - Opened by Iust1n2 6 months ago
