stanfordnlp/pyreft issues and pull requests

#160 - [P1] customize a reward model

Issue - State: open - Opened by XiaoMaGe-hero 6 days ago - 1 comment

#159 - [P1] Add resume checkpointing support (#151)

Pull Request - State: closed - Opened by frankaging 11 days ago

#158 - [P1] Code refactory

Issue - State: open - Opened by frankaging 12 days ago
Labels: help wanted

#157 - [P1] How to load a model from ReftTrainerForCausalLM checkpoint

Issue - State: closed - Opened by weissercn 12 days ago - 2 comments
Labels: enhancement

#156 - [P1] Are make_last_position_supervised_chat_data_module and make_last_position_supervised_data_module the same?

Issue - State: closed - Opened by weissercn 12 days ago - 3 comments

#155 - [P0] Fixing trainer saving due to FSDP integration (#154)

Pull Request - State: closed - Opened by frankaging 13 days ago

#154 - [P0] ValueError: Default process group has not been initialized, please make sure to call init_process_group.

Issue - State: closed - Opened by KristijanBoshev 13 days ago - 4 comments
Labels: bug

#153 - [Minor] Update setup.py

Pull Request - State: closed - Opened by frankaging 14 days ago

#152 - [P0] Enabling FSDP with pyvene backbone change

Pull Request - State: closed - Opened by frankaging 15 days ago

#151 - [P1] How to resume checkpoint?

Issue - State: closed - Opened by mrsempress 27 days ago - 1 comment
Labels: enhancement

#150 - [P1] Weird Memory issues during finetuning

Issue - State: open - Opened by LuisMalhadas about 1 month ago
Labels: question

#148 - [P0] Fix ReftSupervisedDataset for composition

Pull Request - State: closed - Opened by PinetreePantry about 2 months ago

#147 - [P1] AttributeError: 'CausalLMOutputWithPast' object has no attribute 'mean'

Issue - State: open - Opened by krishnardt about 2 months ago - 1 comment
Labels: enhancement

#146 - Fix model load for unsupported models by pyvene

Pull Request - State: open - Opened by Masao-Taketani 2 months ago

#145 - [P2] TypeError: Object of type type is not JSON serializable

Issue - State: closed - Opened by krishnardt 2 months ago - 3 comments
Labels: question

#144 - [P0] Integration with FSDP

Issue - State: closed - Opened by xlxcomputer 2 months ago - 2 comments
Labels: help wanted, engineering

#143 - Add example Multi-GPU training script using pyreft

Pull Request - State: closed - Opened by ramvenkat98 3 months ago

#142 - [P2] Model responses repeat a lot

Issue - State: open - Opened by mfirth-truffle 3 months ago - 1 comment
Labels: question

#141 - [P0] Can't reproduce commonsense reasoning numbers

Issue - State: closed - Opened by dyahadila 4 months ago - 6 comments

#140 - RuntimeError: Numpy is not available

Issue - State: closed - Opened by gdevakumar 4 months ago

#139 - [P1] Question regarding training flag.

Issue - State: open - Opened by m-dev12 4 months ago - 4 comments
Labels: question

#138 - My mistake. Solved.

Issue - State: closed - Opened by heroz-mtaketani 4 months ago

#137 - Support Llama 3.1

Issue - State: closed - Opened by lyy1994 5 months ago - 1 comment

#136 - [P1] GPU Memory usage issue

Issue - State: open - Opened by TranscenderNing 5 months ago - 2 comments
Labels: question

#135 - [P1] Unable to access gradients of intervention blocks

Issue - State: closed - Opened by m-dev12 5 months ago - 1 comment
Labels: question

#134 - [Minor] fix undefined var

Pull Request - State: closed - Opened by frankaging 6 months ago

#133 - [Minor] Update notebook with newer names (#132)

Pull Request - State: closed - Opened by frankaging 6 months ago

#132 - Name 'LearnedSourceLowRankRotatedSpaceIntervention' is not defined in tutorial notebook

Issue - State: closed - Opened by Z1zs 6 months ago - 1 comment
Labels: bug, question

#131 - Check in negated reft notebook

Pull Request - State: closed - Opened by PinetreePantry 6 months ago

#130 - Debug subspace composition notebook impl

Pull Request - State: closed - Opened by PinetreePantry 6 months ago

#129 - [P1] Trouble Getting a ReFT Model with Pythia

Issue - State: closed - Opened by ShayanShamsi 6 months ago - 1 comment
Labels: question

#128 - [P1] Addressing issues in DPO Training (#127)

Pull Request - State: closed - Opened by AmirZur 6 months ago - 1 comment

#127 - [P1] About DPO training

Issue - State: closed - Opened by HongzhengYang 6 months ago - 3 comments

#126 - [P1] Saving/loading issues

Issue - State: open - Opened by aryamanarora 7 months ago

#125 - [P1] Cannot load trained model anymore - "type must be tuple of ints,but got NoneType"

Issue - State: open - Opened by chris-aeviator 7 months ago - 9 comments
Labels: question

#124 - [P1] Reproducing commonsense results

Issue - State: closed - Opened by jacknewsom 7 months ago - 2 comments
Labels: question

#123 - [P0] Fixing LoReFT rotation layer hot loading problem (#114)

Pull Request - State: closed - Opened by frankaging 7 months ago - 2 comments

#122 - [P1] Intervention Locations more than Prefix and Suffix

Issue - State: open - Opened by comeandcode 7 months ago - 5 comments
Labels: question

#121 - [P1] Support for non-HuggingFace models?

Issue - State: closed - Opened by 5hadytru 7 months ago - 1 comment
Labels: question

#120 - RuntimeError: Failed to import transformers.models.blip.modeling_blip due to missing split_torch_state_dict_into_shards from huggingface_hub

Issue - State: closed - Opened by Akhorramrouz 7 months ago - 5 comments

#119 - Support VLBart with ReFT

Pull Request - State: open - Opened by PinetreePantry 7 months ago

#118 - Merge "main" into peterwz-dora

Pull Request - State: closed - Opened by PinetreePantry 7 months ago

#117 - Different Intervention Locations on Different Model Components

Issue - State: closed - Opened by comeandcode 7 months ago - 2 comments

#116 - [P1] Contact information

Issue - State: closed - Opened by automateyournetwork 8 months ago - 1 comment
Labels: question

#115 - [P1] Error(s) in loading state_dict for Linear

Issue - State: open - Opened by Hamana0509 8 months ago - 2 comments
Labels: question

#114 - [P1] Eval time model is not loaded: Unable to replicate results from paper for RoBERTa Base for Glue tasks like CoLa

Issue - State: open - Opened by m-dev12 8 months ago - 50 comments
Labels: bug, question

#113 - [P1] Getting key error in parameter while training REFT using LLAMA3

Issue - State: open - Opened by AkashGhosh 8 months ago - 9 comments
Labels: question

#112 - [P1] Loading ReFT for Llama3 model after fine-tuned with ReFT and LoRA

Issue - State: closed - Opened by Hamana0509 8 months ago - 4 comments
Labels: question

#111 - [P1] If set output_original_output to True in intervenable.generate, can we get the model performance without intervention?

Issue - State: closed - Opened by mrsempress 8 months ago - 1 comment
Labels: question

#110 - [P1] For left_padding in compute_metrics.py

Issue - State: open - Opened by mrsempress 8 months ago - 2 comments
Labels: question

#109 - [P1] Refactor ReftTrainer to save artifacts with the config

Issue - State: open - Opened by BryanWBear 8 months ago - 1 comment
Labels: enhancement, engineering

#108 - Fix: datasets.exceptions.DatasetNotFoundError when training with alpaca_data_cleaned

Pull Request - State: closed - Opened by savadikarc 8 months ago

#107 - [P1] Experimental setup for instruction following experiments in the ReFT paper

Issue - State: closed - Opened by savadikarc 8 months ago - 3 comments
Labels: question

#106 - [P1] TypeError: Object of type type is not JSON serializable

Issue - State: closed - Opened by ajayspatil7 8 months ago - 4 comments
Labels: question

#105 - [P1] Possible to do batch inference?

Issue - State: open - Opened by thistleknot 8 months ago - 3 comments
Labels: question

#104 - [Major][pyreft-core] ReFT next release items

Issue - State: open - Opened by frankaging 9 months ago - 1 comment
Labels: enhancement, engineering

#103 - [P0] Revert back to ortho init as unstable training

Pull Request - State: closed - Opened by frankaging 9 months ago

#102 - [P1] [Error] can not use bfloat16 and TypeError: Object of type type is not JSON serializable

Issue - State: closed - Opened by mrsempress 9 months ago - 22 comments
Labels: question

#101 - transformers_modules.microsoft.Phi-3-mini-4k-instruct.d269012bea6fbe38ce7752c8940fea010eea3383.modeling_phi3.Phi3ForCausalLM

Issue - State: closed - Opened by thistleknot 9 months ago - 2 comments

#100 - [Minor] Basic support of quantization

Pull Request - State: closed - Opened by frankaging 9 months ago

#99 - [P1] Is it possible to merge the base model + REFT model into only model?

Issue - State: closed - Opened by celsowm 9 months ago - 1 comment
Labels: question

#98 - [P1] Loss decrease slow in readme demo when use NousResearch/Llama-2-7b-chat-hf

Issue - State: closed - Opened by svjack 9 months ago - 2 comments
Labels: question

#97 - [P0] Does this project support turning in 4bit or 8bit Quantify?

Issue - State: closed - Opened by svjack 9 months ago - 5 comments
Labels: enhancement, engineering

#96 - [P1] Multiple Positions Intervention

Issue - State: closed - Opened by comeandcode 9 months ago - 1 comment
Labels: question

#95 - [P1] Questions on differences between paper and code

Issue - State: closed - Opened by calpt 9 months ago - 2 comments
Labels: question

#94 - [Minor] Enable lora with loreft training

Pull Request - State: closed - Opened by frankaging 9 months ago

#93 - [P1] support ReFT+PEFT by using ReftModel to wrap PeftModel (#46)

Pull Request - State: closed - Opened by frankaging 9 months ago - 1 comment

#92 - [P1] Transitioning from peft to pyreft for Classification Approach

Issue - State: open - Opened by SaBay89 9 months ago - 2 comments
Labels: question

#91 - [P1] Model Compatibility

Issue - State: closed - Opened by SaBay89 9 months ago - 2 comments
Labels: question

#90 - forward() got an unexpected keyword argument 'unit_locations'

Issue - State: closed - Opened by xerkey 9 months ago - 2 comments

#89 - Title: Fix: Shape Mismatch during Left Padding Adjustment in compute_metrics (Generated by Ana - AI SDE)

Pull Request - State: closed - Opened by ana-ai-sde 9 months ago - 3 comments

#88 - [P1] Loreft example gsm8k train gives: RuntimeError: output with shape [64, 1, 7] doesn't match the broadcast shape [64, 0, 7]

Issue - State: closed - Opened by jaymefosa 9 months ago - 3 comments
Labels: question

#87 - [P1] TypeError: train() takes 1 positional argument but 2 were given

Issue - State: closed - Opened by alpozdarendeli 9 months ago - 1 comment
Labels: question

#86 - [P1] Loading REFT for RoBERTa Models

Issue - State: open - Opened by hSterz 9 months ago - 4 comments
Labels: question

#85 - [P0] Make `make_last_position_supervised_data_module` parallelizable to speed up processing!

Issue - State: open - Opened by truskovskiyk 9 months ago - 2 comments
Labels: enhancement

#84 - [P1] Convert reft model to hf model

Issue - State: closed - Opened by thu-yn 9 months ago - 1 comment
Labels: question

#83 - [P1] Getting error as IntervenableModel.train() takes 1 positional argument but 2 were given

Issue - State: closed - Opened by atharvapatiil 9 months ago - 4 comments
Labels: question

#82 - [P0] Additional intervention arguments are not saved correctly, e.g. `add_bias`

Issue - State: open - Opened by frankaging 9 months ago
Labels: bug

#81 - [P1] How did you create the validation set for Commonsense reasoning hyperparameter tuning?

Issue - State: closed - Opened by Edenzzzz 9 months ago - 5 comments
Labels: question

#80 - Getting issue while loading Phi3 in reft_model

Issue - State: closed - Opened by atharvapatiil 10 months ago - 9 comments

#79 - [P1] RuntimeError: cutlassF: no kernel found to launch!

Issue - State: closed - Opened by ds-praveenkumar 10 months ago - 4 comments
Labels: question

#78 - [P1] catastrophic forgetting

Issue - State: closed - Opened by jiacheo 10 months ago - 1 comment
Labels: question

#77 - [P1] Intuitive-wise, should we keep the projection orthogonal during training?

Issue - State: closed - Opened by Edenzzzz 10 months ago - 2 comments
Labels: question

#76 - ReFT + DPO Tutorial

Pull Request - State: closed - Opened by AmirZur 10 months ago - 1 comment

#75 - [Minor] fix subspace (#72)

Pull Request - State: closed - Opened by frankaging 10 months ago - 1 comment

#74 - [Minor] More refactory to support Llama3 experiments

Pull Request - State: closed - Opened by frankaging 10 months ago

#73 - [P1] Confirmation of alpaca_eval version

Issue - State: closed - Opened by BaohaoLiao 10 months ago - 4 comments
Labels: question

#72 - [P0] compreft.ipynb error = KeyError: 'subspaces'

Issue - State: closed - Opened by RonanKMcGovern 10 months ago - 5 comments
Labels: bug

#71 - [P1] Location of code for "LM training and serving with ReFT"

Issue - State: open - Opened by RonanKMcGovern 10 months ago - 2 comments
Labels: enhancement

#70 - [P2] Pyreft tensorboard integration

Issue - State: open - Opened by PinetreePantry 10 months ago
Labels: bug

#69 - [P1] TypeError: Object of type type is not JSON serializable

Issue - State: closed - Opened by srn-source 10 months ago - 7 comments
Labels: question

#67 - [P0] Why is the number of trainable parameters for prefix-tuning is 0.11%

Issue - State: closed - Opened by BaohaoLiao 10 months ago - 7 comments
Labels: documentation, question

#66 - [P0] Adding DPO Support

Issue - State: closed - Opened by jinzhuoran 10 months ago - 9 comments
Labels: enhancement, help wanted

#65 - [P1] I am bit confused how to reproduce Table 2 (all baselines + main method)

Issue - State: closed - Opened by sanyalsunny111 10 months ago - 3 comments
Labels: question

#64 - [Major] Support Llama3 models

Pull Request - State: closed - Opened by frankaging 10 months ago

#63 - [P1] TGI and vLLM support

Issue - State: open - Opened by RonanKMcGovern 10 months ago - 7 comments
Labels: question

#62 - [P1] MNLI has two validation set, how do you report the score

Issue - State: closed - Opened by BaohaoLiao 10 months ago - 3 comments
Labels: question

#61 - [P1] OpenAI CLIP model

Issue - State: open - Opened by sailfish009 10 months ago
Labels: enhancement

#60 - [P1] How to attend to memorized intervention?

Issue - State: open - Opened by chris-aeviator 10 months ago - 2 comments
Labels: question

#59 - [P0] How do I train more than 1 layer at a time?

Issue - State: closed - Opened by thistleknot 10 months ago - 4 comments
Labels: enhancement
