Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / arcee-ai/mergekit issues and pull requests
#453 - N-model ModelStock merging
Issue -
State: open - Opened by vishaal27 8 days ago
- 1 comment
#452 - Moe merging failed
Issue -
State: open - Opened by PsoriasiIR 8 days ago
- 2 comments
#451 - Use sst2 to eval merging
Pull Request -
State: closed - Opened by VivianeGalvao 9 days ago
#450 - Merge Models with Non-Standard Architectures (e.g., Multimodal Models)
Pull Request -
State: open - Opened by ElliotStein 12 days ago
#449 - [question] multi gpu available?
Issue -
State: open - Opened by eunbin079 13 days ago
#448 - Bump version number
Pull Request -
State: closed - Opened by cg123 14 days ago
#447 - mergekit-extract-lora does not extract - the destination is empty
Issue -
State: open - Opened by raulod 16 days ago
- 2 comments
#446 - KeyError model[0] did not exist in tensor?
Issue -
State: open - Opened by FrozzDay 17 days ago
#445 - Report issues regarding the architecture-agnostic branch.
Issue -
State: open - Opened by win10ogod 20 days ago
- 3 comments
#444 - Bump dependencies
Pull Request -
State: closed - Opened by cg123 21 days ago
#442 - RuntimeError: Need to specify cache dir to merge adapters
Issue -
State: closed - Opened by Zolilio 24 days ago
- 1 comment
#441 - Add methods from https://arxiv.org/abs/2405.07813
Pull Request -
State: open - Opened by zsgvivo 25 days ago
- 2 comments
#440 - add methods from https://arxiv.org/abs/2405.07813
Pull Request -
State: closed - Opened by zsgvivo 25 days ago
#439 - 11
Issue -
State: closed - Opened by meiyiyeshi 27 days ago
#438 - [question] `task_arithmetic` simple question
Issue -
State: closed - Opened by eunbin079 27 days ago
- 2 comments
#437 - After the two Qwen1.5-7B-chat models were merged, garbled inference results appeared.
Issue -
State: closed - Opened by Zhangfanfan0101 30 days ago
#435 - Fixed the YML/YAML documentation for Qwen MoE creation
Pull Request -
State: open - Opened by Nottlespike about 1 month ago
- 1 comment
#434 - [request] Support for Vision Language Models
Issue -
State: closed - Opened by NickGao96 about 1 month ago
- 8 comments
#433 - [request]Can it support architectures such as stable diffusion Xl and flux dev?
Issue -
State: open - Opened by win10ogod about 1 month ago
- 2 comments
#432 - Initial implementation of PCB merge method
Pull Request -
State: open - Opened by cg123 about 1 month ago
#431 - Update actors.py
Pull Request -
State: open - Opened by kwon13 about 1 month ago
#430 - Handle merges stored as list instead of space-separated string
Pull Request -
State: closed - Opened by cg123 about 1 month ago
#429 - Update Llama architecture to handle 3b/1b
Pull Request -
State: closed - Opened by cg123 about 1 month ago
#428 - Broken tokenizer in Yi-34B merge
Issue -
State: closed - Opened by Asherathe about 1 month ago
- 3 comments
#427 - I would like to merge the deepseekForCausalLM model. Are there any related examples available?
Issue -
State: open - Opened by xaiocaibi about 1 month ago
#426 - Merging Lora fine-tuned models with MoE
Issue -
State: open - Opened by AmineBechar07 about 2 months ago
#425 - Qwen2.5 14B models are ... sometimes? ... having their token vocabulary truncated down to 'actual'?
Issue -
State: open - Opened by ann-brown about 2 months ago
- 2 comments
#424 - Support for new Llama 3.2 - 1B / 3B ?
Issue -
State: closed - Opened by David-AU-github about 2 months ago
- 12 comments
#423 - Support for Vision Model such as ViT
Issue -
State: open - Opened by redagavin about 2 months ago
#422 - Support for xlm-roberta
Issue -
State: open - Opened by umiron about 2 months ago
- 2 comments
#421 - "mergekit-yaml" not created upon installation
Issue -
State: open - Opened by BovineOverlord about 2 months ago
- 2 comments
#420 - How to use multi GPUs
Issue -
State: open - Opened by liudan193 about 2 months ago
- 1 comment
#419 - would you like to support Qwen2.5 Model?
Issue -
State: closed - Opened by ArcherShirou about 2 months ago
- 1 comment
#418 - Input should be a valid dictionary or instance of MergeConfiguration
Issue -
State: open - Opened by Hugo-Calero about 2 months ago
- 2 comments
#417 - Make Cohere lm_head optional
Pull Request -
State: closed - Opened by cg123 about 2 months ago
#416 - Add Solar And Exaone Model
Pull Request -
State: closed - Opened by shing100 2 months ago
- 1 comment
#415 - Add support Exaone Model
Pull Request -
State: closed - Opened by shing100 2 months ago
- 2 comments
#414 - Re-Train every block with reduced width
Issue -
State: closed - Opened by snapo 2 months ago
#413 - Fix README links
Pull Request -
State: closed - Opened by cg123 2 months ago
#412 - Broken links on main page - " Arcee App"
Issue -
State: closed - Opened by David-AU-github 2 months ago
#411 - The DARE-TIES experiment.
Issue -
State: open - Opened by David-AU-github 3 months ago
- 4 comments
#410 - Cloud Merging
Pull Request -
State: closed - Opened by Jacobsolawetz 3 months ago
#409 - I am having problem merging GPT-Neo
Issue -
State: open - Opened by 2625554780 3 months ago
- 1 comment
#408 - support for GPT-Neo needed!
Issue -
State: closed - Opened by 2625554780 3 months ago
- 2 comments
#407 - Is it possible to merge Mistral 7B and Mistral NeMo 12B?
Issue -
State: open - Opened by azulika 3 months ago
- 1 comment
#406 - Set Gemma2 lm_head optional instead of aliasing to embed_tokens
Pull Request -
State: closed - Opened by cg123 3 months ago
#405 - Add Phi3SmallForCausalLM and tweak Phi3
Pull Request -
State: closed - Opened by cg123 3 months ago
#404 - How can a beginner merge models? YAML file configuration
Issue -
State: open - Opened by yhyub 3 months ago
- 1 comment
#402 - Resolve a runtime error
Issue -
State: open - Opened by yhyub 3 months ago
#401 - Merging two mistral based models with different architectures. Looking for some guidance.
Issue -
State: open - Opened by AshD 3 months ago
- 1 comment
#400 - Example of a config file for task_arithmetic 'negative' operation and a case for 'Task analogies'
Issue -
State: open - Opened by eunbin079 3 months ago
- 1 comment
#399 - Working Example of the Mergkit-Evo
Issue -
State: open - Opened by nthangelane 3 months ago
#398 - passthrough merge error: Tensor model.layers.86.self_attn.k_norm.weight required but not present in model mistralai/Mistral-Large-Instruct-2407
Issue -
State: closed - Opened by AshD 3 months ago
- 2 comments
#397 - MergeKit GUI not working.
Issue -
State: closed - Opened by Abdulhanan535 3 months ago
#396 - Support for Phi-3-Small [Feature ?]
Issue -
State: open - Opened by hammoudhasan 3 months ago
#395 - Error at MoE Qwen 1.5B
Issue -
State: closed - Opened by ehristoforu 3 months ago
- 3 comments
#394 - Null vocab_file Issue with mistral v03 based models when using union tokenizer source
Issue -
State: open - Opened by guillermo-gabrielli-fer 3 months ago
- 2 comments
#393 - Is there a way to run LORA extraction using multi GPU? 70B LORA extraction OOM on 24GB 3090Ti
Issue -
State: open - Opened by Nero10578 3 months ago
- 1 comment
#392 - Example case of task_arithmetic needed
Issue -
State: open - Opened by Opdoop 3 months ago
- 1 comment
#391 - MoE exits itself after expert prompts 100% 2/2
Issue -
State: open - Opened by SameedHusayn 3 months ago
#390 - mergekit saves tied and ignored weights unlike what transformers does when saving
Issue -
State: open - Opened by nyxkrage 3 months ago
#389 - Create Communication Channels for MergeKit
Issue -
State: open - Opened by aditya-cherukuru 3 months ago
#388 - The speed issue with the GTATask.
Issue -
State: open - Opened by daidaiershidi 3 months ago
- 3 comments
#387 - ABM corrections
Pull Request -
State: open - Opened by metric-space 4 months ago
#386 - How to Create a New Merging Method
Issue -
State: open - Opened by Guozhenyuan 4 months ago
- 1 comment
#385 - Result of merging 2 Gemma2 9B models gains 1B parameters somehow
Issue -
State: closed - Opened by jim-plus 4 months ago
- 6 comments
#383 - does not appear to have a file named config.json
Issue -
State: open - Opened by bxf1001 4 months ago
- 2 comments
#382 - Added support for DeepseekV2 model
Pull Request -
State: open - Opened by aditya-29 4 months ago
- 3 comments
#379 - Does mergekit-moe support Qwen?
Issue -
State: open - Opened by hoooooli 4 months ago
- 3 comments
#378 - Questions about Config
Issue -
State: open - Opened by Zheng-Jay 4 months ago
- 2 comments
#377 - mergekit-evolve doesn't account for higher_is_better: false tasks.
Issue -
State: open - Opened by mekaneeky 4 months ago
- 1 comment
#375 - Network is unreachable
Issue -
State: closed - Opened by guanfaqian 4 months ago
- 1 comment
#370 - remove strict version of pydantic
Pull Request -
State: closed - Opened by sreev 4 months ago
- 1 comment
#366 - Add Della merge method
Pull Request -
State: closed - Opened by Tej-Deep 4 months ago
- 6 comments
#364 - gracefully pause evolutionary optimization?
Issue -
State: open - Opened by johnwee1 4 months ago
- 1 comment
#360 - Condense a models layers.
Issue -
State: open - Opened by DewEfresh 4 months ago
- 1 comment
#350 - qwen2-0.5B cannot be merged into MoE
Issue -
State: closed - Opened by letterk 5 months ago
- 4 comments
#341 - Evolutionary Merging out of memory
Issue -
State: open - Opened by ArcherShirou 5 months ago
- 4 comments
#340 - Weights Metrics
Pull Request -
State: open - Opened by ElliotStein 5 months ago
#335 - Merge arbitrary pytorch models
Pull Request -
State: open - Opened by cg123 6 months ago
#333 - `extract_lora.py` improvements and fixes
Pull Request -
State: closed - Opened by jukofyork 6 months ago
- 12 comments
#332 - Add --load-in-4bit and --load-in-8bit for HF eval backend
Pull Request -
State: open - Opened by cg123 6 months ago
#319 - How to merge a VLM and LLM with different model type.
Issue -
State: open - Opened by tanyakansal30 6 months ago
- 1 comment
#312 - Qwen/Qwen1.5-1.8B MoE Merging fails
Issue -
State: closed - Opened by dgolchin 6 months ago
- 4 comments
#251 - Attempt to make zipit work speak the same language as rest of mergekit
Pull Request -
State: closed - Opened by metric-space 7 months ago
#249 - Mainly adding modified M_U computation.
Pull Request -
State: closed - Opened by shamanez 7 months ago
#243 - _pickle.UnpicklingError: Unsupported type torch._tensor._rebuild_from_type_v2
Issue -
State: open - Opened by rangan2510 7 months ago
- 5 comments
#207 - Evolutionary Merging Method
Issue -
State: open - Opened by codelauncher444 8 months ago
- 19 comments
#198 - Idea: Downscaling the K and/or Q matrices for repeated layers in franken-merges?
Issue -
State: open - Opened by jukofyork 8 months ago
- 63 comments
#195 - Add support for GPTBigCodeForCausalLM
Pull Request -
State: closed - Opened by cg123 8 months ago
- 2 comments
#179 - Automatic Weight Calc based on NearSwap
Pull Request -
State: closed - Opened by Steel-skull 9 months ago
- 2 comments
#168 - Support for Merge methods which require some input data?
Issue -
State: open - Opened by ita9naiwa 9 months ago
- 2 comments
#167 - Adds a new method to shuffle/swap values
Pull Request -
State: open - Opened by Ar57m 9 months ago
- 5 comments
#158 - qwen2 architecture definition
Pull Request -
State: closed - Opened by thomasgauthier 9 months ago
- 6 comments
#150 - Fix phi-2 merging to MoE.
Pull Request -
State: closed - Opened by PhilipMay 9 months ago
- 4 comments
#101 - moe - ValidationError: 1 validation error for MergeConfiguration
Issue -
State: closed - Opened by naseerfaheem 10 months ago
- 1 comment
#100 - Adds a way of merging models with different sizes(B)
Pull Request -
State: closed - Opened by Ar57m 10 months ago
- 10 comments
#99 - Add JAISLMHeadModel
Pull Request -
State: closed - Opened by cg123 10 months ago
#98 - JapaneseStableLMAlphaForCausalLM support
Pull Request -
State: closed - Opened by cg123 10 months ago