Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / arcee-ai/mergekit issues and pull requests

#453 - N-model ModelStock merging

Issue - State: open - Opened by vishaal27 8 days ago - 1 comment

#452 - Moe merging failed

Issue - State: open - Opened by PsoriasiIR 8 days ago - 2 comments

#451 - Use sst2 to eval merging

Pull Request - State: closed - Opened by VivianeGalvao 9 days ago

#449 - [question] multi gpu available?

Issue - State: open - Opened by eunbin079 13 days ago

#448 - Bump version number

Pull Request - State: closed - Opened by cg123 14 days ago

#447 - mergekit-extract-lora does not extract - the destination is empty

Issue - State: open - Opened by raulod 16 days ago - 2 comments

#446 - KeyError model[0] did not exist in tensor?

Issue - State: open - Opened by FrozzDay 17 days ago

#445 - Report issues regarding the architecture-agnostic branch.

Issue - State: open - Opened by win10ogod 20 days ago - 3 comments

#444 - Bump dependencies

Pull Request - State: closed - Opened by cg123 21 days ago

#442 - RuntimeError: Need to specify cache dir to merge adapters

Issue - State: closed - Opened by Zolilio 24 days ago - 1 comment

#441 - Add methods from https://arxiv.org/abs/2405.07813

Pull Request - State: open - Opened by zsgvivo 25 days ago - 2 comments

#440 - add methods from https://arxiv.org/abs/2405.07813

Pull Request - State: closed - Opened by zsgvivo 25 days ago

#439 - 11

Issue - State: closed - Opened by meiyiyeshi 27 days ago

#438 - [question] `task_arithmetic` simple question

Issue - State: closed - Opened by eunbin079 27 days ago - 2 comments

#435 - Fixed the YML/YAML documentation for Qwen MoE creation

Pull Request - State: open - Opened by Nottlespike about 1 month ago - 1 comment

#434 - [request] Support for Vision Language Models

Issue - State: closed - Opened by NickGao96 about 1 month ago - 8 comments

#432 - Initial implementation of PCB merge method

Pull Request - State: open - Opened by cg123 about 1 month ago

#431 - Update actors.py

Pull Request - State: open - Opened by kwon13 about 1 month ago

#430 - Handle merges stored as list instead of space-separated string

Pull Request - State: closed - Opened by cg123 about 1 month ago

#429 - Update Llama architecture to handle 3b/1b

Pull Request - State: closed - Opened by cg123 about 1 month ago

#428 - Broken tokenizer in Yi-34B merge

Issue - State: closed - Opened by Asherathe about 1 month ago - 3 comments

#426 - Merging Lora fine-tuned models with MoE

Issue - State: open - Opened by AmineBechar07 about 2 months ago

#424 - Support for new Llama 3.2 - 1B / 3B ?

Issue - State: closed - Opened by David-AU-github about 2 months ago - 12 comments

#423 - Support for Vision Model such as ViT

Issue - State: open - Opened by redagavin about 2 months ago

#422 - Support for xlm-roberta

Issue - State: open - Opened by umiron about 2 months ago - 2 comments

#421 - "mergekit-yaml" not created upon installation

Issue - State: open - Opened by BovineOverlord about 2 months ago - 2 comments

#420 - How to use multi GPUs

Issue - State: open - Opened by liudan193 about 2 months ago - 1 comment

#419 - would you like to support Qwen2.5 Model?

Issue - State: closed - Opened by ArcherShirou about 2 months ago - 1 comment

#418 - Input should be a valid dictionary or instance of MergeConfiguration

Issue - State: open - Opened by Hugo-Calero about 2 months ago - 2 comments

#417 - Make Cohere lm_head optional

Pull Request - State: closed - Opened by cg123 about 2 months ago

#416 - Add Solar And Exaone Model

Pull Request - State: closed - Opened by shing100 2 months ago - 1 comment

#415 - Add support Exaone Model

Pull Request - State: closed - Opened by shing100 2 months ago - 2 comments

#414 - Re-Train every block with reduced width

Issue - State: closed - Opened by snapo 2 months ago

#413 - Fix README links

Pull Request - State: closed - Opened by cg123 2 months ago

#412 - Broken links on main page - " Arcee App"

Issue - State: closed - Opened by David-AU-github 2 months ago

#411 - The DARE-TIES experiment.

Issue - State: open - Opened by David-AU-github 3 months ago - 4 comments

#410 - Cloud Merging

Pull Request - State: closed - Opened by Jacobsolawetz 3 months ago

#409 - I am having problem merging GPT-Neo

Issue - State: open - Opened by 2625554780 3 months ago - 1 comment

#408 - support for GPT-Neo needed!

Issue - State: closed - Opened by 2625554780 3 months ago - 2 comments

#407 - Is it possible to merge Mistral 7B and Mistral NeMo 12B?

Issue - State: open - Opened by azulika 3 months ago - 1 comment

#406 - Set Gemma2 lm_head optional instead of aliasing to embed_tokens

Pull Request - State: closed - Opened by cg123 3 months ago

#405 - Add Phi3SmallForCausalLM and tweak Phi3

Pull Request - State: closed - Opened by cg123 3 months ago

#404 - 小白怎么合并模型 yaml文件配置

Issue - State: open - Opened by yhyub 3 months ago - 1 comment

#402 - 解决运行错误

Issue - State: open - Opened by yhyub 3 months ago

#399 - Working Example of the Mergkit-Evo

Issue - State: open - Opened by nthangelane 3 months ago

#397 - MergeKit GUI not working.

Issue - State: closed - Opened by Abdulhanan535 3 months ago

#396 - Support for Phi-3-Small [Feature ?]

Issue - State: open - Opened by hammoudhasan 3 months ago

#395 - Error at MoE Qwen 1.5B

Issue - State: closed - Opened by ehristoforu 3 months ago - 3 comments

#392 - Example case of task_arithmetic needed

Issue - State: open - Opened by Opdoop 3 months ago - 1 comment

#391 - MoE exits itself after expert prompts 100% 2/2

Issue - State: open - Opened by SameedHusayn 3 months ago

#389 - Create Communication Channels for MergeKit

Issue - State: open - Opened by aditya-cherukuru 3 months ago

#388 - The speed issue with the GTATask.

Issue - State: open - Opened by daidaiershidi 3 months ago - 3 comments

#387 - ABM corrections

Pull Request - State: open - Opened by metric-space 4 months ago

#386 - How to Create a New Merging Method

Issue - State: open - Opened by Guozhenyuan 4 months ago - 1 comment

#385 - Result of merging 2 Gemma2 9B models gains 1B parameters somehow

Issue - State: closed - Opened by jim-plus 4 months ago - 6 comments

#383 - does not appear to have a file named config.json

Issue - State: open - Opened by bxf1001 4 months ago - 2 comments

#382 - Added support for DeepseekV2 model

Pull Request - State: open - Opened by aditya-29 4 months ago - 3 comments

#379 - mergekit-moe支持qwen吗?

Issue - State: open - Opened by hoooooli 4 months ago - 3 comments

#378 - Questions about Config

Issue - State: open - Opened by Zheng-Jay 4 months ago - 2 comments

#377 - mergekit-evolve doesn't account for higher_is_better: false tasks.

Issue - State: open - Opened by mekaneeky 4 months ago - 1 comment

#375 - Network is unreachable

Issue - State: closed - Opened by guanfaqian 4 months ago - 1 comment

#370 - remove strict version of pydantic

Pull Request - State: closed - Opened by sreev 4 months ago - 1 comment

#366 - Add Della merge method

Pull Request - State: closed - Opened by Tej-Deep 4 months ago - 6 comments

#364 - gracefully pause evolutionary optimization?

Issue - State: open - Opened by johnwee1 4 months ago - 1 comment

#360 - Condense a models layers.

Issue - State: open - Opened by DewEfresh 4 months ago - 1 comment

#350 - qwen2-0.5B cannot be merged into MoE

Issue - State: closed - Opened by letterk 5 months ago - 4 comments

#341 - Evolutionary Merging out of memory

Issue - State: open - Opened by ArcherShirou 5 months ago - 4 comments

#340 - Weights Metrics

Pull Request - State: open - Opened by ElliotStein 5 months ago

#335 - Merge arbitrary pytorch models

Pull Request - State: open - Opened by cg123 6 months ago

#333 - `extract_lora.py` improvements and fixes

Pull Request - State: closed - Opened by jukofyork 6 months ago - 12 comments

#332 - Add --load-in-4bit and --load-in-8bit for HF eval backend

Pull Request - State: open - Opened by cg123 6 months ago

#319 - How to merge a VLM and LLM with different model type.

Issue - State: open - Opened by tanyakansal30 6 months ago - 1 comment

#312 - Qwen/Qwen1.5-1.8B MoE Merging fails

Issue - State: closed - Opened by dgolchin 6 months ago - 4 comments

#249 - Mainly adding modified M_U computation.

Pull Request - State: closed - Opened by shamanez 7 months ago

#207 - Evolutionary Merging Method

Issue - State: open - Opened by codelauncher444 8 months ago - 19 comments

#195 - Add support for GPTBigCodeForCausalLM

Pull Request - State: closed - Opened by cg123 8 months ago - 2 comments

#179 - Automatic Weight Calc based on NearSwap

Pull Request - State: closed - Opened by Steel-skull 9 months ago - 2 comments

#168 - Support for Merge methods which require some input data?

Issue - State: open - Opened by ita9naiwa 9 months ago - 2 comments

#167 - Adds a new method to shuffle/swap values

Pull Request - State: open - Opened by Ar57m 9 months ago - 5 comments

#158 - qwen2 architecture definition

Pull Request - State: closed - Opened by thomasgauthier 9 months ago - 6 comments

#150 - Fix phi-2 merging to MoE.

Pull Request - State: closed - Opened by PhilipMay 9 months ago - 4 comments

#101 - moe - ValidationError: 1 validation error for MergeConfiguration

Issue - State: closed - Opened by naseerfaheem 10 months ago - 1 comment

#100 - Adds a way of merging models with different sizes(B)

Pull Request - State: closed - Opened by Ar57m 10 months ago - 10 comments

#99 - Add JAISLMHeadModel

Pull Request - State: closed - Opened by cg123 10 months ago

#98 - JapaneseStableLMAlphaForCausalLM support

Pull Request - State: closed - Opened by cg123 10 months ago