Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / InternLM/lmdeploy issues and pull requests

#2473 - Support user-sepcified data type

Pull Request - State: open - Opened by lvhan028 4 days ago
Labels: enhancement

#2471 - Ascend NPU support

Issue - State: open - Opened by zer0py2c 4 days ago - 2 comments
Labels: awaiting response

#2469 - Add silu mul kernel

Pull Request - State: open - Opened by grimoire 5 days ago

#2466 - Refactor lora

Pull Request - State: open - Opened by grimoire 6 days ago

#2465 - Support minicpm3-4b

Pull Request - State: open - Opened by AllentDan 7 days ago - 4 comments
Labels: enhancement

#2461 - Fix initialization of runtime_min_p

Pull Request - State: closed - Opened by irexyc 7 days ago
Labels: Bug:P1

#2460 - fix MultinomialSampling operator builder

Pull Request - State: closed - Opened by grimoire 7 days ago
Labels: Bug:P2

#2458 - [Feature] support s-lora in turbomind backend

Issue - State: open - Opened by torinchen 8 days ago - 2 comments

#2455 - MiniCPM3-4B会支持吗?

Issue - State: open - Opened by LIUKAI0815 8 days ago - 1 comment

#2454 - fix tensors on different devices when deploying MiniCPM-V-2_6 with tensor parallelism

Pull Request - State: closed - Opened by irexyc 8 days ago
Labels: Bug:P1

#2453 - [Bug] TypeError: Got unsupported ScalarType BFloat16

Issue - State: open - Opened by SeitaroShinagawa 8 days ago - 2 comments

#2452 - [Bug] lmdeploy部署minicpm-v2_6推理报错

Issue - State: closed - Opened by dfe2342 8 days ago - 4 comments

#2450 - [Bug] LongCite-glm4-9b awq quantization error

Issue - State: open - Opened by maxin9966 8 days ago

#2449 - support Qwen2-VL with pytorch backend

Pull Request - State: open - Opened by irexyc 8 days ago - 3 comments
Labels: enhancement

#2448 - [Feature] pipe如何输出scores

Issue - State: open - Opened by KooSung 8 days ago - 1 comment
Labels: awaiting response

#2447 - add docs about ascend

Pull Request - State: closed - Opened by yao-fengchen 8 days ago - 1 comment

#2446 - Fix ascend readme

Pull Request - State: closed - Opened by jinminxi104 9 days ago - 1 comment

#2445 - bump version to v0.6.0

Pull Request - State: closed - Opened by lvhan028 9 days ago - 1 comment

#2444 - fix llama3 rotary in pytorch engine

Pull Request - State: closed - Opened by grimoire 9 days ago
Labels: Bug:P1

#2443 - [Bug] How to use w4a16 model in PytorchEngine

Issue - State: closed - Opened by xzmates 9 days ago - 2 comments

#2442 - [Bug] CUDA runtime error when running Llama-3.1-70B-Instruct-AWQ-INT4

Issue - State: open - Opened by rtadewald 9 days ago - 3 comments
Labels: awaiting response

#2440 - refactor pytorch engine(ascend)

Pull Request - State: closed - Opened by yao-fengchen 10 days ago
Labels: enhancement

#2439 - [Bug] lmdeploy does not support the regularized lora target module

Issue - State: open - Opened by orzgugu 10 days ago - 1 comment
Labels: awaiting response

#2438 - Support pytorch engine kv int4/int8 quantization

Pull Request - State: open - Opened by AllentDan 10 days ago - 1 comment

#2436 - [Feature] 能否支持一下qwenvl2

Issue - State: open - Opened by Ranking666 11 days ago - 3 comments
Labels: awaiting response

#2435 - [Bug] main分支EngineGenerationConfig不在初始化中了

Issue - State: closed - Opened by RandomCoins 11 days ago - 3 comments

#2434 - automatically set max_batch_size according to the device when it is not specified

Pull Request - State: closed - Opened by lvhan028 12 days ago
Labels: improvement

#2433 - build nccl in dockerfile for cuda11.8

Pull Request - State: closed - Opened by RunningLeon 13 days ago
Labels: improvement

#2432 - 是否支持embedding模型部署

Issue - State: open - Opened by Toblame 14 days ago - 1 comment

#2431 - [ci] regular update

Pull Request - State: open - Opened by zhulinJulia24 14 days ago

#2430 - [Bug] cogvlm2支持的问题

Issue - State: open - Opened by tdf1995 14 days ago - 1 comment

#2428 - Fix some issues encountered by modelscope and community

Pull Request - State: closed - Opened by irexyc 15 days ago
Labels: Bug:P1

#2427 - inplace logits process as default

Pull Request - State: closed - Opened by grimoire 15 days ago
Labels: improvement

#2426 - ignore *.pth when download model from model hub

Pull Request - State: closed - Opened by lvhan028 15 days ago - 1 comment
Labels: improvement

#2424 - [Feature] Profiling GeMM kernel in lmdeploy

Issue - State: open - Opened by DerrickYLJ 15 days ago - 1 comment

#2423 - [Feature] when --tp 2

Issue - State: open - Opened by maxin9966 15 days ago - 6 comments
Labels: awaiting response

#2422 - [Docs] AWQ / GPTQ 部分

Issue - State: open - Opened by Skyseaee 16 days ago

#2421 - build: update ascend dockerfile

Pull Request - State: closed - Opened by CyCle1024 16 days ago
Labels: improvement

#2420 - support min_p sampling parameter

Pull Request - State: closed - Opened by irexyc 16 days ago - 1 comment
Labels: enhancement

#2419 - update actions/download-artifact to v4 to fix security issue

Pull Request - State: closed - Opened by lvhan028 16 days ago

#2418 - [Docs] 关于kv cache 量化

Issue - State: closed - Opened by Root970103 16 days ago - 5 comments
Labels: awaiting response

#2417 - add Ascend get_started

Pull Request - State: closed - Opened by jinminxi104 16 days ago
Labels: documentation

#2416 - [Bug] Met a error when deploying an AWQ model on H20.

Issue - State: closed - Opened by medwang1 17 days ago - 17 comments

#2415 - [Feature] Is there a plan to support the deployment of Qwen2-VL?

Issue - State: open - Opened by ldknight 17 days ago - 2 comments
Labels: awaiting response

#2413 - import dlinfer before imageencoding

Pull Request - State: closed - Opened by jinminxi104 18 days ago - 3 comments
Labels: improvement

#2411 - [Feature] Would you consider add qwen2vl?

Issue - State: closed - Opened by PredyDaddy 18 days ago - 2 comments

#2410 - fix get_started user guide unaccessible

Pull Request - State: closed - Opened by lvhan028 18 days ago
Labels: documentation

#2409 - [Bug] Aborted (core dumped)

Issue - State: open - Opened by suwenzhuo 18 days ago - 1 comment

#2408 - [Bug] 多卡部署InternVL2-8B报错Aborted (core dumped)

Issue - State: closed - Opened by gxlover0625 19 days ago - 1 comment

#2407 - [Bug] internlm模型进行bitsandbytes int8量化

Issue - State: closed - Opened by EvoNexusX 19 days ago - 14 comments

#2403 - rename the ascend dockerfile

Pull Request - State: closed - Opened by lvhan028 21 days ago

#2402 - Torchrun launching multiple api_server

Pull Request - State: open - Opened by AllentDan 21 days ago

#2401 - [ci] add daily test's coverage report

Pull Request - State: closed - Opened by zhulinJulia24 21 days ago

#2399 - [Bug] [TM][ERROR] CUDA runtime error: misaligned address

Issue - State: open - Opened by sleepwalker2017 21 days ago - 6 comments

#2398 - 支持对qwen2-audio-instruct的加速吗

Issue - State: open - Opened by zhanghanweii 21 days ago - 4 comments

#2397 - [Feature] Does/Can lmdeploy work with XLA/TPUs

Issue - State: closed - Opened by radna0 22 days ago - 1 comment

#2396 - fix: make main process exit properly when tp>1 on ascend backend

Pull Request - State: closed - Opened by CyCle1024 22 days ago - 1 comment
Labels: Bug:P1

#2395 - Fix /v1/completions batch order wrong

Pull Request - State: closed - Opened by AllentDan 22 days ago
Labels: Bug:P1

#2394 - [Feature] 增加对于虚拟内存的缓冲时间

Issue - State: closed - Opened by NB-Group 22 days ago - 14 comments

#2393 - [Feature] InternVL2 inference is slower than InternLM-Xcomposer2

Issue - State: closed - Opened by zhaoning1987 23 days ago - 2 comments

#2392 - Inquiry

Issue - State: closed - Opened by xiaoajie738 23 days ago - 2 comments

#2392 - Inquiry

Issue - State: open - Opened by xiaoajie738 23 days ago

#2391 - [BUG]session id is not threadsafe

Issue - State: closed - Opened by tp-nan 23 days ago - 1 comment

#2390 - [Bug] InternLM 2.5 function calling

Issue - State: open - Opened by coffeecode24 23 days ago - 3 comments

#2389 - Model Parallel

Issue - State: closed - Opened by beichenzbc 23 days ago - 2 comments

#2388 - fix cache position for pytorch engine

Pull Request - State: closed - Opened by RunningLeon 23 days ago
Labels: Bug:P2

#2388 - fix cache position for pytorch engine

Pull Request - State: closed - Opened by RunningLeon 23 days ago
Labels: Bug:P2

#2386 - [Feature] 海光DCU简单测试,希望能支持

Issue - State: open - Opened by luckfu 24 days ago - 2 comments

#2385 - [Bug] 無法在windows上部署 Phi-3.5-vision-instruct

Issue - State: open - Opened by HSIAOKUOWEI 24 days ago - 4 comments

#2385 - [Bug] 無法在windows上部署 Phi-3.5-vision-instruct

Issue - State: open - Opened by HSIAOKUOWEI 24 days ago - 4 comments

#2384 - [Bug] 0.6.0 glm4-9b gptq还是会出现无限吐字的问题

Issue - State: open - Opened by maxin9966 24 days ago - 20 comments

#2384 - [Bug] 0.6.0 glm4-9b gptq还是会出现无限吐字的问题

Issue - State: closed - Opened by maxin9966 24 days ago - 20 comments

#2382 - [Bug] run out of tokens error when using llama3-llava-next-8b-hf

Issue - State: closed - Opened by binzhang01 24 days ago - 5 comments

#2381 - [Bug] CUDA runtime error: out of memory /lmdeploy/src/turbomind/utils/memory_utils.cu:32

Issue - State: open - Opened by AmazDeng 24 days ago - 5 comments
Labels: awaiting response, Stale

#2381 - [Bug] CUDA runtime error: out of memory /lmdeploy/src/turbomind/utils/memory_utils.cu:32

Issue - State: closed - Opened by AmazDeng 24 days ago - 6 comments
Labels: awaiting response, Stale