xorbitsai/inference issues and pull requests

#2771 - FIX: [UI] normalize language input to ensure consistent array format.

Pull Request - State: closed - Opened by yiboyasss 1 day ago
Labels: bug

#2770 - ENH: CosyVoice2 support SFT speakers

Pull Request - State: open - Opened by codingl2k1 2 days ago
Labels: enhancement

#2769 - TST: compatible with mlx-vlm 0.1.11

Pull Request - State: closed - Opened by qinxuye 2 days ago
Labels: testing

#2768 - 多模态embeding 支持和llamacpp embeding后端支持

Issue - State: open - Opened by ZanePoe 4 days ago - 1 comment
Labels: feature

#2767 - CosyVoice2 requires prompt_speech

Issue - State: open - Opened by peterliang5678 5 days ago - 6 comments
Labels: gpu

#2766 - MiniCPM-o-2_6全模态模型有支持计划吗？

Issue - State: open - Opened by yhfgyyf 5 days ago - 3 comments
Labels: feature

#2765 - v1.2.0 版本是不是与 pydantic 冲突了

Issue - State: open - Opened by JasonHZS 5 days ago - 1 comment
Labels: gpu

#2764 - 如果xinference启动模型失败后再重启会显示模型已存在list

Issue - State: open - Opened by pyaaaa 5 days ago - 1 comment
Labels: feature

#2763 - ENH: support cline style messages for all backend engines

Pull Request - State: closed - Opened by liunux4odoo 6 days ago
Labels: enhancement

#2762 - 大模型

Issue - State: open - Opened by YSblack 6 days ago - 1 comment
Labels: feature

#2761 - DOC: update new models in README and doc

Pull Request - State: closed - Opened by qinxuye 7 days ago
Labels: documentation

#2760 - FEAT: Support MeloTTS

Pull Request - State: closed - Opened by codingl2k1 7 days ago
Labels: feature

#2759 - BUG: Compat with openai extra body

Pull Request - State: closed - Opened by codingl2k1 9 days ago
Labels: bug

#2758 - HunyuanDiT-v1.2-Distilled生成图片报错expected scalar type Float but found Half

Issue - State: open - Opened by githust66 9 days ago - 4 comments
Labels: gpu

#2758 - HunyuanDiT-v1.2-Distilled生成图片报错expected scalar type Float but found Half

Issue - State: open - Opened by githust66 9 days ago
Labels: gpu

#2757 - Codegeex4 ERROR: ChatGLM4Tokenizer._pad() got an unexpected keyword argument 'padding_side'"}

Issue - State: open - Opened by Oaklight 9 days ago - 2 comments
Labels: gpu

#2757 - Codegeex4 ERROR: ChatGLM4Tokenizer._pad() got an unexpected keyword argument 'padding_side'"}

Issue - State: open - Opened by Oaklight 9 days ago
Labels: gpu

#2756 - Failed to infer device type

Issue - State: closed - Opened by leoterry-ulrica 9 days ago - 2 comments
Labels: gpu

#2755 - ENH: add model config for Whisper

Pull Request - State: closed - Opened by fonsc 11 days ago
Labels: enhancement

#2754 - Improve kv block transfer

Issue - State: open - Opened by codingl2k1 11 days ago - 1 comment
Labels: feature, stale

#2753 - FEAT: [UI] Add gguf_quantization, gguf_model_path, and cpu_offload for image models.

Pull Request - State: closed - Opened by yiboyasss 11 days ago
Labels: feature

#2752 - BUG: pin mlx<0.22.0 to prevent qwen2_vl failing in mlx-vlm

Pull Request - State: closed - Opened by qinxuye 11 days ago
Labels: bug

#2751 - embedding模型接口调用后名称发生变化

Issue - State: open - Opened by sliontc 11 days ago - 3 comments

#2750 - 无法在新部署的xinference中启动已经下载好的模型

Issue - State: closed - Opened by frankSARU 12 days ago - 3 comments
Labels: gpu

#2749 - FEAT: Support Marco-o1

Pull Request - State: closed - Opened by Jun-Howie 12 days ago
Labels: feature

#2748 - FIX: [UI] Fix dark mode background bug.

Pull Request - State: closed - Opened by yiboyasss 12 days ago
Labels: bug

#2747 - FIX: [UI] Resolve bug preventing '/' input in model_path.

Pull Request - State: closed - Opened by yiboyasss 13 days ago
Labels: bug

#2746 - ENH: [UI] Update Button Style and Interaction Logic for Editing Cache in Model Card.

Pull Request - State: closed - Opened by yiboyasss 13 days ago
Labels: enhancement

#2745 - docker启动报错 Illegal instruction (core dumped)如何排查

Issue - State: closed - Opened by kimi360 13 days ago - 5 comments

#2744 - FEAT: [UI] Add language toggle for i18n support.

Pull Request - State: closed - Opened by yiboyasss 13 days ago
Labels: feature

#2743 - model card UI redesign

Issue - State: closed - Opened by ilovesouthpark 14 days ago - 2 comments
Labels: feature

#2742 - 增加melotts等快速TTS推理模型

Issue - State: closed - Opened by Wiziechen 14 days ago - 1 comment
Labels: feature

#2741 - FEAT: support qwen2vl run on ascend npu

Pull Request - State: closed - Opened by Xu-pixel 14 days ago - 2 comments
Labels: feature

#2740 - FEAT: Support cogagent-9b

Pull Request - State: closed - Opened by amumu96 15 days ago
Labels: feature

#2739 - xprobe/xinference:v1.1.1-cpu中无法使用fishspeech1.5

Issue - State: closed - Opened by zhudemiao 16 days ago - 6 comments
Labels: stale

#2738 - ENH: Improve error message

Pull Request - State: closed - Opened by codingl2k1 17 days ago - 2 comments
Labels: enhancement

#2737 - 添加对deepseek v3的支持

Issue - State: closed - Opened by jqhr 17 days ago - 2 comments
Labels: duplicate, feature

#2736 - 添加对deepseek v3的支持

Issue - State: open - Opened by jqhr 17 days ago - 6 comments
Labels: feature

#2735 - 增加跨域的配置

Issue - State: open - Opened by closer-finger 17 days ago - 2 comments
Labels: good first issue, pr welcome

#2734 - FEAT: support cline for vllm engine

Pull Request - State: closed - Opened by hwzhuhao 18 days ago - 1 comment
Labels: feature

#2733 - Feat: Support cogagent-9b

Pull Request - State: closed - Opened by amumu96 18 days ago
Labels: feature

#2732 - FEAT: Xavier: Share KV cache between VLLM replicas

Pull Request - State: closed - Opened by ChengjieLi28 18 days ago - 5 comments
Labels: feature

#2731 - 增加Fishspeech等TTS模型的音色配置功能

Issue - State: closed - Opened by lywy233 19 days ago - 4 comments
Labels: feature, stale

#2730 - xinference didn't support qwen2-vl-72B?

Issue - State: closed - Opened by cqray1990 19 days ago - 15 comments
Labels: stale

#2729 - 请问咱们平台支持中国移动的九天模型吗？

Issue - State: open - Opened by jieguolove 19 days ago - 1 comment
Labels: feature

#2729 - 请问咱们平台支持中国移动的九天模型吗？

Issue - State: open - Opened by jieguolove 19 days ago - 2 comments
Labels: feature, stale, pr welcome

#2728 - Internal error for batch inference: probability tensor contains either `inf`, `nan` or element < 0.

Issue - State: closed - Opened by lukuanwang-delta 21 days ago - 4 comments
Labels: gpu, stale

#2727 - FEAT: support hunyuan-dit text2image

Pull Request - State: closed - Opened by qinxuye 21 days ago
Labels: feature

#2726 - 在win11中启动Qwen2.5-Coder-14B-Instruct报错

Issue - State: closed - Opened by zhangwonderful 21 days ago - 3 comments
Labels: gpu, stale

#2725 - cannot import name 'shard_checkpoint' from 'transformers.modeling_utils' (/root/miniconda3/envs/xinference/lib/python3.11/site-packages/transformers/modeling_utils.py)

Issue - State: closed - Opened by c935289832 21 days ago - 10 comments
Labels: gpu, stale

#2724 - BUG: adapt mlx-vlm v0.1.7

Pull Request - State: closed - Opened by qinxuye 22 days ago
Labels: bug

#2723 - 启动xinference平台报错，可能是异步超时

Issue - State: closed - Opened by feifei05 22 days ago - 9 comments
Labels: gpu, stale

#2722 - litellm.exceptions.BadRequestError: litellm.BadRequestError: XinferenceException - Error code: 400 - {'detail': 'Invalid input. Please specify the prompt.'}在crewai上运行xinference部署的qwen2.5模型报错

Issue - State: closed - Opened by DCJsenior 22 days ago - 3 comments
Labels: stale

#2721 - FEAT: support HunyuanVideo

Pull Request - State: closed - Opened by qinxuye 22 days ago
Labels: feature, gpu

#2720 - 调用restful api时报错

Issue - State: closed - Opened by zhangwonderful 23 days ago - 3 comments
Labels: gpu, stale

#2719 - qwen2-vl视频推理报错 Not support video input now

Issue - State: closed - Opened by 948024326 23 days ago - 4 comments
Labels: gpu, stale

#2718 - qwen2-vl的视觉模型支持sglang框架启动

Issue - State: closed - Opened by 948024326 23 days ago - 7 comments
Labels: feature, stale

#2717 - 报错: cannot import name 'build_regex_from_schema' from 'outlines.fsm.json_schema'

Issue - State: closed - Opened by 948024326 23 days ago - 2 comments
Labels: gpu

#2716 - sglang框架启动模型报错 no module :sql_kernel

Issue - State: closed - Opened by 948024326 23 days ago - 1 comment

#2715 - 需求 Xinference 能够在docker 的介绍里面将cuda12.4 修改为最小支持cuda12.2

Issue - State: closed - Opened by vss80p585 24 days ago - 2 comments
Labels: feature, gpu, stale

#2714 - 在dify 中使用 xinference flux-dev step超过20或并发超过1时，生成失败

Issue - State: closed - Opened by geekidentity 24 days ago - 2 comments
Labels: gpu, stale

#2713 - CHORE: Update new models in readme

Pull Request - State: closed - Opened by codingl2k1 25 days ago

#2712 - FEAT: Support QvQ-72B-Preview

Pull Request - State: closed - Opened by Jun-Howie 25 days ago
Labels: feature

#2711 - REF: Reduce code redundancy by setting default values

Pull Request - State: closed - Opened by pengjunfeng11 26 days ago
Labels: refactor

#2710 - 分布式节点部署，launch时指定卡有问题

Issue - State: open - Opened by syd1997 26 days ago - 12 comments
Labels: gpu, stale

#2709 - 界面部署的时候无法选择vllm部署

Issue - State: closed - Opened by ybsbbw 26 days ago - 1 comment
Labels: gpu

#2708 - 关于单卡多模型加载

Issue - State: closed - Opened by luckfu 26 days ago - 2 comments
Labels: gpu, stale

#2707 - embedding模型指定gpu启动报错

Issue - State: closed - Opened by jingzl 26 days ago - 3 comments
Labels: gpu, stale

#2706 - FEAT: support SD3.5 series model

Pull Request - State: closed - Opened by qinxuye 27 days ago
Labels: feature

#2705 - Gradio-Web界面推理和Python推理输出不一致

Issue - State: closed - Opened by m00nLi 27 days ago - 10 comments
Labels: gpu, stale

#2704 - windows机器，提示Cluster is not available after multiple attempts，用0.0.0.0启动

Issue - State: closed - Opened by smalldeer1982 27 days ago - 5 comments
Labels: stale

#2703 - phi-4大模型及其awq等量化系列集成建议

Issue - State: closed - Opened by moshilangzi 27 days ago - 2 comments
Labels: feature, stale, pr welcome

#2702 - 部署qwen-vl-chat 出错

Issue - State: closed - Opened by amzfc 27 days ago - 10 comments
Labels: gpu, stale

#2701 - 支持CosyVoice2.0-0.5B

Issue - State: closed - Opened by Wiziechen 28 days ago - 3 comments
Labels: feature

#2700 - FEAT: support scheduling-policy for vllm

Pull Request - State: closed - Opened by hwzhuhao 28 days ago - 5 comments
Labels: feature

#2699 - docker部署加上OAuth2后反复调整登录页面，或者模型启动的时候跳转登录页面

Issue - State: closed - Opened by rundreamsFly 28 days ago - 3 comments
Labels: gpu, stale

#2698 - 支持 gguf 量化的 FLUX.1

Issue - State: closed - Opened by geekidentity 28 days ago - 1 comment
Labels: feature, gpu

#2697 - FEAT: Support minicpm-4B on vllm

Pull Request - State: closed - Opened by Jun-Howie 29 days ago
Labels: feature

#2696 - qwen2-vl-7b vllm 设置默认显存

Issue - State: closed - Opened by GXKIM 29 days ago - 1 comment
Labels: gpu

#2695 - BUG: `glm4-chat` cannot apply for continuous batching with transformers backend

Pull Request - State: closed - Opened by ChengjieLi28 29 days ago
Labels: bug

#2694 - cpu服务器，通过curl调用接口推理，报"NoneType' object has no attribute 'size'

Issue - State: closed - Opened by lizhao-8202 29 days ago - 3 comments
Labels: gpu, stale

#2694 - cpu服务器，通过curl调用接口推理，报"NoneType' object has no attribute 'size'

Issue - State: closed - Opened by lizhao-8202 29 days ago - 3 comments
Labels: gpu, stale

#2693 - 后面的版本minicpm3-4b会支持vllm推理吗

Issue - State: closed - Opened by tu-160 30 days ago - 2 comments

#2692 - Xinference[mlx] fish audio Hydra error:

Issue - State: closed - Opened by twilwa about 1 month ago - 8 comments
Labels: stale

#2691 - 启动服务器下载好的本地音频模型报错

Issue - State: closed - Opened by moshenwu about 1 month ago - 11 comments
Labels: gpu, stale

#2690 - 使用自带的聊天页面或者接入dify之后输出很慢

Issue - State: closed - Opened by congge27 about 1 month ago - 4 comments
Labels: stale

#2689 - 无法在多GPU环境下在指定GPU上启动3个及以上数量的副本

Issue - State: closed - Opened by epic1219 about 1 month ago - 3 comments
Labels: gpu, stale

#2688 - sensevoice的timestamp功能

Issue - State: closed - Opened by leslie2046 about 1 month ago - 4 comments
Labels: feature, stale, pr welcome

#2687 - 启动报错

Issue - State: closed - Opened by feifei05 about 1 month ago - 4 comments
Labels: stale

#2686 - 期待集成DeepSeek-VL2

Issue - State: closed - Opened by moshilangzi about 1 month ago - 2 comments
Labels: feature, stale

#2685 - Xin cannot perceive whether the service is running normally.

Issue - State: closed - Opened by liuzhenghua about 1 month ago - 3 comments
Labels: gpu, stale

#2684 - ENH: Update cosyvoice 2

Pull Request - State: closed - Opened by codingl2k1 about 1 month ago - 1 comment
Labels: enhancement

#2683 - OpenGVLab/InternVL2_5-78B 架构变了，导致不能注册至xinf 希望能集成InternVL2_5

Issue - State: closed - Opened by Kevin-qwx about 1 month ago - 1 comment
Labels: duplicate, feature

#2682 - 报错Segmentation fault (core dumped)

Issue - State: closed - Opened by SKKKKYLAR about 1 month ago - 3 comments
Labels: gpu, stale

#2681 - FEAT: Support qwen2.5-coder-instruct model for tool calls

Pull Request - State: closed - Opened by Timmy-web about 1 month ago
Labels: feature

#2680 - BUG: Fix f5tts audio ref

Pull Request - State: closed - Opened by codingl2k1 about 1 month ago
Labels: bug

#2679 - GOT-OCR2.0模型运行出现问题

Issue - State: open - Opened by zjx140 about 1 month ago - 1 comment
Labels: gpu

#2679 - GOT-OCR2.0模型运行出现问题

Issue - State: closed - Opened by zjx140 about 1 month ago - 3 comments
Labels: gpu, stale

#2678 - ENH: resample f5-tts-mlx ref audio when sample rate not synching.

Pull Request - State: open - Opened by qinxuye about 1 month ago
Labels: enhancement

#2678 - ENH: resample f5-tts-mlx ref audio when sample rate not synching.

Pull Request - State: closed - Opened by qinxuye about 1 month ago
Labels: enhancement

GitHub / xorbitsai/inference issues and pull requests