Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / tlntin/qwen-tensorrt-llm issues and pull requests
#103 - web_demo无法显示模型响应
Issue -
State: closed - Opened by elegant-bot 8 months ago
- 2 comments
#103 - web_demo无法显示模型响应
Issue -
State: closed - Opened by elegant-bot 8 months ago
- 2 comments
#102 - Adjust the semantics of max_output_len to be consistent with TRT-LLM
Pull Request -
State: closed - Opened by BasicCoder 8 months ago
- 1 comment
#102 - Adjust the semantics of max_output_len to be consistent with TRT-LLM
Pull Request -
State: closed - Opened by BasicCoder 8 months ago
- 1 comment
#101 - triton 部署, 生成乱码
Issue -
State: closed - Opened by maozixi1 8 months ago
- 18 comments
#101 - triton 部署, 生成乱码
Issue -
State: closed - Opened by maozixi1 8 months ago
- 18 comments
#100 - qwen_14b_chat build error
Issue -
State: closed - Opened by dangerous-xu 8 months ago
- 3 comments
#100 - qwen_14b_chat build error
Issue -
State: closed - Opened by dangerous-xu 8 months ago
- 3 comments
#99 - Fix bug
Pull Request -
State: closed - Opened by zhaohb 8 months ago
#99 - Fix bug
Pull Request -
State: closed - Opened by zhaohb 8 months ago
#98 - Qwen 2 build.py multi gpu with 2 different GPU's issue
Issue -
State: open - Opened by teis-e 8 months ago
- 17 comments
#98 - Qwen 2 build.py multi gpu with 2 different GPU's issue
Issue -
State: open - Opened by teis-e 8 months ago
- 17 comments
#97 - update verstion to 0.8.0
Pull Request -
State: closed - Opened by Tlntin 8 months ago
#97 - update verstion to 0.8.0
Pull Request -
State: closed - Opened by Tlntin 8 months ago
#96 - 编译tritonserver 镜像 失败
Issue -
State: closed - Opened by maozixi1 8 months ago
- 4 comments
#96 - 编译tritonserver 镜像 失败
Issue -
State: closed - Opened by maozixi1 8 months ago
- 4 comments
#95 - Triton 和 Langchain部署问题
Issue -
State: closed - Opened by plt12138 8 months ago
- 6 comments
#95 - Triton 和 Langchain部署问题
Issue -
State: closed - Opened by plt12138 8 months ago
- 6 comments
#94 - how to build Qwen-72B-Chat-Int4 with tp=2
Issue -
State: closed - Opened by liyunhan 8 months ago
- 27 comments
#94 - how to build Qwen-72B-Chat-Int4 with tp=2
Issue -
State: closed - Opened by liyunhan 8 months ago
- 27 comments
#93 - 运行run.py报错,Segmentation fault (core dumped)
Issue -
State: closed - Opened by ArlanCooper 8 months ago
- 8 comments
#93 - 运行run.py报错,Segmentation fault (core dumped)
Issue -
State: closed - Opened by ArlanCooper 8 months ago
- 8 comments
#92 - ModuleNotFoundError: No module named 'transformers.models.qwen2'
Issue -
State: closed - Opened by ArlanCooper 8 months ago
- 2 comments
#92 - ModuleNotFoundError: No module named 'transformers.models.qwen2'
Issue -
State: closed - Opened by ArlanCooper 8 months ago
- 2 comments
#91 - triton同步异步接口询问
Issue -
State: closed - Opened by dongteng 8 months ago
- 15 comments
#91 - triton同步异步接口询问
Issue -
State: closed - Opened by dongteng 8 months ago
- 15 comments
#90 - 请问目前的Qwen-VL实现方式,是否仅支持输入单张图片,且图片必须在输入的开头?
Issue -
State: closed - Opened by xikaluo 8 months ago
- 5 comments
#90 - 请问目前的Qwen-VL实现方式,是否仅支持输入单张图片,且图片必须在输入的开头?
Issue -
State: closed - Opened by xikaluo 8 months ago
- 5 comments
#88 - 请问如何支持正常的batch infer ?
Issue -
State: closed - Opened by zhangyu68 8 months ago
- 2 comments
#88 - 请问如何支持正常的batch infer ?
Issue -
State: closed - Opened by zhangyu68 8 months ago
- 2 comments
#87 - 请问为什么smoothquant量化后显存占用不降低呢
Issue -
State: closed - Opened by tp-nan 8 months ago
- 6 comments
#87 - 请问为什么smoothquant量化后显存占用不降低呢
Issue -
State: closed - Opened by tp-nan 8 months ago
- 6 comments
#86 - 运行build文件报错: TypeError: RowLinear.__init__() got an unexpected keyword argument 'instance_id'
Issue -
State: closed - Opened by ArlanCooper 8 months ago
- 2 comments
#86 - 运行build文件报错: TypeError: RowLinear.__init__() got an unexpected keyword argument 'instance_id'
Issue -
State: closed - Opened by ArlanCooper 8 months ago
- 2 comments
#85 - 有人能共享Build好的qwen或qwen1.5 int4的trt_engine(4gpu)文件吗?
Issue -
State: closed - Opened by zhangjiekui 8 months ago
- 11 comments
#85 - 有人能共享Build好的qwen或qwen1.5 int4的trt_engine(4gpu)文件吗?
Issue -
State: closed - Opened by zhangjiekui 8 months ago
- 11 comments
#84 - 想问一下,为什么72B模型是实验性的呢?架构应该是一样的呀,原因是什么呢?谢谢
Issue -
State: closed - Opened by zhangjiekui 8 months ago
- 2 comments
#84 - 想问一下,为什么72B模型是实验性的呢?架构应该是一样的呀,原因是什么呢?谢谢
Issue -
State: closed - Opened by zhangjiekui 8 months ago
- 2 comments
#83 - 使用auto-gptq编译qwen_1_8B-Chat-int4官方报错'KeyError: 'transformer.h.0.attn.c_attn.qweight'
Issue -
State: closed - Opened by fmozer 8 months ago
- 5 comments
#83 - 使用auto-gptq编译qwen_1_8B-Chat-int4官方报错'KeyError: 'transformer.h.0.attn.c_attn.qweight'
Issue -
State: closed - Opened by fmozer 8 months ago
- 5 comments
#82 - Qwen-72B-Chat-Int4 killed
Issue -
State: closed - Opened by Hukongtao 8 months ago
- 2 comments
#82 - Qwen-72B-Chat-Int4 killed
Issue -
State: closed - Opened by Hukongtao 8 months ago
- 2 comments
#81 - 测试hf吞吐OOM以及triton并发、流式输出问题
Issue -
State: closed - Opened by dongteng 8 months ago
- 23 comments
Labels: bug
#81 - 测试hf吞吐OOM以及triton并发、流式输出问题
Issue -
State: closed - Opened by dongteng 8 months ago
- 23 comments
Labels: bug
#80 - Qwen2 编译错误
Issue -
State: closed - Opened by mogoxx 8 months ago
- 5 comments
#80 - Qwen2 编译错误
Issue -
State: closed - Opened by mogoxx 8 months ago
- 5 comments
#79 - TensorRT_LLM 0.7.0 编译 Qwen-7B-Chat 模型,编译后启动API似乎无法支持并发访问API?
Issue -
State: closed - Opened by CedricHwong 9 months ago
- 2 comments
#79 - TensorRT_LLM 0.7.0 编译 Qwen-7B-Chat 模型,编译后启动API似乎无法支持并发访问API?
Issue -
State: closed - Opened by CedricHwong 9 months ago
- 2 comments
#78 - Qwen1.5 GPTQ编译错误
Issue -
State: closed - Opened by compass-star 9 months ago
- 1 comment
#78 - Qwen1.5 GPTQ编译错误
Issue -
State: closed - Opened by compass-star 9 months ago
- 1 comment
#77 - Qwen1.5 GPTQ-Int4 编译失败
Issue -
State: closed - Opened by ljhssga 9 months ago
- 15 comments
#77 - Qwen1.5 GPTQ-Int4 编译失败
Issue -
State: closed - Opened by ljhssga 9 months ago
- 15 comments
#76 - Qwen1.5 GPTQ用不了
Issue -
State: closed - Opened by Pevernow 9 months ago
- 2 comments
#76 - Qwen1.5 GPTQ用不了
Issue -
State: closed - Opened by Pevernow 9 months ago
- 2 comments
#75 - swift微调的qwen-vl支持吗
Issue -
State: closed - Opened by xs818818 9 months ago
- 1 comment
#75 - swift微调的qwen-vl支持吗
Issue -
State: closed - Opened by xs818818 9 months ago
- 1 comment
#74 - 函数调用会报错
Issue -
State: closed - Opened by xzmagic 9 months ago
#74 - 函数调用会报错
Issue -
State: closed - Opened by xzmagic 9 months ago
#73 - 大佬请问个问题:AttributeError: 'QWenForCausalLM' object has no attribute 'embedding'
Issue -
State: closed - Opened by dongteng 10 months ago
#73 - 大佬请问个问题:AttributeError: 'QWenForCausalLM' object has no attribute 'embedding'
Issue -
State: closed - Opened by dongteng 10 months ago
#72 - 大佬有没有对比和VLLM的推理效果?
Issue -
State: open - Opened by white-wolf-tech 10 months ago
- 2 comments
#72 - 大佬有没有对比和VLLM的推理效果?
Issue -
State: open - Opened by white-wolf-tech 10 months ago
- 2 comments
#71 - ERROR: Failed to create instance: unexpected error when creating modelInstanceState
Issue -
State: closed - Opened by lyc728 10 months ago
- 3 comments
#71 - ERROR: Failed to create instance: unexpected error when creating modelInstanceState
Issue -
State: closed - Opened by lyc728 10 months ago
- 3 comments
#70 - web demo error
Issue -
State: closed - Opened by HappyKerry 10 months ago
- 1 comment
#70 - web demo error
Issue -
State: closed - Opened by HappyKerry 10 months ago
- 1 comment
#69 - TensorRT的采样可以和QWen官方generation_config.json里面提供的采样参数对齐吗?
Issue -
State: open - Opened by Hukongtao 10 months ago
- 4 comments
#69 - TensorRT的采样可以和QWen官方generation_config.json里面提供的采样参数对齐吗?
Issue -
State: open - Opened by Hukongtao 10 months ago
- 4 comments
#68 - Qwen-14B-Chat-Int4运行后预测结果不对
Issue -
State: closed - Opened by takemars 10 months ago
- 4 comments
Labels: bug
#68 - Qwen-14B-Chat-Int4运行后预测结果不对
Issue -
State: closed - Opened by takemars 10 months ago
- 4 comments
Labels: bug
#67 - Qwen-VL build.py: error: unrecognized arguments: --use_rmsnorm_plugin --use_lookup_plugin float16 --max_prompt_embedding_table_size 2048
Issue -
State: closed - Opened by 77h2l 10 months ago
- 1 comment
#67 - Qwen-VL build.py: error: unrecognized arguments: --use_rmsnorm_plugin --use_lookup_plugin float16 --max_prompt_embedding_table_size 2048
Issue -
State: closed - Opened by 77h2l 10 months ago
- 1 comment
#66 - inflight_batching
Issue -
State: closed - Opened by lyc728 10 months ago
- 24 comments
#66 - inflight_batching
Issue -
State: closed - Opened by lyc728 10 months ago
- 24 comments
#65 - TypeError: missing a required argument: 'host_sink_token_length'
Issue -
State: closed - Opened by Hukongtao 10 months ago
- 2 comments
#65 - TypeError: missing a required argument: 'host_sink_token_length'
Issue -
State: closed - Opened by Hukongtao 10 months ago
- 2 comments
#64 - qwen-14b int4-awq 量化失败
Issue -
State: closed - Opened by zhisunyy 10 months ago
- 7 comments
#64 - qwen-14b int4-awq 量化失败
Issue -
State: closed - Opened by zhisunyy 10 months ago
- 7 comments
#63 - triron部署成功后,每个卡上多出来几个进程
Issue -
State: closed - Opened by white-wolf-tech 10 months ago
- 12 comments
#63 - triron部署成功后,每个卡上多出来几个进程
Issue -
State: closed - Opened by white-wolf-tech 10 months ago
- 12 comments
#62 - 使用triton + inflight_batching 后吞吐反而降了
Issue -
State: closed - Opened by zhisunyy 10 months ago
- 2 comments
#62 - 使用triton + inflight_batching 后吞吐反而降了
Issue -
State: closed - Opened by zhisunyy 10 months ago
- 2 comments
#61 - 推理加速效果怎么样?
Issue -
State: closed - Opened by yanguowei316 11 months ago
- 1 comment
#61 - 推理加速效果怎么样?
Issue -
State: closed - Opened by yanguowei316 11 months ago
- 1 comment
#60 - Triton部署TensorRT-LLM报错
Issue -
State: closed - Opened by zhisunyy 11 months ago
- 9 comments
#60 - Triton部署TensorRT-LLM报错
Issue -
State: closed - Opened by zhisunyy 11 months ago
- 9 comments
#59 - 请问是否有尝试过在mpirun -n 大于1的情况下提供http服务?
Issue -
State: closed - Opened by xikaluo 11 months ago
- 8 comments
Labels: enhancement
#59 - 请问是否有尝试过在mpirun -n 大于1的情况下提供http服务?
Issue -
State: closed - Opened by xikaluo 11 months ago
- 8 comments
Labels: enhancement
#58 - Qwen-14B INT4-AWQ 用tp=2时量化失败
Issue -
State: closed - Opened by comeby 11 months ago
- 1 comment
#58 - Qwen-14B INT4-AWQ 用tp=2时量化失败
Issue -
State: closed - Opened by comeby 11 months ago
- 1 comment
#57 - 使用官方的Qwen-xxB-Chat-Int4转TRT,都用greedy sereach,TRT和torch的结果不一致正常吗
Issue -
State: open - Opened by byjswr 11 months ago
- 9 comments
#57 - 使用官方的Qwen-xxB-Chat-Int4转TRT,都用greedy sereach,TRT和torch的结果不一致正常吗
Issue -
State: open - Opened by byjswr 11 months ago
- 9 comments
#56 - summarize.py运行解答
Issue -
State: closed - Opened by lyc728 11 months ago
- 1 comment
#56 - summarize.py运行解答
Issue -
State: closed - Opened by lyc728 11 months ago
- 1 comment
#55 - Qwen-14B-chat 多batch 报错
Issue -
State: closed - Opened by zhisunyy 11 months ago
- 3 comments
#55 - Qwen-14B-chat 多batch 报错
Issue -
State: closed - Opened by zhisunyy 11 months ago
- 3 comments
#54 - 使用autodl编译tensorrt-llm有问题
Issue -
State: closed - Opened by oreo-lp 11 months ago
- 6 comments
#53 - Use official int4 weights, e.g. Qwen-1_8B-Chat-Int4 model(recommended) - Build TRT-LLM engine
Issue -
State: closed - Opened by byjswr 11 months ago
- 6 comments
#52 - 想使用baichuan2部署api的话该修改什么地方适配百川模型呢?
Issue -
State: closed - Opened by secain 11 months ago
- 3 comments
#51 - Triton的显存占用是TensorRT—llm的两倍
Issue -
State: open - Opened by lyc728 11 months ago
- 20 comments