Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / NVIDIA/TensorRT-LLM issues and pull requests
#222 - Build engine for a different architecture than the current device
Issue -
State: closed - Opened by yunfeng-scale about 1 year ago
- 9 comments
Labels: good first issue, question, triaged
#197 - failed to load 'tensorrt_llm' version 1: Invalid argument: unable to find 'tensorrtllm/model.py' for model 'tensorrt_llm', in /opt/tritonserver/backends/tensorrtllm
Issue -
State: closed - Opened by AatroxZZ about 1 year ago
- 11 comments
Labels: triaged
#174 - Support for DeBerta
Issue -
State: open - Opened by kamalkraj about 1 year ago
- 2 comments
Labels: triaged, feature request, new model
#157 - Support for Zephyr 7B model
Issue -
State: open - Opened by rishabh279 about 1 year ago
- 7 comments
Labels: triaged, feature request, new model
#124 - [FEA] Support Roberta model
Issue -
State: open - Opened by esnvidia about 1 year ago
- 2 comments
Labels: triaged, feature request, new model
#101 - Benchmarking errors out for GPTJ_6B model on single L4 GPU due to CUDA ERROR: 2
Issue -
State: closed - Opened by RajeshThallam about 1 year ago
- 2 comments
#100 - Unable to compile Tensorrt_llm image,error msg is : error adding symbols: file in wrong format
Issue -
State: closed - Opened by WangxuP about 1 year ago
- 1 comment
Labels: triaged
#99 - parallel on 4 RTX3090 for llama 7b fails
Issue -
State: closed - Opened by forrestjgq about 1 year ago
- 5 comments
Labels: triaged
#98 - Weight cannot be divisible by tensor_parallel_size
Issue -
State: closed - Opened by haojiwei about 1 year ago
- 8 comments
Labels: triaged
#97 - Compilation issue about weightOnlyBatchedGemvBs4Int8b.cu
Issue -
State: closed - Opened by isky-cd about 1 year ago
- 2 comments
Labels: triaged
#96 - How can we take advantage of in-flight batch with Python API to improve server throughput?
Issue -
State: closed - Opened by gesanqiu about 1 year ago
- 1 comment
#95 - Error build llama-2-7b-chat-hf:first input has type Half but second input has type Float
Issue -
State: closed - Opened by a1164714 about 1 year ago
- 7 comments
Labels: triaged
#94 - error: libtensorrt_llm_batch_manager_static.a: file format not recting as linker script
Issue -
State: closed - Opened by WangxuP about 1 year ago
- 2 comments
Labels: triaged
#93 - does the chatglm2-6B example support codegeex2-6b's building and running?
Issue -
State: closed - Opened by thendwk about 1 year ago
- 9 comments
Labels: bug, triaged
#92 - gmake: Makefile: No such file or directory
Issue -
State: closed - Opened by Kelsey2018 about 1 year ago
- 6 comments
Labels: triaged
#91 - Error when quantize llama-2-7b-chat-hf with format `int4_awq`
Issue -
State: closed - Opened by gesanqiu about 1 year ago
- 8 comments
Labels: triaged
#90 - stop words can not work. how to set it
Issue -
State: closed - Opened by yoyopdc about 1 year ago
- 25 comments
Labels: triaged
#89 - Failed to launch Triton for Llama
Issue -
State: closed - Opened by sleepwalker2017 about 1 year ago
- 18 comments
Labels: triaged
#88 - set max_input_len, max_output_len=4096, But The actual input cannot reach this level
Issue -
State: closed - Opened by callmezhangchenchenokay about 1 year ago
- 2 comments
Labels: bug, triaged
#87 - In addition to these models in /examples folder, are other models supported? Qwen-7B or Qwen-14B
Issue -
State: closed - Opened by ThinkPadRiver about 1 year ago
- 2 comments
#86 - [Feature Request] Support InternLM Model
Issue -
State: closed - Opened by vansin about 1 year ago
- 3 comments
#85 - Failed to Run batch size larger than 8 with LLaMA 13B
Issue -
State: closed - Opened by CarsonSo about 1 year ago
- 12 comments
Labels: duplicate, triaged
#84 - Building chatglm2-6b on single node multi gpus failed.
Issue -
State: closed - Opened by shaunxiong about 1 year ago
- 1 comment
Labels: triaged
#83 - cuda driver version
Issue -
State: closed - Opened by Kelsey2018 about 1 year ago
- 6 comments
Labels: bug, triaged
#82 - errors arised when build docker image
Issue -
State: closed - Opened by shanekong about 1 year ago
- 8 comments
#81 - Chinese, Korean, and other Asian languages not working
Issue -
State: closed - Opened by michaelroyzen about 1 year ago
- 6 comments
Labels: triaged
#80 - How to output intermediate result of model?
Issue -
State: closed - Opened by yukavio about 1 year ago
- 2 comments
Labels: good first issue, triaged
#79 - Complex beam search
Issue -
State: closed - Opened by haramjo about 1 year ago
- 6 comments
Labels: triaged
#78 - Donglu branch
Pull Request -
State: closed - Opened by dongluw about 1 year ago
#77 - segmentation fault : llama 2 with num_beams > 1
Issue -
State: closed - Opened by JosephChenHub about 1 year ago
- 5 comments
Labels: triaged
#76 - Add Python bindings to `GptManager`
Pull Request -
State: open - Opened by linden-li about 1 year ago
#75 - Performance decay when using paged attention
Issue -
State: closed - Opened by sleepwalker2017 about 1 year ago
- 9 comments
Labels: bug, triaged
#74 - Will the batch_manager be open-sourced in the future?
Issue -
State: closed - Opened by chenzhengda about 1 year ago
- 1 comment
#73 - How to benchmark offline throughput?
Issue -
State: open - Opened by zhaoyang-star about 1 year ago
- 6 comments
Labels: question, triaged
#72 - Build Failure: libtensorrt_llm_batch_manager_static.a:1: syntax error
Issue -
State: closed - Opened by CarsonSo about 1 year ago
- 5 comments
Labels: triaged
#71 - How to build the docker image
Issue -
State: closed - Opened by LeoCeasar about 1 year ago
- 6 comments
Labels: triaged
#70 - How to build Tensorrt-LLM without docker?
Issue -
State: closed - Opened by beginlner about 1 year ago
- 1 comment
Labels: question, triaged
#69 - CMake Error at CMakeLists.txt:288 (file): file STRINGS file "/usr/local/tensorrt/include/NvInferVersion.h" cannot be read.
Issue -
State: closed - Opened by tlogn about 1 year ago
- 4 comments
Labels: triaged
#68 - Llama 2 with LoRA
Issue -
State: closed - Opened by JosephChenHub about 1 year ago
- 11 comments
Labels: triaged, feature request
#67 - I'm benchmarking llama-7b with 8 batch size in A40,but oom happened,I'm curious why 7B model need to cost too much memory?
Issue -
State: closed - Opened by qihang720 about 1 year ago
- 8 comments
Labels: triaged
#66 - gptManagerBenchmark std::bad_alloc error
Issue -
State: closed - Opened by clockfly about 1 year ago
- 19 comments
Labels: triaged
#65 - Difference between max_batch_size in the engine builder and max_num_sequences in TrtGptModelOptionalParams?
Issue -
State: closed - Opened by michaelroyzen about 1 year ago
- 6 comments
Labels: question
#64 - build failure: libnvparsers not found in tensorRT 9.1.0.4
Issue -
State: closed - Opened by forrestjgq about 1 year ago
- 8 comments
Labels: triaged
#63 - Improved Readability of Readme.md File
Pull Request -
State: closed - Opened by Sanyam-2026 about 1 year ago
- 1 comment
Labels: documentation
#62 - Nvidia Jetson device Support
Issue -
State: open - Opened by shahizat about 1 year ago
- 31 comments
Labels: triaged, feature request
#61 - Question:Why cannot the topk exceed 1024?
Issue -
State: closed - Opened by SuperCB about 1 year ago
- 1 comment
Labels: question, triaged
#60 - Update windows related documentation to main branch
Pull Request -
State: closed - Opened by juney-nvidia about 1 year ago
#59 - Update windows related documentation
Pull Request -
State: closed - Opened by juney-nvidia about 1 year ago
#58 - Perceived
Pull Request -
State: closed - Opened by RichardScottOZ about 1 year ago
- 1 comment
Labels: documentation
#57 - Seeking Clarification on TensorRT-LLM Workflow for Extending Model Support
Issue -
State: closed - Opened by robosina about 1 year ago
- 1 comment
#56 - Fix typo in batchScheduler.h
Pull Request -
State: closed - Opened by eltociear about 1 year ago
- 1 comment
Labels: documentation
#54 - starcoder engine building failed
Issue -
State: closed - Opened by Missmiaom about 1 year ago
- 2 comments
#53 - wrong output in GPT2 example
Issue -
State: closed - Opened by r3sist-uniq about 1 year ago
- 6 comments
Labels: triaged
#52 - Docker image
Issue -
State: closed - Opened by SehajDxstiny about 1 year ago
- 9 comments
Labels: triaged
#51 - Handling kv-cache in multi-modal GPT
Issue -
State: closed - Opened by Selimonder about 1 year ago
- 7 comments
Labels: triaged
#50 - x86_64-conda_cos6-linux-gnu-cc: command not found
Issue -
State: open - Opened by JiushengChen about 1 year ago
- 2 comments
Labels: triaged
#49 - Mistral 7B support
Issue -
State: closed - Opened by casper-hansen about 1 year ago
- 7 comments
Labels: triaged, feature request
#48 - Error on docker file build
Issue -
State: closed - Opened by robosina about 1 year ago
- 2 comments
#47 - Any support for RWKV plz?
Issue -
State: open - Opened by Pevernow about 1 year ago
- 17 comments
Labels: triaged, feature request, new model
#46 - fix Forward Compatibility mode is UNAVAILABLE error
Pull Request -
State: closed - Opened by BasicCoder about 1 year ago
- 2 comments
#45 - Failed to build with CUDA 11.8 due to change in cudaGraphExecUpdate parameter
Issue -
State: closed - Opened by haojiwei about 1 year ago
- 7 comments
Labels: triaged
#44 - Failed to run benchmark llama-7b.
Issue -
State: closed - Opened by matichon-vultureprime about 1 year ago
- 2 comments
#43 - Methods to evaluate Throughput (tokens/s)
Issue -
State: closed - Opened by tiandiao123 about 1 year ago
- 1 comment
#42 - Failed to run batch inference
Issue -
State: closed - Opened by sleepwalker2017 about 1 year ago
- 13 comments
Labels: triaged
#41 - TensorRT-LLM Releases
Issue -
State: closed - Opened by jdemouth-nvidia about 1 year ago
Labels: documentation
#40 - Update docs/source/batch_manager.md
Pull Request -
State: closed - Opened by kaiyux about 1 year ago
#39 - run tritonserver failed for chatglm2
Issue -
State: closed - Opened by Lzhang-hub about 1 year ago
- 24 comments
Labels: triaged
#38 - Cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/21 to main branch
Pull Request -
State: closed - Opened by kaiyux about 1 year ago
#37 - Baichuan V2 13B,INT4 weight only, 输出可能有问题
Issue -
State: closed - Opened by callmezhangchenchenokay about 1 year ago
- 20 comments
Labels: bug, triaged
#36 - Building in TensorRT docker container
Issue -
State: closed - Opened by makaveli10 about 1 year ago
- 9 comments
Labels: triaged
#35 - TensorRT 9 not available
Issue -
State: closed - Opened by david-PHR about 1 year ago
- 5 comments
Labels: triaged
#34 - Attribute issue while executing build.py file
Issue -
State: closed - Opened by atharvnagrikar about 1 year ago
- 18 comments
Labels: triaged
#33 - Will it be integrated into tritonserver-inference?
Issue -
State: closed - Opened by coderchem about 1 year ago
- 4 comments
Labels: triaged
#32 - Build failures
Issue -
State: closed - Opened by divchenko about 1 year ago
- 17 comments
Labels: bug, triaged
#31 - Does the project team plan to support MiniGPT4?
Issue -
State: closed - Opened by xiexiaoshinick about 1 year ago
- 1 comment
#30 - Fix link jump in windows readme.md
Pull Request -
State: closed - Opened by yuanlehome about 1 year ago
- 3 comments
#29 - Tactic running out of memory during Code Llama 34B build
Issue -
State: closed - Opened by michaelroyzen about 1 year ago
- 18 comments
Labels: triaged
#28 - a error when build tensorrt_llm
Issue -
State: closed - Opened by nanmi about 1 year ago
- 3 comments
Labels: triaged
#27 - can i use this for GPT-2?
Issue -
State: closed - Opened by SehajDxstiny about 1 year ago
- 1 comment
Labels: triaged
#26 - Will this repo support A10 in the future?
Issue -
State: closed - Opened by Fangzhou-Ai about 1 year ago
- 20 comments
Labels: bug, triaged
#25 - Support Medusa Sampling
Issue -
State: closed - Opened by forpanyang about 1 year ago
- 10 comments
Labels: Community want to contribute
#24 - Speed compare with vllm?
Issue -
State: closed - Opened by lucasjinreal about 1 year ago
- 6 comments
Labels: triaged
#23 - ERROR: This container was built for NVIDIA Driver Release 535.86 or later, compatibility mode is UNAVAILABLE.
Issue -
State: closed - Opened by sleepwalker2017 about 1 year ago
- 3 comments
Labels: triaged
#22 - Can we install TensorRT-LLM without docker?
Issue -
State: closed - Opened by ifromeast about 1 year ago
- 24 comments
Labels: triaged
#21 - Fix two deadlinks in README.md
Pull Request -
State: closed - Opened by wangkuiyi about 1 year ago
#20 - Main doc fix
Pull Request -
State: closed - Opened by juney-nvidia about 1 year ago
#19 - Fix small doc issue
Pull Request -
State: closed - Opened by juney-nvidia about 1 year ago
#18 - Include a working Windows wheel link..
Issue -
State: closed - Opened by oscarbg about 1 year ago
- 5 comments
Labels: triaged
#17 - Where is download for Tensorrt 9.1.0.4 for Windows?
Issue -
State: closed - Opened by oscarbg about 1 year ago
- 1 comment
#16 - Huggingface Transformers version should be bumped
Issue -
State: closed - Opened by michaelroyzen about 1 year ago
- 2 comments
Labels: bug, triaged
#15 - Fix the link to the documentation
Pull Request -
State: closed - Opened by jdemouth about 1 year ago
#14 - revise the homepage (release)
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago
#13 - revise the homepage (main)
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago
- 1 comment
#12 - add git-lfs dependency for binaries (main branch)
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago
#11 - add git-lfs dependency for binaries (release)
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago
#10 - update aarch64 batch manager libraries to release/0.5.0
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago
#9 - update aarch64 batch manager libraries to main
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago
#8 - Fix memory leak in falcon weight loader
Pull Request -
State: closed - Opened by kaiyux about 1 year ago
#7 - update aarch64 libraries to release/0.5.0 branch
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago
#6 - update aarch64 libraries to main branch
Pull Request -
State: closed - Opened by Shixiaowei02 about 1 year ago