Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / microsoft/VPTQ issues and pull requests
#171 - Question about Inference Speed
Issue -
State: closed - Opened by Flying-Cloud 6 days ago
- 4 comments
#170 - update algorithm with norm support
Pull Request -
State: closed - Opened by YangWang92 12 days ago
#169 - refactor (csrc): Restructure C++ code organization to facilitate adding new kernels
Pull Request -
State: closed - Opened by lcy-seso 13 days ago
#168 - Support different gpu for cuml
Pull Request -
State: closed - Opened by wejoncy 13 days ago
#167 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 15 days ago
#166 - fix(csrc): Remove strong dependency on specific Torch version.
Pull Request -
State: closed - Opened by lcy-seso 15 days ago
#165 - bump to 0.0.5
Pull Request -
State: closed - Opened by wejoncy 17 days ago
#164 - fix(cmake): building dynamic library for specified GPU architectures and support multi threads compile
Pull Request -
State: closed - Opened by lcy-seso 19 days ago
- 3 comments
#163 - Bugs: `setup.py` fails to correctly distribute built library
Issue -
State: closed - Opened by lcy-seso 20 days ago
Labels: bug
#162 - fix(build): fix the undefined symbols runtime error.
Pull Request -
State: closed - Opened by lcy-seso 20 days ago
#161 - Update README.md
Pull Request -
State: closed - Opened by wejoncy 20 days ago
#160 - quick fix
Pull Request -
State: closed - Opened by wejoncy 20 days ago
#159 - add phi-4 support
Pull Request -
State: closed - Opened by YangWang92 20 days ago
#158 - add tools
Pull Request -
State: closed - Opened by wejoncy 20 days ago
#157 - fix perm
Pull Request -
State: closed - Opened by wejoncy 20 days ago
#156 - add perm absorb
Pull Request -
State: closed - Opened by YangWang92 23 days ago
#155 - absorb perm
Pull Request -
State: closed - Opened by wejoncy 23 days ago
#154 - fix loading
Pull Request -
State: closed - Opened by wejoncy 24 days ago
#153 - update version
Pull Request -
State: closed - Opened by YangWang92 about 1 month ago
#152 - fix: Fix the bug where the Torch library is not correctly linked.
Pull Request -
State: closed - Opened by lcy-seso about 1 month ago
#151 - update setuptools version
Pull Request -
State: closed - Opened by YangWang92 about 1 month ago
#150 - fix a small bug in pack.py. (#148)
Pull Request -
State: closed - Opened by ForAxel about 1 month ago
#149 - fix(build): build using cmake.
Pull Request -
State: closed - Opened by lcy-seso about 1 month ago
- 2 comments
#148 - Potential BUG in pack.py?
Issue -
State: closed - Opened by ForAxel about 1 month ago
- 3 comments
#145 - π§ Refactor and optimize python code implementations.
Pull Request -
State: open - Opened by lcy-seso about 1 month ago
#144 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 about 1 month ago
#143 - update huggingface transformers support
Pull Request -
State: closed - Opened by YangWang92 about 1 month ago
#142 - Update README.md
Pull Request -
State: closed - Opened by OpenSourceRonin about 1 month ago
#141 - Update README.md
Pull Request -
State: closed - Opened by OpenSourceRonin about 2 months ago
#140 - [WIP] Fix: adaptation to make it work
Pull Request -
State: closed - Opened by JulienBalianSonos about 2 months ago
#139 - Adds nvembed model to quantization algorithm
Pull Request -
State: closed - Opened by bndos about 2 months ago
- 4 comments
#138 - Update vptq_example.ipynb
Pull Request -
State: closed - Opened by YangWang92 about 2 months ago
#137 - Colab demo: AttributeError: 'Qwen2Config' object has no attribute 'quant_config'
Issue -
State: closed - Opened by ch1y0q about 2 months ago
- 2 comments
Labels: bug
#136 - Question about fine-tuning
Issue -
State: open - Opened by kimwin2 2 months ago
- 4 comments
Labels: enhancement, question, alogrithm
#135 - ops.gemm
Issue -
State: closed - Opened by xzjwillbethin 2 months ago
- 1 comment
Labels: question
#134 - Detailed code of the implementation of ops.gemm && ops.dequant
Issue -
State: closed - Opened by xzjwillbethin 2 months ago
- 1 comment
#133 - Question about result reproduction
Issue -
State: closed - Opened by ShawnzzWu 2 months ago
- 1 comment
Labels: question
#132 - where are the layer-wise fine-tune codes in algorithm branch?
Issue -
State: closed - Opened by Huangdequ 2 months ago
- 1 comment
Labels: question
#131 - Question about the design motivation behind VPTQ
Issue -
State: closed - Opened by KoalaYuFeng 2 months ago
- 2 comments
Labels: question
#130 - tow stage code
Issue -
State: closed - Opened by xzjwillbethin 2 months ago
- 2 comments
Labels: question
#129 - Update README.md
Pull Request -
State: closed - Opened by wejoncy 2 months ago
#128 - How to dequantize a model with 4 groups and centroids greater than 4096?
Issue -
State: open - Opened by ShawnzzWu 2 months ago
- 6 comments
Labels: bug, question
#127 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 2 months ago
#126 - How to Generate a 2-bit Quantized Meta-Llama-3.1-8B-Instruct Model?
Issue -
State: open - Opened by ForAxel 2 months ago
- 7 comments
Labels: question
#125 - Sometimes models load very slowly
Issue -
State: closed - Opened by Jotakak-yu 2 months ago
- 9 comments
Labels: question
#124 - Update model_base.py
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#123 - fix setup version number
Pull Request -
State: closed - Opened by wejoncy 3 months ago
#122 - fix version__
Pull Request -
State: closed - Opened by wejoncy 3 months ago
#121 - Bump to 0.0.4
Pull Request -
State: closed - Opened by wejoncy 3 months ago
#120 - fix config format for transformers
Pull Request -
State: closed - Opened by wejoncy 3 months ago
#119 - When I use the parameter npercent=1 to quantize the model, I have the following problemοΌ
Issue -
State: closed - Opened by half-lang 3 months ago
- 2 comments
Labels: question
#118 - Enhance the implementation of the CUDA inference kernel.
Issue -
State: open - Opened by haruhi55 3 months ago
- 2 comments
Labels: features
#117 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#116 - VLM Support
Issue -
State: closed - Opened by YangWang92 3 months ago
- 1 comment
Labels: new models
#115 - Huggingface Transformer Support
Issue -
State: open - Opened by YangWang92 3 months ago
- 4 comments
Labels: inference
#114 - CPU support
Issue -
State: open - Opened by YangWang92 3 months ago
Labels: inference
#113 - Custom Model support
Issue -
State: open - Opened by huangtingwei9988 3 months ago
- 3 comments
Labels: question, new models
#112 - Add CUDA_HOME instructions to README
Pull Request -
State: closed - Opened by caronzh03 3 months ago
#111 - Docker image for development
Issue -
State: closed - Opened by caronzh03 3 months ago
- 6 comments
Labels: question
#110 - update algorithm
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#109 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#108 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#107 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#106 - update pyproject
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#105 - update algorithm
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#104 - add package info
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#103 - Update vqlinear.py
Pull Request -
State: closed - Opened by laomao0 3 months ago
#102 - Add evaluation codes
Issue -
State: open - Opened by YangWang92 3 months ago
Labels: enhancement
#101 - index unpack problem
Issue -
State: closed - Opened by laomao0 3 months ago
- 2 comments
#100 - update version
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#99 - update version
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#98 - fix compiling error
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#97 - Set __version__
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#96 - fix format
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#95 - fix format
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#94 - Update README.md
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#93 - init algorithm
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#92 - update main
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#91 - Init quantization algorithm
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#90 - Use absolute imports
Pull Request -
State: closed - Opened by bndos 3 months ago
- 3 comments
#89 - CUDA Kernel Not Found: Fallback to Torch Implementation During vptq Execution in Local Setup
Issue -
State: closed - Opened by bndos 3 months ago
#88 - Add FP8/INT8 support
Issue -
State: open - Opened by YangWang92 3 months ago
- 3 comments
Labels: alogrithm
#87 - about permutation and quant channel
Issue -
State: closed - Opened by laomao0 3 months ago
- 3 comments
Labels: question, alogrithm
#86 - Slow Inference on A2 with VPTQ Compared to Ollama
Issue -
State: closed - Opened by WpythonW 3 months ago
- 5 comments
Labels: question, inference
#85 - add acknowledgement and disclaimer
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#84 - add math example
Pull Request -
State: closed - Opened by YangWang92 3 months ago
#83 - Update README.md
Pull Request -
State: closed - Opened by OpenSourceRonin 3 months ago
#82 - Update README.md
Pull Request -
State: closed - Opened by OpenSourceRonin 3 months ago
#81 - Update README.md
Pull Request -
State: closed - Opened by OpenSourceRonin 3 months ago
#80 - Update README.md
Pull Request -
State: closed - Opened by OpenSourceRonin 3 months ago
#79 - Improve hessian/invhessian collection
Issue -
State: open - Opened by YangWang92 3 months ago
- 3 comments
Labels: alogrithm
#78 - New Quantized Model Request
Issue -
State: closed - Opened by JoesSattes 3 months ago
- 3 comments
Labels: question, new models
#77 - Add Ollama/llama.cpp/ggml support
Issue -
State: open - Opened by YangWang92 4 months ago
Labels: inference
#76 - Add VLM/Multimodality support
Issue -
State: open - Opened by YangWang92 4 months ago
Labels: new models, alogrithm
#75 - Add vLLM support
Issue -
State: open - Opened by YangWang92 4 months ago
- 3 comments
Labels: inference
#74 - update device map
Pull Request -
State: closed - Opened by YangWang92 4 months ago
#73 - Multiple GPU Support
Issue -
State: closed - Opened by twoxfh 4 months ago
- 3 comments
Labels: bug, question
#72 - Update setup.py
Pull Request -
State: closed - Opened by YangWang92 4 months ago
#71 - RuntimeError: un-supported index_bits:10
Issue -
State: closed - Opened by Excuses123 4 months ago
- 3 comments
#70 - Update README.md
Pull Request -
State: closed - Opened by OpenSourceRonin 4 months ago