Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / microsoft/VPTQ issues and pull requests

#124 - Update model_base.py

Pull Request - State: closed - Opened by YangWang92 6 days ago

#123 - fix setup version number

Pull Request - State: closed - Opened by wejoncy 6 days ago

#122 - fix version__

Pull Request - State: closed - Opened by wejoncy 6 days ago

#121 - Bump to 0.0.4

Pull Request - State: closed - Opened by wejoncy 7 days ago

#120 - fix config format for transformers

Pull Request - State: closed - Opened by wejoncy 7 days ago

#119 - When I use the parameter npercent=1 to quantize the model, I have the following problem:

Issue - State: closed - Opened by half-lang 9 days ago - 2 comments
Labels: question

#118 - Enhance the implementation of the CUDA inference kernel.

Issue - State: open - Opened by haruhi55 12 days ago
Labels: features

#117 - Update README.md

Pull Request - State: closed - Opened by YangWang92 13 days ago

#116 - VLM Support

Issue - State: closed - Opened by YangWang92 18 days ago - 1 comment
Labels: new models

#115 - Huggingface Transformer Support

Issue - State: open - Opened by YangWang92 18 days ago - 1 comment
Labels: inference

#114 - CPU support

Issue - State: open - Opened by YangWang92 18 days ago
Labels: inference

#113 - Custom Model support

Issue - State: open - Opened by huangtingwei9988 18 days ago - 3 comments
Labels: question, new models

#112 - Add CUDA_HOME instructions to README

Pull Request - State: closed - Opened by caronzh03 19 days ago

#111 - Docker image for development

Issue - State: closed - Opened by caronzh03 20 days ago - 6 comments
Labels: question

#110 - update algorithm

Pull Request - State: closed - Opened by YangWang92 24 days ago

#109 - Update README.md

Pull Request - State: closed - Opened by YangWang92 24 days ago

#108 - Update README.md

Pull Request - State: closed - Opened by YangWang92 24 days ago

#107 - Update README.md

Pull Request - State: closed - Opened by YangWang92 24 days ago

#106 - update pyproject

Pull Request - State: closed - Opened by YangWang92 24 days ago

#105 - update algorithm

Pull Request - State: closed - Opened by YangWang92 24 days ago

#104 - add package info

Pull Request - State: closed - Opened by YangWang92 24 days ago

#103 - Update vqlinear.py

Pull Request - State: closed - Opened by laomao0 25 days ago

#102 - Add evaluation codes

Issue - State: open - Opened by YangWang92 25 days ago
Labels: enhancement

#101 - index unpack problem

Issue - State: closed - Opened by laomao0 25 days ago - 2 comments

#100 - update version

Pull Request - State: closed - Opened by YangWang92 27 days ago

#99 - update version

Pull Request - State: closed - Opened by YangWang92 27 days ago

#98 - fix compiling error

Pull Request - State: closed - Opened by YangWang92 28 days ago

#97 - Set __version__

Pull Request - State: closed - Opened by YangWang92 28 days ago

#96 - fix format

Pull Request - State: closed - Opened by YangWang92 28 days ago

#95 - fix format

Pull Request - State: closed - Opened by YangWang92 28 days ago

#94 - Update README.md

Pull Request - State: closed - Opened by YangWang92 28 days ago

#93 - init algorithm

Pull Request - State: closed - Opened by YangWang92 28 days ago

#92 - update main

Pull Request - State: closed - Opened by YangWang92 28 days ago

#91 - Init quantization algorithm

Pull Request - State: closed - Opened by YangWang92 28 days ago

#90 - Use absolute imports

Pull Request - State: closed - Opened by bndos 28 days ago - 3 comments

#88 - Add FP8/INT8 support

Issue - State: open - Opened by YangWang92 30 days ago
Labels: alogrithm

#87 - about permutation and quant channel

Issue - State: closed - Opened by laomao0 about 1 month ago - 3 comments
Labels: question, alogrithm

#86 - Slow Inference on A2 with VPTQ Compared to Ollama

Issue - State: closed - Opened by WpythonW about 1 month ago - 5 comments
Labels: question, inference

#85 - add acknowledgement and disclaimer

Pull Request - State: closed - Opened by YangWang92 about 1 month ago

#84 - add math example

Pull Request - State: closed - Opened by YangWang92 about 1 month ago

#83 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 1 month ago

#82 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 1 month ago

#81 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 1 month ago

#80 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 1 month ago

#79 - Improve hessian/invhessian collection

Issue - State: open - Opened by YangWang92 about 1 month ago - 3 comments
Labels: alogrithm

#78 - New Quantized Model Request

Issue - State: closed - Opened by JoesSattes about 1 month ago - 3 comments
Labels: question, new models

#77 - Add Ollama/llama.cpp/ggml support

Issue - State: open - Opened by YangWang92 about 1 month ago
Labels: inference

#76 - Add VLM/Multimodality support

Issue - State: open - Opened by YangWang92 about 1 month ago
Labels: new models, alogrithm

#75 - Add vLLM support

Issue - State: open - Opened by YangWang92 about 1 month ago - 2 comments
Labels: inference

#74 - update device map

Pull Request - State: closed - Opened by YangWang92 about 1 month ago

#73 - Multiple GPU Support

Issue - State: closed - Opened by twoxfh about 1 month ago - 3 comments
Labels: bug, question

#72 - Update setup.py

Pull Request - State: closed - Opened by YangWang92 about 1 month ago

#71 - RuntimeError: un-supported index_bits:10

Issue - State: closed - Opened by Excuses123 about 1 month ago - 3 comments

#70 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 1 month ago

#69 - refine online demo

Pull Request - State: closed - Opened by YangWang92 about 1 month ago

#68 - Update setup.py

Pull Request - State: closed - Opened by YangWang92 about 1 month ago

#67 - Is it possible to run VPTQ in CPU only?

Issue - State: closed - Opened by yueqianh about 1 month ago - 5 comments
Labels: question, inference

#66 - improve web app demo

Pull Request - State: closed - Opened by YangWang92 about 1 month ago

#65 - rocm fix

Pull Request - State: closed - Opened by wejoncy about 1 month ago

#64 - Is this a typo in the paper?

Issue - State: closed - Opened by FdyCN about 1 month ago - 2 comments

#63 - support rocm

Pull Request - State: closed - Opened by wejoncy about 1 month ago

#62 - may be some issuses about llama-2-7b model.

Issue - State: closed - Opened by laomao0 about 2 months ago - 5 comments

#61 - Revert "fix offload bug in accelerator"

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#60 - fix offload bug in accelerator

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#59 - 'centroids must be a CUDA tensor' error when running Qwen2.5-72B-Instruct 2-bit in RTX4090

Issue - State: closed - Opened by yueqianh about 2 months ago - 2 comments
Labels: bug, question

#58 - Delete models directory

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#57 - Possibility of quantising multimodal models like Qwen-VL and llama 3.2 Vision?

Issue - State: closed - Opened by yueqianh about 2 months ago - 4 comments
Labels: question, alogrithm

#56 - How can I use VPTQ to quantize my own models?

Issue - State: closed - Opened by IEI-mjx about 2 months ago - 15 comments
Labels: question, alogrithm

#55 - update installation

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#54 - CUDA kernel not found, please check CUDA and VPTQ installation

Issue - State: closed - Opened by laomao0 about 2 months ago - 7 comments
Labels: question

#53 - Does not work in Oobabooga

Issue - State: open - Opened by Kaszebe about 2 months ago - 1 comment
Labels: question, inference

#52 - cc1plus: out of memory

Issue - State: closed - Opened by DietmarGrabowski about 2 months ago - 5 comments
Labels: question

#51 - support bf16

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#50 - add gpu monitor at web app

Pull Request - State: closed - Opened by TITC about 2 months ago - 3 comments

#49 - add catlog and index for readme

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#48 - add notebook

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#47 - docs: update README.md

Pull Request - State: closed - Opened by eltociear about 2 months ago - 1 comment

#46 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 2 months ago

#45 - add sm_89

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#44 - RuntimeError: un-supported dtype: bfloat16

Issue - State: closed - Opened by dillfrescott about 2 months ago - 16 comments
Labels: question, investigate

#43 - Update README.md

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#42 - stdout captures and injects userwarnings into TextStreamer

Issue - State: open - Opened by JoeHelbing about 2 months ago - 3 comments
Labels: question

#41 - (Eventually) submit to exllama?

Issue - State: closed - Opened by Downtown-Case about 2 months ago - 3 comments
Labels: question, inference

#40 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 2 months ago

#39 - bump to 0.0.2

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#38 - support cuda-arch_list

Pull Request - State: closed - Opened by wejoncy about 2 months ago

#37 - update readme and tech report

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#36 - add prompt args and check cuda kernel

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#35 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 2 months ago

#34 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 2 months ago

#33 - Update README.md

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#32 - Update README.md

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#31 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 2 months ago

#30 - Update README.md

Pull Request - State: closed - Opened by YangWang92 about 2 months ago

#29 - Release date for quantization code?

Issue - State: closed - Opened by cakeng about 2 months ago - 13 comments
Labels: question, alogrithm

#28 - Update README.md

Pull Request - State: closed - Opened by OpenSourceRonin about 2 months ago

#27 - Patch 1

Pull Request - State: closed - Opened by OpenSourceRonin about 2 months ago

#26 - Llama-2-7b models

Issue - State: closed - Opened by laomao0 about 2 months ago - 5 comments
Labels: question, new models

#25 - Update README.md

Pull Request - State: closed - Opened by YangWang92 about 2 months ago