Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / turboderp/exllamav2 issues and pull requests
#115 - Implement HyperAttention - Long-context Attention in Near-Linear Time: outperforms FlashAttention and offers up to 5x speedup on long contexts
Issue -
State: closed - Opened by kabachuha about 1 year ago
- 4 comments
#114 - Ninja build stopped
Issue -
State: closed - Opened by jayeshthk about 1 year ago
- 3 comments
#113 - Production of quantitative datasets and expansion of models
Issue -
State: closed - Opened by venxzw about 1 year ago
- 2 comments
#112 - support 8bit kv cache
Pull Request -
State: closed - Opened by zgce about 1 year ago
- 3 comments
#111 - Use Pytorch 2.1 for CUDA 11.8+ and ROCm builds
Pull Request -
State: closed - Opened by jllllll about 1 year ago
- 2 comments
#110 - Problem at _tsize
Issue -
State: closed - Opened by ParisNeo about 1 year ago
- 9 comments
#109 - Weird performance
Issue -
State: closed - Opened by jianyuheng about 1 year ago
- 2 comments
#108 - Streaming and Stop tokens for speculative sampling
Issue -
State: closed - Opened by CyberTimon about 1 year ago
- 6 comments
#107 - Early perplexity broken on higher quants. Gibberish outputs.
Issue -
State: closed - Opened by 11415142513152119 about 1 year ago
- 5 comments
#106 - Added zephyr chatformat
Pull Request -
State: closed - Opened by SinanAkkoyun about 1 year ago
- 5 comments
#105 - Error after the generation. AssertionError: Total sequence length exceeds cache size in model.forward
Issue -
State: closed - Opened by Rajmehta123 about 1 year ago
- 8 comments
#104 - Check if Ampere GPUs or newer before using flash-attn
Pull Request -
State: closed - Opened by oobabooga about 1 year ago
- 2 comments
#103 - Sliding Attention Window
Issue -
State: closed - Opened by anujnayyar1 about 1 year ago
- 7 comments
#102 - Batched Inference
Issue -
State: closed - Opened by anujnayyar1 about 1 year ago
- 1 comment
#101 - Endless flood of 'rfm max: x.xx bpw x.xx' on quant
Issue -
State: closed - Opened by discordianbelle about 1 year ago
- 3 comments
#100 - calling lora.unload() gives key error
Issue -
State: closed - Opened by Ph0rk0z about 1 year ago
- 4 comments
#99 - `regex` needs to be added to requirements/setup.py
Issue -
State: closed - Opened by andrewgross about 1 year ago
- 1 comment
#98 - undefined symbol error during inference
Issue -
State: closed - Opened by AmineDjeghri about 1 year ago
- 11 comments
#97 - [BUG] CUDA error: invalid configuration argument /exllamav2/exllamav2/exllamav2_ext/cuda/rope.cu 131
Issue -
State: closed - Opened by Facico about 1 year ago
- 7 comments
#96 - Feature Request: support for exllamav2 lora training
Issue -
State: closed - Opened by LZY-the-boys about 1 year ago
- 2 comments
#95 - Parallel decoding
Issue -
State: closed - Opened by nivibilla about 1 year ago
- 11 comments
#93 - Error with most recent changes to Sampler
Issue -
State: closed - Opened by Rajmehta123 about 1 year ago
- 2 comments
#92 - test_inference.py : AttributeError: module 'exllamav2_ext' has no attribute 'rms_norm'
Issue -
State: closed - Opened by DFuller134 about 1 year ago
- 13 comments
#91 - EXL2 quants at 4.65 bits in dual 3090 gpu´s
Issue -
State: closed - Opened by jostack about 1 year ago
- 4 comments
#90 - Dimensions overflow bug
Issue -
State: closed - Opened by grimulkan about 1 year ago
- 5 comments
#89 - Maximum bitrate
Issue -
State: closed - Opened by grimulkan about 1 year ago
- 3 comments
#88 - RAM requirements on quantization
Issue -
State: closed - Opened by Nikita-Sherstnev about 1 year ago
- 2 comments
#86 - Added ChatML format to chat.py
Pull Request -
State: closed - Opened by SinanAkkoyun about 1 year ago
- 7 comments
#85 - GPU Peer Fix needs to come back
Issue -
State: closed - Opened by andrewgross about 1 year ago
- 8 comments
#84 - Beam Search Implementation
Issue -
State: open - Opened by ChrisCates about 1 year ago
- 4 comments
#83 - test_inference.py is broken
Issue -
State: closed - Opened by 11415142513152119 about 1 year ago
- 16 comments
#82 - remake README+WS OPTIMIZE
Pull Request -
State: closed - Opened by Kerushii about 1 year ago
- 2 comments
#81 - Chat format: Recognize specified language and offloaded lexguessing to every newline
Pull Request -
State: closed - Opened by SinanAkkoyun about 1 year ago
- 2 comments
#80 - Using Venv or Dockerisation
Issue -
State: closed - Opened by bkutasi about 1 year ago
- 5 comments
#79 - README+WS OPTIMIZE
Pull Request -
State: closed - Opened by Kerushii about 1 year ago
#78 - WS update, readme puzzle
Pull Request -
State: closed - Opened by Kerushii about 1 year ago
#76 - Exclude python caches from git repository
Pull Request -
State: closed - Opened by SinanAkkoyun about 1 year ago
#75 - Fixed Speculative Generator
Pull Request -
State: closed - Opened by SinanAkkoyun about 1 year ago
- 2 comments
#74 - Fixed Speculative Generator
Pull Request -
State: closed - Opened by SinanAkkoyun about 1 year ago
- 1 comment
#72 - Feature request: Add support for phi-1_5 quantization
Issue -
State: closed - Opened by Nikita-Sherstnev about 1 year ago
- 1 comment
#71 - Code highlighting in chat CLI
Pull Request -
State: closed - Opened by SinanAkkoyun about 1 year ago
- 17 comments
#69 - Feature request: Tail-free sampling
Issue -
State: closed - Opened by jakaline-dev about 1 year ago
- 2 comments
#68 - test_inference.py PPL evaluation for >4K context
Issue -
State: closed - Opened by grimulkan about 1 year ago
- 1 comment
#64 - Exlv2 8.13bit max quants at very low context produce gibberish
Issue -
State: closed - Opened by 11415142513152119 about 1 year ago
#63 - Support AwQ quantization in the future?
Issue -
State: closed - Opened by yhyu13 about 1 year ago
- 1 comment
#61 - Build wheels with pre-compiled CUDA kernels
Pull Request -
State: closed - Opened by jllllll about 1 year ago
- 15 comments
#60 - will it be possible to make something like this for diffusion?
Issue -
State: closed - Opened by Shaistrong about 1 year ago
- 2 comments
#59 - Stream Bug
Issue -
State: closed - Opened by wangyu1997 about 1 year ago
- 3 comments
#58 - Grid number of GPTQ matrix reconstruction
Pull Request -
State: closed - Opened by chu-tianxiang about 1 year ago
- 1 comment
#57 - Possible bug in discarding overflow
Issue -
State: closed - Opened by Antollo about 1 year ago
- 1 comment
#56 - publish the releases to github
Issue -
State: closed - Opened by happysalada about 1 year ago
- 5 comments
#54 - Feature request: encode special tokens
Issue -
State: closed - Opened by vt404v2 about 1 year ago
- 8 comments
#53 - Feature request: typical_p
Issue -
State: closed - Opened by vt404v2 about 1 year ago
- 2 comments
#52 - RuntimeError: start (0) + length (32032) exceeds dimension size (32001).
Issue -
State: closed - Opened by Thireus about 1 year ago
- 2 comments
#51 - Pass stop words to a model
Issue -
State: closed - Opened by Nikita-Sherstnev about 1 year ago
- 2 comments
#50 - inference script starts producing bullshit at low temperatures
Issue -
State: closed - Opened by alimadelshin about 1 year ago
- 3 comments