Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / fasterdecoding/snapkv issues and pull requests

#28 - Bug on Qwen2-VL

Issue - State: open - Opened by LiJunscs 23 days ago

#27 - The first generation token output sees the whole cache key and value

Issue - State: open - Opened by PengWenChen 27 days ago - 3 comments

#26 - Llama-3 Implementation

Issue - State: closed - Opened by kunlun531 2 months ago

#25 - why not use the last token for kv cache compression

Issue - State: open - Opened by Arist12 2 months ago

#24 - Question: is key_state_compressed used for inference?

Issue - State: open - Opened by jq-wei 2 months ago - 1 comment

#22 - Group Query Attention

Issue - State: open - Opened by SimJeg 4 months ago - 4 comments

#21 - Question on H2O experiment reproduction

Issue - State: open - Opened by CUHKSZzxy 6 months ago

#20 - Closed issue

Issue - State: closed - Opened by JulietLJY 7 months ago

#17 - observation window size and consistency between layers

Issue - State: closed - Opened by Cooperx521 8 months ago - 1 comment

#16 - Question on GQA implementation

Issue - State: open - Opened by cyLi-Tiger 8 months ago - 1 comment

#15 - Can I use the SnapKV without the flash-attention?

Issue - State: closed - Opened by pengshuang 8 months ago - 1 comment

#14 - What prompt was used in Needle in a Haystack test?

Issue - State: closed - Opened by 66RING 8 months ago - 1 comment

#12 - Can't not run longbench!

Issue - State: open - Opened by HarryWu99 8 months ago - 3 comments

#11 - why only decode do compress?

Issue - State: open - Opened by CSEEduanyu 9 months ago

#8 - Observation

Pull Request - State: closed - Opened by leeyeehoo 9 months ago

#7 - yl: remove unnessecary

Pull Request - State: closed - Opened by leeyeehoo 9 months ago

#6 - yl: fix a bug

Pull Request - State: closed - Opened by leeyeehoo 9 months ago

#5 - yl: fix typo

Pull Request - State: closed - Opened by leeyeehoo 9 months ago

#4 - Grouped query attention implementation

Issue - State: closed - Opened by guozhiyu 9 months ago - 1 comment

#3 - maybe a bug in `update_kv` function

Issue - State: open - Opened by HarryWu99 9 months ago - 1 comment

#2 - The effect of Clustering via Pooling may be greater?

Issue - State: open - Opened by HarryWu99 9 months ago - 1 comment