Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / bigcode-project/transformers issues and pull requests
#30 - add embed and residual dropout
Pull Request -
State: closed - Opened by RaymondLi0 about 1 year ago
#29 - For visibility: conversion scripts from fast-llm
Pull Request -
State: open - Opened by RaymondLi0 about 1 year ago
#28 - Starcoder2 model
Pull Request -
State: open - Opened by jlamypoirier about 1 year ago
#27 - log tensors
Pull Request -
State: open - Opened by RaymondLi0 about 1 year ago
#26 - change KV splitting based on Megatron-LM
Pull Request -
State: closed - Opened by suiyoubi about 1 year ago
#25 - For visibility: Gqa megatron rope
Pull Request -
State: open - Opened by RaymondLi0 about 1 year ago
#24 - Move megatron conversion script and add rope arguments
Pull Request -
State: open - Opened by loubnabnl about 1 year ago
- 4 comments
#23 - Make modeling compatible with Nanotron + few optims
Pull Request -
State: closed - Opened by NouamaneTazi over 1 year ago
- 3 comments
#22 - For visibility: conversion scripts for fast-llm
Pull Request -
State: closed - Opened by RaymondLi0 over 1 year ago
#21 - Conversion of MegatronLM checkpoint to HF transformer checkpoint fails. (ALIBI used during training)
Issue -
State: open - Opened by gagangayari over 1 year ago
#20 - Simplified kv caching
Pull Request -
State: open - Opened by jlamypoirier almost 2 years ago
#19 - Add flash attention
Pull Request -
State: open - Opened by jlamypoirier almost 2 years ago
#18 - Flash attention experiments
Pull Request -
State: open - Opened by jlamypoirier almost 2 years ago
#17 - Add back experimental features
Pull Request -
State: closed - Opened by jlamypoirier almost 2 years ago
#16 - Diff from Huggingface main
Pull Request -
State: open - Opened by jlamypoirier almost 2 years ago
#15 - Transformers can no longer load santacoder-fast-inference model
Issue -
State: open - Opened by beale201 almost 2 years ago
#14 - Add gpu optimizations to base model
Pull Request -
State: closed - Opened by jlamypoirier almost 2 years ago
#13 - More optimizations
Pull Request -
State: closed - Opened by jlamypoirier almost 2 years ago
#12 - Running Santcoder-fast-inference throws UserWarning: FALLBACK path has been taken inside
Issue -
State: open - Opened by murthyrudra almost 2 years ago
- 1 comment
#11 - add test to ensure mqa and mha have the same behaviour
Pull Request -
State: closed - Opened by minimario almost 2 years ago
#10 - Upcasting, scaling, masking and fused kernels to match Megatron-LM
Pull Request -
State: closed - Opened by jlamypoirier almost 2 years ago
#9 - Add santacoder model
Pull Request -
State: closed - Opened by jlamypoirier almost 2 years ago
- 1 comment
#8 - Megatron conversion script
Pull Request -
State: closed - Opened by jlamypoirier about 2 years ago
#7 - Fast inference
Pull Request -
State: closed - Opened by jlamypoirier about 2 years ago
#6 - Fork the model into GPTBigCode
Pull Request -
State: closed - Opened by jlamypoirier about 2 years ago
- 1 comment
#5 - Fast inference
Pull Request -
State: closed - Opened by jlamypoirier about 2 years ago
#4 - Multi-query attention
Pull Request -
State: closed - Opened by jlamypoirier about 2 years ago
- 3 comments
#3 - Just to see the diff
Pull Request -
State: open - Opened by Muennighoff about 2 years ago
- 4 comments
#2 - add: 2 variants of multi query implementation; printing some details
Pull Request -
State: closed - Opened by bigximik over 2 years ago
#1 - Benchmark multi-query attention in HF transformers
Issue -
State: closed - Opened by harm-devries over 2 years ago
- 1 comment
Labels: inference, architecture