Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / lucidrains/megabyte-pytorch issues and pull requests
#22 - Is Code Correcct?
Issue -
State: open - Opened by jaytimbadia 1 day ago
#21 - Missing package in setup
Issue -
State: closed - Opened by eegli about 1 month ago
- 3 comments
#20 - Allow usage as single-stage transformer
Issue -
State: closed - Opened by eegli 5 months ago
- 1 comment
#19 - Regression in padding value and loss calculation
Issue -
State: closed - Opened by eegli 5 months ago
- 3 comments
#18 - chore: update flash attention config
Pull Request -
State: closed - Opened by eegli 5 months ago
- 2 comments
#17 - fix: padding and positional embeddings
Pull Request -
State: closed - Opened by eegli 5 months ago
- 3 comments
#16 - Why does it expect tokens?
Issue -
State: closed - Opened by tonydavis629 about 1 year ago
- 1 comment
#15 - GPU used for original paper experiments
Issue -
State: closed - Opened by itsnamgyu about 1 year ago
- 1 comment
#14 - Evaluation metric bits-per-byte
Issue -
State: open - Opened by jxiw over 1 year ago
- 1 comment
#13 - Why your Attention impl use kv dimention dim_head instead of inner_dim?
Issue -
State: closed - Opened by Earthson over 1 year ago
- 1 comment
#12 - Training Results and Scaling
Issue -
State: open - Opened by MiscellaneousStuff over 1 year ago
- 1 comment
#11 - the patch embbeder implementations are different from the original paper
Issue -
State: closed - Opened by mikegreen7892003 over 1 year ago
- 4 comments
#10 - Minor shape error
Issue -
State: closed - Opened by anruigu over 1 year ago
- 1 comment
#9 - cleanup
Pull Request -
State: closed - Opened by lucidrains over 1 year ago
#8 - Paper variation
Pull Request -
State: closed - Opened by lucidrains over 1 year ago
#7 - some implementations are different from the original paper
Issue -
State: closed - Opened by ZihaoH over 1 year ago
- 2 comments
#6 - No available kernel error
Issue -
State: closed - Opened by missflash over 1 year ago
- 1 comment
#5 - translation of model sizes from paper to model definition
Issue -
State: open - Opened by winglian over 1 year ago
#4 - Some question about the MEGABYTE
Issue -
State: closed - Opened by relic-yuexi over 1 year ago
- 4 comments
#3 - the string is still divided into pieces
Issue -
State: closed - Opened by wac81 over 1 year ago
- 1 comment
#2 - What are the implications of this model?
Issue -
State: closed - Opened by kyegomez over 1 year ago
- 4 comments
#1 - Simplified batch handling in forward_empty()
Pull Request -
State: closed - Opened by arman-hk over 1 year ago
- 2 comments