Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / google/maxtext issues and pull requests

#623 - change norm sharding

Pull Request - State: closed - Opened by ZhiyuLi-goog 10 months ago
Labels: pull ready

#622 - Assignment

Issue - State: closed - Opened by Cyberwoodd 10 months ago - 1 comment

#621 - Update jax.tree_map to jax.tree_util.tree_map

Pull Request - State: closed - Opened by RissyRan 10 months ago
Labels: pull ready

#620 - docs: update Run_MaxText_via_multihost_runner.md

Pull Request - State: open - Opened by eltociear 10 months ago - 1 comment

#619 - Update 128B config on v5e to use qkv_proj_offloaded remat_policy

Pull Request - State: closed - Opened by raymondzouu 10 months ago
Labels: pull ready

#618 - Change l2norm to use jnp.sqrt

Pull Request - State: closed - Opened by raymondzouu 10 months ago - 1 comment
Labels: pull ready

#617 - Correct the Run_Gemma.md path in README.md

Pull Request - State: open - Opened by hengtaoguo 10 months ago

#616 - Split Mixtral test into two scripts

Pull Request - State: closed - Opened by RissyRan 10 months ago
Labels: pull ready

#615 - Move AQTP pin up on GPU-pinned

Pull Request - State: closed - Opened by gobbleturk 10 months ago
Labels: pull ready

#614 - DEFAULT_MASK_VALUE causes gradient explosion and nan loss on deep models

Issue - State: open - Opened by logicchains 10 months ago - 2 comments
Labels: bug

#613 - Revert "Mark nvidia devtools repo as trusted"

Pull Request - State: open - Opened by chajath 10 months ago

#612 - Consolidate inference related logic under jetstream-maxtext

Issue - State: closed - Opened by ahg-g 10 months ago - 1 comment

#611 - Mark nvidia devtools repo as trusted

Pull Request - State: closed - Opened by chajath 10 months ago
Labels: pull ready

#610 - Allow inference microbenchmark to time prefill only

Pull Request - State: closed - Opened by morgandu 10 months ago
Labels: pull ready

#609 - Support LoRA training

Issue - State: open - Opened by hxssgaa 10 months ago - 2 comments
Labels: feature request

#608 - Update Run_MaxText_via_xpk.md

Pull Request - State: closed - Opened by RoshaniN 10 months ago
Labels: pull ready

#607 - Question: Gradient Accumulation

Issue - State: closed - Opened by thiagolaitz 10 months ago - 6 comments

#606 - Clarification: how does Llama-2-7b fit on a v4-8 when using Adam?

Issue - State: closed - Opened by rodrigo-f-nogueira 10 months ago - 3 comments

#605 - Support for RecurrentGemma

Issue - State: open - Opened by cyrilzakka 10 months ago
Labels: feature request

#603 - Add dummy patch

Pull Request - State: closed - Opened by ko3n1g 10 months ago

#602 - Run `codespell` using pre-commit

Pull Request - State: closed - Opened by khatwanimohit 10 months ago
Labels: pull ready

#601 - Update instructions for installing snap.

Pull Request - State: closed - Opened by RoshaniN 10 months ago
Labels: pull ready

#600 - Removes batch size from prefill attention calculation.

Pull Request - State: closed - Opened by patemotter 10 months ago
Labels: pull ready

#599 - Update First_run.md - Fixed broken links path

Pull Request - State: open - Opened by shivajid 10 months ago
Labels: pull ready

#598 - Move apt install from `rto_setup.sh` to `setup.sh`

Pull Request - State: closed - Opened by tonyjohnchen 10 months ago
Labels: pull ready

#597 - Fix Microbenchmark Profiling Memory Issues

Pull Request - State: closed - Opened by morgandu 10 months ago - 1 comment
Labels: pull ready

#596 - adding script to fix the style and adding modified/fixed files with l…

Pull Request - State: closed - Opened by ssusie 10 months ago
Labels: pull ready

#595 - Cannot do inference in float32

Issue - State: open - Opened by borisdayma 10 months ago - 2 comments
Labels: bug, good first issue

#594 - Support beam search

Issue - State: open - Opened by borisdayma 10 months ago
Labels: inference, feature request

#593 - Revert "Pinned build mode for GPU, with prebuilt Transformer Engine bdist"

Pull Request - State: closed - Opened by khatwanimohit 10 months ago
Labels: pull ready

#592 - add HF input pipeline

Pull Request - State: open - Opened by aireenmei 10 months ago

#591 - adding script to fix the style and adding modified/fixed files

Pull Request - State: closed - Opened by ssusie 11 months ago - 2 comments
Labels: pull ready

#590 - Reviewed MaxText README.md

Pull Request - State: open - Opened by mikegre-google 11 months ago - 2 comments
Labels: pull ready

#589 - Nina move tpu end-to-end test scripts to tpu folder

Pull Request - State: closed - Opened by NinaCai 11 months ago
Labels: pull ready

#588 - Call max_utils.get_project() only when Vertex Tensorboard is enabled

Pull Request - State: closed - Opened by SurbhiJainUSC 11 months ago
Labels: pull ready

#587 - Unify WORKDIR to /deps

Pull Request - State: closed - Opened by michelle-yooh 11 months ago
Labels: pull ready

#586 - Fix subset of hosts dataloading for TPU v4

Pull Request - State: closed - Opened by khatwanimohit 11 months ago
Labels: pull ready

#585 - Support Qwen1.5

Issue - State: closed - Opened by Muhtasham 11 months ago - 1 comment

#584 - [WIP] add batch inference

Pull Request - State: open - Opened by morgandu 11 months ago

#583 - Update Gemma 7b tests to use optionally internal GCS buckets for testing

Pull Request - State: closed - Opened by A9isha 11 months ago
Labels: pull ready

#582 - Add README for llama2-7B

Pull Request - State: closed - Opened by michelle-yooh 11 months ago
Labels: pull ready

#581 - Convert Orbax ckpt to HuggingFace

Pull Request - State: open - Opened by A9isha 11 months ago - 5 comments

#580 - Fix Gemma links

Pull Request - State: closed - Opened by khatwanimohit 11 months ago
Labels: pull ready

#579 - Gemma instructions were deleted in commit

Issue - State: closed - Opened by emergenz 11 months ago - 2 comments

#578 - separate tpu and gpu end-to-end scripts

Pull Request - State: closed - Opened by NinaCai 11 months ago
Labels: pull ready

#577 - README Updates

Pull Request - State: closed - Opened by rwitten 11 months ago
Labels: pull ready

#576 - Add llama2 configs for GPU A3

Pull Request - State: closed - Opened by michelle-yooh 11 months ago
Labels: pull ready

#575 - Match the dtype of sin and cos with inputs in RotaryEmbedding

Pull Request - State: closed - Opened by prrathi 11 months ago
Labels: pull ready

#574 - define default mode quantize_kvcache parameter

Pull Request - State: closed - Opened by kocchop 11 months ago - 1 comment
Labels: pull ready

#573 - Add minimal remat policy for flash attention

Pull Request - State: closed - Opened by michelle-yooh 11 months ago
Labels: pull ready

#572 - Issues running test_llama2_7b.sh on TPU VM v3-8

Issue - State: closed - Opened by korney3 11 months ago - 1 comment

#571 - Supported features

Issue - State: open - Opened by peregilk 11 months ago - 21 comments
Labels: feature request

#570 - Add functionality to automatically upload logs to Vertex Tensorboard

Pull Request - State: closed - Opened by SurbhiJainUSC 11 months ago - 1 comment
Labels: pull ready

#569 - Change docker upload schedule

Pull Request - State: closed - Opened by khatwanimohit 11 months ago
Labels: pull ready

#568 - Make Usage of Params Consistent v2

Pull Request - State: closed - Opened by anfals 11 months ago - 1 comment
Labels: pull ready

#567 - Add gobbleturk as code reviewer

Pull Request - State: closed - Opened by gobbleturk 11 months ago
Labels: pull ready

#566 - fix: typo

Pull Request - State: open - Opened by emergenz 11 months ago

#565 - Add llama2-13b tests

Pull Request - State: open - Opened by morgandu 11 months ago

#564 - Add docker support to maxtext base image

Pull Request - State: open - Opened by tonyjohnchen 11 months ago

#563 - Nina merging yuwei branch

Pull Request - State: closed - Opened by NinaCai 11 months ago

#562 - Revert "Make usage of params consistent"

Pull Request - State: closed - Opened by khatwanimohit 11 months ago
Labels: pull ready

#561 - Disabled a pylint check

Pull Request - State: closed - Opened by lukebaumann 11 months ago
Labels: pull ready

#560 - Support for T5

Issue - State: open - Opened by kishorenc 11 months ago - 6 comments
Labels: feature request

#559 - Fix typos

Pull Request - State: open - Opened by BioGeek 11 months ago - 4 comments

#558 - Fix readme for llama2

Pull Request - State: closed - Opened by A9isha 11 months ago
Labels: pull ready

#557 - Pin nvidia-cudnn-cu12==8.9.7.29

Pull Request - State: open - Opened by michelle-yooh 11 months ago

#556 - add validation for shardings

Pull Request - State: closed - Opened by ssusie 11 months ago
Labels: pull ready

#555 - cudnn flash attention GQA support

Pull Request - State: closed - Opened by kocchop 11 months ago - 2 comments
Labels: pull ready

#553 - fix: typo

Pull Request - State: closed - Opened by emergenz 11 months ago - 1 comment

#551 - Converting checkpoints

Issue - State: open - Opened by peregilk 11 months ago - 14 comments

#549 - Register the IFRT Proxy used for Pathways development.

Pull Request - State: closed - Opened by lukebaumann 11 months ago - 1 comment
Labels: pull ready

#548 - Add entropy to checkpoints generated by standalone checkpointer.

Pull Request - State: closed - Opened by RoshaniN 11 months ago - 2 comments
Labels: pull ready

#547 - Pinned build mode for GPU, with prebuilt Transformer Engine bdist

Pull Request - State: closed - Opened by chajath 11 months ago - 1 comment
Labels: pull ready

#544 - Add instructions for grain

Pull Request - State: closed - Opened by aireenmei 11 months ago
Labels: pull ready

#543 - Combine prefill/generate profiles; allow passing in profile name

Pull Request - State: closed - Opened by morgandu 11 months ago - 1 comment

#537 - Megatron style TFLOPs Calculation

Pull Request - State: open - Opened by abhinavgoel95 11 months ago - 2 comments

#531 - `attend_dtype` not used

Issue - State: open - Opened by zhixuan-lin 11 months ago - 1 comment
Labels: bug

#529 - Test decode and finetuning for Gemma

Pull Request - State: closed - Opened by khatwanimohit 11 months ago
Labels: pull ready

#524 - Add Llama2 13b

Pull Request - State: closed - Opened by morgandu 11 months ago

#522 - Cut a release branch for stable GPU runs WIP

Pull Request - State: open - Opened by chajath 11 months ago

#521 - Document use of Mistral

Issue - State: closed - Opened by borisdayma 11 months ago - 6 comments

#516 - Compatibility issue with tensorflow>=2.15.1 on GPU

Issue - State: closed - Opened by chajath 11 months ago - 1 comment

#501 - Consider installing local CUDA variant when building GPU image

Issue - State: closed - Opened by chajath 12 months ago - 1 comment

#498 - Adding FP8 support through Native XLA Support

Pull Request - State: closed - Opened by abhinavgoel95 12 months ago - 3 comments
Labels: pull ready

#496 - Adding support for KV Cache Quantization.

Pull Request - State: closed - Opened by singh-mitali 12 months ago - 1 comment

#482 - Goodput integration

Pull Request - State: closed - Opened by dipannita08 12 months ago - 2 comments
Labels: pull ready

#460 - Make small changes for running it via XPK.

Pull Request - State: closed - Opened by yangyuwei 12 months ago
Labels: pull ready

#453 - Ragged attention and test, not actively used.

Pull Request - State: open - Opened by patemotter 12 months ago

#418 - [DO NOT MERGE]test gamma only

Pull Request - State: closed - Opened by NinaCai about 1 year ago

#187 - Loading real data on subset of hosts

Pull Request - State: closed - Opened by khatwanimohit over 1 year ago - 6 comments
Labels: pull ready