Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / tensorflow/mesh issues and pull requests

#396 - Error while importing Meshtensorflow

Issue - State: closed - Opened by billygrahamram 11 months ago

#394 - Update attention.py

Pull Request - State: open - Opened by sjw8793 about 1 year ago - 1 comment

#393 - Optimizer momentums not properly populated training model with DTensors

Issue - State: closed - Opened by pentney about 1 year ago - 1 comment

#391 - Does load-balanced loss help the loss converge?

Issue - State: open - Opened by mathfinder over 1 year ago

#388 - feat(ci): enable `pip` caching in CI

Pull Request - State: closed - Opened by SauravMaheshkar over 1 year ago - 1 comment

#387 - Remove legacy references from `ops.py`.

Pull Request - State: closed - Opened by copybara-service[bot] almost 2 years ago

#386 - Remove legacy references from `ops.py`.

Pull Request - State: closed - Opened by copybara-service[bot] almost 2 years ago

#385 - Fix docstring typos

Pull Request - State: closed - Opened by copybara-service[bot] about 2 years ago - 1 comment

#384 - Enable multi-file inference

Pull Request - State: closed - Opened by copybara-service[bot] about 2 years ago - 1 comment

#382 - Internal change

Pull Request - State: closed - Opened by copybara-service[bot] over 2 years ago - 1 comment

#378 - mask_1_flat and mask_2_flat applied to gates twice?

Issue - State: open - Opened by marhlder over 2 years ago

#376 - Remove unused comments related to Python 2 compatibility.

Pull Request - State: closed - Opened by copybara-service[bot] over 2 years ago

#375 - Make TPU variable name deterministic.

Pull Request - State: closed - Opened by copybara-service[bot] over 2 years ago

#371 - Split out optimizer call for internal purposes.

Pull Request - State: closed - Opened by copybara-service[bot] almost 3 years ago

#370 - fix typo in logging statement.

Pull Request - State: closed - Opened by copybara-service[bot] almost 3 years ago

#369 - About the mixture of expert model

Issue - State: open - Opened by fym0503 almost 3 years ago

#368 - Mesh-tf model conversion to onnx?

Issue - State: open - Opened by b-analyst about 3 years ago - 2 comments

#367 - Minor comment fix to refer to the correct argument name.

Pull Request - State: open - Opened by copybara-service[bot] about 3 years ago
Labels: cla: yes

#366 - Make sure gates are not normalized for n=1 for top_n routing

Pull Request - State: closed - Opened by copybara-service[bot] about 3 years ago - 3 comments
Labels: cla: no

#365 - Fix some example code in readme for einsum operation

Pull Request - State: open - Opened by baragona about 3 years ago - 2 comments
Labels: cla: yes

#364 - How to freeze embedding layers

Issue - State: open - Opened by lintangsutawika about 3 years ago

#363 - Add a link to the Primer paper

Pull Request - State: closed - Opened by copybara-service[bot] about 3 years ago - 4 comments
Labels: cla: no

#362 - Beam search

Issue - State: open - Opened by antonio-mastropaolo about 3 years ago

#361 - Output raw model outputs during eval

Pull Request - State: open - Opened by craffel about 3 years ago
Labels: cla: yes

#360 - Add utility to save score predictions to TFRecords for scoring large datasets.

Pull Request - State: closed - Opened by copybara-service[bot] about 3 years ago
Labels: cla: yes

#359 - Save scores lazily.

Pull Request - State: open - Opened by copybara-service[bot] about 3 years ago
Labels: cla: yes

#358 - Remove unnecessary name and cwise in squared relu.

Pull Request - State: closed - Opened by copybara-service[bot] about 3 years ago
Labels: cla: yes

#357 - Expert Attention Fixes:

Pull Request - State: closed - Opened by copybara-service[bot] about 3 years ago - 3 comments
Labels: cla: no

#356 - Squared ReLU from Primer paper.

Pull Request - State: closed - Opened by copybara-service[bot] about 3 years ago
Labels: cla: yes

#355 - Internal

Pull Request - State: closed - Opened by copybara-service[bot] about 3 years ago - 18 comments
Labels: cla: no

#354 - Remove dataset checkpoint policy override now that b/181765832 is resolved.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#353 - Add more extensive top-2 logging.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#352 - Ability to add Custom Tensorflow Hooks

Issue - State: open - Opened by trisongz over 3 years ago

#351 - Only add z_loss to losses if during training.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#350 - Expert Attention Fixes:

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#349 - Fix bug in shared_kv attention for autoregressive decoding.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 2 comments
Labels: cla: no

#347 - heterogeneous mixture of experts layer

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 5 comments
Labels: cla: no

#346 - Add more options to Experts Attention. These options remove 1/3 of the all2all communication costs:

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 2 comments
Labels: cla: no

#344 - Add in Z-loss to all routing algorithms.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#343 - Minor changes to make Experts Attention work.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 6 comments
Labels: cla: no

#342 - MODE models with hetereogeneous expert width

Pull Request - State: open - Opened by copybara-service[bot] over 3 years ago - 1 comment
Labels: cla: no

#339 - Using the soft loss dtype instead of hardcoding bfloat16.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#338 - - Fix casting for NTLB.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#337 - Switch logging to warm to not fail when using deterministic dataset checkpointing.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#336 - Next gen fish optimizations for MeshTF.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 4 comments
Labels: cla: no

#335 - Add z-loss to the top_2_gating method.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#334 - Add z-loss to the top_2_gating method.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#332 - Add option to stochastically use the non-top expert during training.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#331 - Allow tokens embeddings to be used for routing decisions.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#327 - Make directory if it doesn't exist.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#326 - Splitting tokens when routing

Pull Request - State: open - Opened by copybara-service[bot] over 3 years ago - 2 comments
Labels: cla: no

#323 - Unique variable names for ParallelLayer

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#319 - Use %g instead of %f for printing in mesh_tensorflow/transformer/utils.py.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 5 comments
Labels: cla: no

#318 - performing the opposite of mtf.lowering

Issue - State: open - Opened by DavidPeleg6 over 3 years ago - 1 comment

#317 - Rolls back a change that broke several clients.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 4 comments
Labels: cla: no

#316 - Minor fix to make sure printing does not crash if a filter_fn is used.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#315 - Internal only change : )

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#314 - Explicitly pass named-arg to mtf.dropout

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#313 - Fix ALBERT arXiv URL

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#312 - [MTF] Minor usability change in get_inputs_from_file for accidentally empty files.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#311 - Add in z_loss for router softmax for switch layer.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#310 - try to create gin related flags and pass if the flags are created.

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#308 - no public changes

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#307 - Add flexible checkpoint loading option to allow for loading checkpoints

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago - 3 comments
Labels: cla: no

#302 - Operation to linearly anneal dropout rate between start_step and end_step

Pull Request - State: closed - Opened by copybara-service[bot] over 3 years ago
Labels: cla: yes

#291 - Add loss functions for multiple-target objectives for distillation.

Pull Request - State: open - Opened by copybara-service[bot] almost 4 years ago - 2 comments
Labels: cla: no

#290 - Use multiple target objectives for distillation. Also see cl/356382304

Pull Request - State: open - Opened by copybara-service[bot] almost 4 years ago - 2 comments
Labels: cla: no

#289 - Change get_replicated_var_handle to accept resource tensors instead of variables

Pull Request - State: closed - Opened by copybara-service[bot] almost 4 years ago
Labels: cla: yes

#283 - internal

Pull Request - State: open - Opened by copybara-service[bot] almost 4 years ago - 1 comment
Labels: cla: no

#281 - Decode Unicode strings in inference mode.

Pull Request - State: open - Opened by copybara-service[bot] almost 4 years ago - 1 comment
Labels: cla: no

#278 - the `model_executor.py` example is broken

Issue - State: closed - Opened by XMaster96 almost 4 years ago

#259 - Fixing model export breakage.

Pull Request - State: open - Opened by copybara-service[bot] almost 4 years ago - 1 comment
Labels: cla: no

#235 - Debug in mesh Tensorflow

Issue - State: open - Opened by patrickvonplaten about 4 years ago - 3 comments

#181 - Future of this project?

Issue - State: open - Opened by Mistobaan about 4 years ago - 2 comments