BlackSamorez/tensor_parallel issues and pull requests

#137 - Compatibility with `transformers > 4.36`: error: `AttributeError: 'tuple' object has no attribute 'to_legacy_cache'`

Issue - State: open - Opened by Dr-Left about 1 month ago

#136 - Customized generate func support?

Issue - State: open - Opened by MonolithFoundation 5 months ago

#135 - when I use transformers==4.7.0，ValueError: TensorParallelPreTrainedModel does not support Flash Attention 2.0 yet。

Issue - State: open - Opened by Qiovo066 6 months ago

#134 - Add mixtral support

Pull Request - State: open - Opened by ReinForce-II 9 months ago

#133 - tensor_parallel int4 LLM is not working since release v2.0.0

Issue - State: open - Opened by ReinForce-II 9 months ago

#132 - Now, does tensor_parallel no longer support the huggingface trainer?

Issue - State: open - Opened by HanGyeol-Yoo 9 months ago

#131 - Can I use tensor_parallel to inference for a GPTQ quantized model?

Issue - State: open - Opened by minlik 11 months ago

#130 - No implement of generate() when using models from hugging face.

Issue - State: open - Opened by 342215448 11 months ago

#129 - TensorParallel object has no attribute save_pretrained

Issue - State: open - Opened by toufunao 11 months ago

#128 - No output when using tensor_parallel

Issue - State: open - Opened by yyya9 11 months ago - 1 comment

#127 - How to use the model in a scenario where it is stored in the Safetenors format?

Issue - State: closed - Opened by yxk9810 12 months ago

#126 - Out of GPU memory for two A10 GPUs

Issue - State: closed - Opened by JunyiYe 12 months ago - 1 comment

#125 - AttributeError: object has no attribute 'devices'

Issue - State: open - Opened by QiueY514 12 months ago

#124 - ValueError: Model parameters were moved to incorrect devices, did call on model.cuda() or model.to(device)? If so, please avoid doing that

Issue - State: open - Opened by Khyat 12 months ago

#123 - fix recursion error when setting tp_wrapped_module #122

Pull Request - State: open - Opened by Ar-Kareem 12 months ago - 4 comments

#122 - Max Recursion Error when using with lora

Issue - State: open - Opened by Ar-Kareem almost 1 year ago - 2 comments

#121 - RuntimeError: NCCL Error 3: internal error

Issue - State: open - Opened by smallmocha about 1 year ago - 1 comment

#120 - Segmentation fault (core dumped)

Issue - State: open - Opened by jameswu2014 about 1 year ago

#119 - Support of 8-bit and 4-bit quantization

Issue - State: closed - Opened by ludwigflo about 1 year ago - 1 comment

#118 - Would it suitable for the multi-GPU parallel inference for llama2?

Issue - State: open - Opened by aclie about 1 year ago

#117 - 2x slowdown using TP

Issue - State: open - Opened by jph00 about 1 year ago

#116 - RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!

Issue - State: open - Opened by SparkJiao about 1 year ago

#115 - distributed TP model forward output's requires_grad is False

Issue - State: open - Opened by lxuechen about 1 year ago - 5 comments

#114 - tensor_parallel method distributed=True

Issue - State: open - Opened by Johnno1011 about 1 year ago - 2 comments

#113 - Forwarding _prepare_model_inputs

Pull Request - State: closed - Opened by BlackSamorez about 1 year ago

#112 - model.generate() with inputs_embeds

Issue - State: closed - Opened by ZhaoxuanWu about 1 year ago - 3 comments

#111 - Fix false positive in tests for finding predefined TP config

Pull Request - State: closed - Opened by tonywang16 about 1 year ago - 2 comments

#110 - find_predefined_tensor_parallel_config try-except fix

Pull Request - State: closed - Opened by BlackSamorez about 1 year ago

#109 - Testing interfaces (soon to be refactored)

Pull Request - State: closed - Opened by BlackSamorez about 1 year ago

#108 - Fix get_llama_config adding model attribute error.

Pull Request - State: closed - Opened by tonywang16 about 1 year ago - 3 comments

#107 - Error loading LLAMA model config

Issue - State: closed - Opened by tonywang16 about 1 year ago

#106 - [WIP] ZeRO-3 refactoring (sharding)

Pull Request - State: closed - Opened by BlackSamorez about 1 year ago - 1 comment
Labels: enhancement

#105 - When I try to do the tensor_parallel on NLLB from meta, there is an error

Issue - State: open - Opened by 342215448 about 1 year ago

#104 - When I try to do the tensor_parallel on NLLB from meta, there is an error:

Issue - State: closed - Opened by 342215448 about 1 year ago - 1 comment
Labels: duplicate

#103 - Gpt2 fix

Pull Request - State: closed - Opened by BlackSamorez about 1 year ago - 1 comment
Labels: bug

#102 - Version bump

Pull Request - State: closed - Opened by BlackSamorez about 1 year ago

#101 - LLaMA-2

Pull Request - State: closed - Opened by BlackSamorez about 1 year ago - 1 comment

#100 - explicitly choose whether or not to use torch.distribute

Pull Request - State: closed - Opened by tomoki0924 about 1 year ago - 2 comments
Labels: enhancement

#99 - GPT-2 broken starting in v1.2.5

Issue - State: closed - Opened by eric-mitchell about 1 year ago - 1 comment

#98 - Issues if GPU > 2

Issue - State: closed - Opened by Tom-Ryder about 1 year ago - 6 comments

#97 - Cloud Tensor_parallel add multiple accelerator inference support with torch.distributed?

Issue - State: closed - Opened by helloaigc about 1 year ago - 4 comments

#96 - Example Question (got error) : Try new 40B LLMs demo in Kaggle

Issue - State: closed - Opened by YooSungHyun about 1 year ago - 2 comments

#95 - why raised cuda error?

Issue - State: closed - Opened by YooSungHyun about 1 year ago - 18 comments

#94 - Possibility to run on different GPUs

Issue - State: closed - Opened by Ch4mpa9ne over 1 year ago - 2 comments

#93 - readme fixes

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#92 - Falcon lm_head split hotfix

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#91 - Falcon predefined config

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#90 - Fixed dispatch of tp.Sharded models

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#89 - TypeError when multi-thread inference using tensor_parallel

Issue - State: closed - Opened by liulhdarks over 1 year ago - 1 comment

#88 - Question on custom models

Issue - State: open - Opened by vince62s over 1 year ago - 23 comments

#87 - Removing PEFT from dependencies. Replacing with runtime checks

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#86 - Does tensor_parallel support the model inference concurrently or in multi-threads?

Issue - State: closed - Opened by zoubaihan over 1 year ago - 2 comments

#85 - Does tensor_parallel support data parallel and tensor parallel hybrid training?

Issue - State: open - Opened by liguodongiot over 1 year ago

#84 - Does tensor_parallel support multi-node tensor parallel training?

Issue - State: open - Opened by liguodongiot over 1 year ago - 5 comments

#83 - Can I parallelize just one large layer?

Issue - State: open - Opened by chinmayjog13 over 1 year ago - 1 comment

#82 - Actually using SplitInsideChunks for gpt2

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#81 - Request to fix the content about parallelformers in README.

Issue - State: closed - Opened by hyunwoongko over 1 year ago - 1 comment

#80 - Support for PEFT LoRA and 4-bit quantization

Issue - State: closed - Opened by morecry over 1 year ago - 6 comments

#79 - Not work with 4bit quant

Issue - State: closed - Opened by laoda513 over 1 year ago - 6 comments
Labels: duplicate

#78 - tp.convert_state_dict readme example fix

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#77 - Error in README.Md, hence not able to load model with limited memory.

Issue - State: closed - Opened by vishakudupa over 1 year ago - 5 comments

#76 - Torch version requirement

Issue - State: closed - Opened by treya-lin over 1 year ago - 4 comments

#75 - Great work！ and can this work with deepspeedzero?

Issue - State: open - Opened by laoda513 over 1 year ago

#74 - Huggingface Accelerate

Issue - State: closed - Opened by conceptofmind over 1 year ago - 1 comment

#73 - State dict fixes for tied weights

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment

#72 - What is the difference between this project and autotp of deepspeed?

Issue - State: closed - Opened by frankxyy over 1 year ago - 1 comment

#71 - cuda memory not evenly distributed between devices

Issue - State: closed - Opened by frankxyy over 1 year ago - 6 comments

#70 - Torch distributed hotfix

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment

#69 - set distributed=True, return AttributeError: 'NoneType' object

Issue - State: closed - Opened by rocketsearch over 1 year ago - 2 comments

#68 - Peft LoRA support

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment

#67 - How to load lora weights？

Issue - State: closed - Opened by Vincent131499 over 1 year ago - 13 comments

#66 - Slow inference performance for large Llama models compared to naive MP

Issue - State: open - Opened by sgsdxzy over 1 year ago - 26 comments

#65 - Mention linear speedup in Readme

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#64 - Set seed for tests reproducibility

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment

#63 - Small readme patch

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#62 - Added int8 LLMs demo link

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#61 - CodeGen config

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#60 - Converting state dicts without model creation

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 3 comments

#59 - New version for dispatch hotfix

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#58 - Shard parameters initial dispatch fix

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#57 - Unpersistent buffers meta loading fix

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment

#56 - _reorder_cache fix for generation utils

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#55 - GPT NeoX config

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago
Labels: enhancement

#54 - Meta devices support

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment
Labels: enhancement

#53 - LLaMa models

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago
Labels: enhancement

#52 - Removing accelerate hooks before splitting the model

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#51 - Support LLaMA Models, including HuggingFace-adapted variants

Issue - State: closed - Opened by zoidbb over 1 year ago - 7 comments
Labels: bug, enhancement

#50 - Version update

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#49 - Saving utilities

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment

#48 - How to use trained models?

Issue - State: closed - Opened by Den4ikAI over 1 year ago - 3 comments

#47 - Adding support for more model architectures

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 2 comments

#46 - False negative test results for test_convs. Flaky test

Issue - State: closed - Opened by BlackSamorez over 1 year ago - 1 comment

#45 - Add more predefined configs

Issue - State: open - Opened by BlackSamorez over 1 year ago
Labels: enhancement, good first issue

#44 - Replace architecture with model_type

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#43 - Config refactoring

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago - 4 comments

#42 - New ideas

Issue - State: open - Opened by aizamaksutova over 1 year ago

#41 - GPU Contention

Issue - State: open - Opened by aizamaksutova over 1 year ago

#40 - Fixed PyPi link in readme

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#39 - Version update

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

#38 - _TensorParallelWrapper attribute forwarding

Pull Request - State: closed - Opened by BlackSamorez over 1 year ago

GitHub / BlackSamorez/tensor_parallel issues and pull requests