Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / microsoft/SpeechT5 issues and pull requests
#91 - gaokao_audio can not be download? something error
Issue -
State: open - Opened by liyunlongaaa 2 days ago
- 1 comment
#90 - Why can WavLLM understand audio sounds as well?
Issue -
State: open - Opened by BenoitWang 13 days ago
- 1 comment
#89 - Setup Error about WavLLM
Issue -
State: open - Opened by StupidDebugger 17 days ago
- 4 comments
#88 - Request for Assistance with VATLM Implementation: Accessing Wave2Vec Model File
Issue -
State: open - Opened by yeonju7kim 20 days ago
#87 - I found minor typo in Readme
Issue -
State: open - Opened by yeonju7kim 23 days ago
#86 - Please fix the broken download link!!! So many models cann't be used without checkpoint.
Issue -
State: open - Opened by world1tree about 1 month ago
#85 - How to fine-tune SpeechT5 HifiGAN vocoder?
Issue -
State: open - Opened by yukiarimo about 2 months ago
#84 - soundfile.LibsndfileError: <exception str() failed>
Issue -
State: closed - Opened by ciwei6107563 about 2 months ago
#83 - Unable to Download wavLLM Due to Error
Issue -
State: open - Opened by minkyu119 about 2 months ago
- 1 comment
#82 - What languages are supported? How to specify a language?
Issue -
State: open - Opened by secsilm 3 months ago
#81 - SpeechUT does not have a link for download
Issue -
State: open - Opened by world1tree 4 months ago
- 2 comments
#80 - What's the model_path and data_name on inference code?
Issue -
State: open - Opened by YepJin 4 months ago
- 1 comment
#79 - Confusion/Question about SpeechT5SpeechDecoderPostnet output
Issue -
State: open - Opened by Student204161 5 months ago
#78 - Error in loading WavLLM model
Issue -
State: open - Opened by rishabh004-ai 5 months ago
- 9 comments
#77 - Single Task Training
Issue -
State: closed - Opened by yangjiabupt 5 months ago
- 1 comment
#76 - WavLLM checkpoint
Issue -
State: open - Opened by ming024 5 months ago
- 5 comments
#75 - ASR fine-tuning loss goes to zero after several epochs
Issue -
State: closed - Opened by yunigma 5 months ago
- 2 comments
#74 - extract transorformer layer feature
Issue -
State: open - Opened by zbpjlc 7 months ago
- 2 comments
#73 - Does the pre-trained model for hidden unit tokenizer use speaker embeddings?
Issue -
State: open - Opened by Kodhandarama 7 months ago
#72 - What is the time taken to converge for the hidden unit tokenizer?
Issue -
State: open - Opened by Kodhandarama 7 months ago
#71 - Link to train_960.tsv is broken
Issue -
State: open - Opened by Kodhandarama 8 months ago
#70 - "SpeechT5" on Android OS
Issue -
State: open - Opened by taeyeonlee 8 months ago
#69 - British English TTS model
Issue -
State: closed - Opened by omega3 8 months ago
- 1 comment
#68 - Text feature extraction using SpeechLM
Issue -
State: open - Opened by wonjune-kang 9 months ago
#67 - Baseline implementation
Issue -
State: open - Opened by ussenuk 10 months ago
- 1 comment
#66 - How to setting language when do S2T
Issue -
State: open - Opened by nhha1602 10 months ago
- 1 comment
#65 - 是否支持中文转语音?
Issue -
State: open - Opened by xxm1668 11 months ago
- 4 comments
#64 - The size of tensor a (674) must match the size of tensor b (600) at non-singleton dimension 1
Issue -
State: open - Opened by poojitharamachandra 11 months ago
- 1 comment
#63 - SpeechT5 - TTS - Tokenizer adding `▁` token between newly added Vietnamese characters
Issue -
State: closed - Opened by GinUTE 12 months ago
- 1 comment
#62 - ASR SpeechT5 training - model predicts same output for different inputs
Issue -
State: open - Opened by L7uan 12 months ago
- 1 comment
#61 - Is end-to-end S2ST possible with Speecht5?
Issue -
State: open - Opened by elia-ashraf about 1 year ago
#60 - Generate the N-best (top few) hypotheses
Issue -
State: open - Opened by cyfer0618 about 1 year ago
#59 - Reproduce ASR experiment results in Hugging Face
Issue -
State: closed - Opened by jjyaoao about 1 year ago
#58 - Voice Conversion - Error with Some Mono, 16kHz, 16bit Audio
Issue -
State: open - Opened by fabiocat93 about 1 year ago
- 3 comments
#57 - Getting TTS output voice close to the training data - Finetuning on different language
Issue -
State: open - Opened by Srija616 about 1 year ago
- 2 comments
#56 - pretrain loss
Issue -
State: open - Opened by MarsMeng1994 about 1 year ago
- 4 comments
#55 - Bump scipy from 1.5.4 to 1.10.0 in /VATLM/vat_hubert
Pull Request -
State: open - Opened by dependabot[bot] about 1 year ago
Labels: dependencies
#54 - VATLM: Error when loading finetuned checkpoints for infer_s2s
Issue -
State: open - Opened by naraysa over 1 year ago
#52 - SpeechUT inference error in en_fr checkpoint
Issue -
State: open - Opened by ytf-philp over 1 year ago
- 1 comment
#51 - Using SpeechT5 Large for TTS
Issue -
State: open - Opened by imranmaj over 1 year ago
#50 - SpeechT5: extracting Chinese speaker embedding
Issue -
State: open - Opened by QQ-777777 over 1 year ago
- 6 comments
#49 - SpeechT5-tts fine-tuned on Chinese
Issue -
State: open - Opened by qlmbeck over 1 year ago
- 4 comments
#48 - add link to Hugging Face fine-tuning example
Pull Request -
State: closed - Opened by hollance over 1 year ago
- 1 comment
#47 - The link for Prosody-SpeechT5 in the Readme is dead/404
Issue -
State: closed - Opened by svantana over 1 year ago
- 2 comments
#46 - SpeechLM
Issue -
State: closed - Opened by blueblue-bubble over 1 year ago
- 2 comments
#45 - SpeechT5:how much epoch is set
Issue -
State: closed - Opened by QQ-777777 over 1 year ago
- 5 comments
#43 - how to pause between two words ?
Issue -
State: open - Opened by hulk10425 over 1 year ago
- 2 comments
#42 - how to fine tune sid on pretrained model?
Issue -
State: closed - Opened by haha010508 over 1 year ago
- 11 comments
#41 - hydra fine-tunning for speechT5?
Issue -
State: open - Opened by ramonsanabria over 1 year ago
#40 - [SpeechLM] About phoneme tokenizer in detail?
Issue -
State: closed - Opened by yuseungwoo over 1 year ago
- 1 comment
#39 - reproduction steps for inference
Issue -
State: open - Opened by ghost over 1 year ago
- 2 comments
#38 - Pretrain SpeechT5 on my own dataset
Issue -
State: closed - Opened by hungker over 1 year ago
- 3 comments
#37 - Missing speecht5 task
Issue -
State: closed - Opened by maximerenou over 1 year ago
- 1 comment
#36 - SpeechT5 Speech Enhancement
Issue -
State: open - Opened by avramandrei over 1 year ago
- 2 comments
#35 - Fine-tunning on Hugging Face
Issue -
State: open - Opened by ramonsanabria over 1 year ago
- 1 comment
#34 - SpeechUT inference and fine-tune problem
Issue -
State: closed - Opened by ytf-philp over 1 year ago
- 3 comments
#33 - add Hugging Face links
Pull Request -
State: closed - Opened by hollance over 1 year ago
- 2 comments
#32 - add SID in SpeechT5
Pull Request -
State: closed - Opened by mechanicalsea over 1 year ago
- 1 comment
#31 - SpeechT5: Finetuned SID model
Issue -
State: closed - Opened by entn-at over 1 year ago
- 2 comments
#30 - SpeechT5 pretrain
Issue -
State: open - Opened by benyang0506 over 1 year ago
- 5 comments
#29 - About the SpeechT5 pre-training curve
Issue -
State: closed - Opened by benyang0506 over 1 year ago
- 4 comments
#28 - SpeechT5 Pretrain ERROR
Issue -
State: closed - Opened by benyang0506 over 1 year ago
- 1 comment
#27 - Whether fp16 is enabled in VATLM during pre-training
Issue -
State: closed - Opened by xiabingquan almost 2 years ago
- 2 comments
#26 - SpeechLM:KeyError: 'text_transformer' while initing the SpeechLMConfig
Issue -
State: closed - Opened by JunZhan2000 almost 2 years ago
- 2 comments
#25 - VATLM: ModuleNotFoundError: No module named 'fairseq.data.audio.multi_corpus_dataset_audio'
Issue -
State: closed - Opened by xiabingquan almost 2 years ago
- 6 comments
#24 - Same benchmark, same architecture, but the WER is differenet, why?
Issue -
State: closed - Opened by splinter21 almost 2 years ago
- 2 comments
#23 - SpeechLM: How to train 'Phone-unit tokenizer for speech' using kaldi?
Issue -
State: closed - Opened by YWMditto almost 2 years ago
- 7 comments
#22 - Speech2C "Inf detected in output" while training
Issue -
State: closed - Opened by Sreyan88 almost 2 years ago
- 4 comments
#21 - Speech2C training error
Issue -
State: closed - Opened by Sreyan88 almost 2 years ago
- 6 comments
#20 - Missing SPM and Vocabulary files
Issue -
State: closed - Opened by sumanthd17 almost 2 years ago
- 2 comments
#19 - Port to Huggingface
Issue -
State: closed - Opened by StephennFernandes almost 2 years ago
- 1 comment
#18 - SpeechLM: How to resample phonemes' frame rate from 30ms to 20ms?
Issue -
State: closed - Opened by Arrivederci almost 2 years ago
- 3 comments
#17 - SpeechLM: how to prepare phoneme sequence for T2U generator
Issue -
State: closed - Opened by cwang621 almost 2 years ago
- 5 comments
#16 - SpeechT5: How to get speaker embeddings ?
Issue -
State: closed - Opened by Arrivederci almost 2 years ago
- 12 comments
#15 - Example values for finetuning asr
Issue -
State: closed - Opened by YWMditto almost 2 years ago
- 18 comments
#14 - Sample Rates are different between speech pre-training dataset and tts dataset
Issue -
State: closed - Opened by Maggione about 2 years ago
- 1 comment
#13 - Combining speech and text in the encoder
Issue -
State: closed - Opened by jacqle about 2 years ago
- 1 comment
#12 - Can you provide a voice conversion finetune recipe?
Issue -
State: closed - Opened by hpjang about 2 years ago
- 2 comments
#11 - This repo is missing important files
Issue -
State: closed - Opened by microsoft-github-policy-service[bot] about 2 years ago
#10 - Adding Microsoft SECURITY.MD
Pull Request -
State: closed - Opened by microsoft-github-policy-service[bot] about 2 years ago
#9 - Text data preparation
Issue -
State: closed - Opened by tskim9439 about 2 years ago
- 3 comments
#8 - No code for Speech Synthesis
Issue -
State: closed - Opened by petervickers about 2 years ago
- 4 comments
#7 - ArgumentError in SpeechT5Task.add_args() when running fairseq-generate
Issue -
State: closed - Opened by busukxuan about 2 years ago
- 1 comment
#6 - Does the quantizer is used when fine-tune the pretrained backbone for the downstream task ?
Issue -
State: closed - Opened by zhhao1 about 2 years ago
- 2 comments
#5 - Difficulties loading pre-trained weights!
Issue -
State: closed - Opened by sanchit-gandhi over 2 years ago
- 2 comments
#4 - Missing text_to_speech_dataset.py in speecht5/data
Issue -
State: closed - Opened by ayushtues over 2 years ago
- 1 comment
#3 - How to load the pretrained models in pytorch
Issue -
State: closed - Opened by ayushtues over 2 years ago
- 5 comments
#2 - Are you planning to open source the configuration of the baselines and downstream tasks?
Issue -
State: closed - Opened by Maggione over 2 years ago
- 1 comment
#1 - how to pre-train on a custom dataset ?
Issue -
State: closed - Opened by StephennFernandes over 2 years ago
- 16 comments