Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / microsoft/SpeechT5 issues and pull requests

#91 - gaokao_audio cannot be downloaded? Some error occurs

Issue - State: open - Opened by liyunlongaaa 2 days ago - 1 comment

#90 - Why can WavLLM understand audio sounds as well?

Issue - State: open - Opened by BenoitWang 13 days ago - 1 comment

#89 - Setup Error about WavLLM

Issue - State: open - Opened by StupidDebugger 17 days ago - 4 comments

#87 - I found minor typo in Readme

Issue - State: open - Opened by yeonju7kim 23 days ago

#85 - How to fine-tune SpeechT5 HifiGAN vocoder?

Issue - State: open - Opened by yukiarimo about 2 months ago

#84 - soundfile.LibsndfileError: <exception str() failed>

Issue - State: closed - Opened by ciwei6107563 about 2 months ago

#83 - Unable to Download wavLLM Due to Error

Issue - State: open - Opened by minkyu119 about 2 months ago - 1 comment

#81 - SpeechUT does not have a link for download

Issue - State: open - Opened by world1tree 4 months ago - 2 comments

#80 - What's the model_path and data_name on inference code?

Issue - State: open - Opened by YepJin 4 months ago - 1 comment

#78 - Error in loading WavLLM model

Issue - State: open - Opened by rishabh004-ai 5 months ago - 9 comments

#77 - Single Task Training

Issue - State: closed - Opened by yangjiabupt 5 months ago - 1 comment

#76 - WavLLM checkpoint

Issue - State: open - Opened by ming024 5 months ago - 5 comments

#75 - ASR fine-tuning loss goes to zero after several epochs

Issue - State: closed - Opened by yunigma 5 months ago - 2 comments

#74 - extract transformer layer feature

Issue - State: open - Opened by zbpjlc 7 months ago - 2 comments

#71 - Link to train_960.tsv is broken

Issue - State: open - Opened by Kodhandarama 8 months ago

#70 - "SpeechT5" on Android OS

Issue - State: open - Opened by taeyeonlee 8 months ago

#69 - British English TTS model

Issue - State: closed - Opened by omega3 8 months ago - 1 comment

#68 - Text feature extraction using SpeechLM

Issue - State: open - Opened by wonjune-kang 9 months ago

#67 - Baseline implementation

Issue - State: open - Opened by ussenuk 10 months ago - 1 comment

#66 - How to set the language when doing S2T

Issue - State: open - Opened by nhha1602 10 months ago - 1 comment

#65 - Is Chinese text-to-speech supported?

Issue - State: open - Opened by xxm1668 11 months ago - 4 comments

#62 - ASR SpeechT5 training - model predicts same output for different inputs

Issue - State: open - Opened by L7uan 12 months ago - 1 comment

#61 - Is end-to-end S2ST possible with Speecht5?

Issue - State: open - Opened by elia-ashraf about 1 year ago

#60 - Generate the N-best (top few) hypotheses

Issue - State: open - Opened by cyfer0618 about 1 year ago

#59 - Reproduce ASR experiment results in Hugging Face

Issue - State: closed - Opened by jjyaoao about 1 year ago

#58 - Voice Conversion - Error with Some Mono, 16kHz, 16bit Audio

Issue - State: open - Opened by fabiocat93 about 1 year ago - 3 comments

#56 - pretrain loss

Issue - State: open - Opened by MarsMeng1994 about 1 year ago - 4 comments

#55 - Bump scipy from 1.5.4 to 1.10.0 in /VATLM/vat_hubert

Pull Request - State: open - Opened by dependabot[bot] about 1 year ago
Labels: dependencies

#52 - SpeechUT inference error in en_fr checkpoint

Issue - State: open - Opened by ytf-philp over 1 year ago - 1 comment

#51 - Using SpeechT5 Large for TTS

Issue - State: open - Opened by imranmaj over 1 year ago

#50 - SpeechT5: extracting Chinese speaker embedding

Issue - State: open - Opened by QQ-777777 over 1 year ago - 6 comments

#49 - SpeechT5-tts fine-tuned on Chinese

Issue - State: open - Opened by qlmbeck over 1 year ago - 4 comments

#48 - add link to Hugging Face fine-tuning example

Pull Request - State: closed - Opened by hollance over 1 year ago - 1 comment

#47 - The link for Prosody-SpeechT5 in the Readme is dead/404

Issue - State: closed - Opened by svantana over 1 year ago - 2 comments

#46 - SpeechLM

Issue - State: closed - Opened by blueblue-bubble over 1 year ago - 2 comments

#45 - SpeechT5: how many epochs are set

Issue - State: closed - Opened by QQ-777777 over 1 year ago - 5 comments

#43 - how to pause between two words ?

Issue - State: open - Opened by hulk10425 over 1 year ago - 2 comments

#42 - how to fine tune sid on pretrained model?

Issue - State: closed - Opened by haha010508 over 1 year ago - 11 comments

#41 - hydra fine-tuning for speechT5?

Issue - State: open - Opened by ramonsanabria over 1 year ago

#40 - [SpeechLM] About phoneme tokenizer in detail?

Issue - State: closed - Opened by yuseungwoo over 1 year ago - 1 comment

#39 - reproduction steps for inference

Issue - State: open - Opened by ghost over 1 year ago - 2 comments

#38 - Pretrain SpeechT5 on my own dataset

Issue - State: closed - Opened by hungker over 1 year ago - 3 comments

#37 - Missing speecht5 task

Issue - State: closed - Opened by maximerenou over 1 year ago - 1 comment

#36 - SpeechT5 Speech Enhancement

Issue - State: open - Opened by avramandrei over 1 year ago - 2 comments

#35 - Fine-tuning on Hugging Face

Issue - State: open - Opened by ramonsanabria over 1 year ago - 1 comment

#34 - SpeechUT inference and fine-tune problem

Issue - State: closed - Opened by ytf-philp over 1 year ago - 3 comments

#33 - add Hugging Face links

Pull Request - State: closed - Opened by hollance over 1 year ago - 2 comments

#32 - add SID in SpeechT5

Pull Request - State: closed - Opened by mechanicalsea over 1 year ago - 1 comment

#31 - SpeechT5: Finetuned SID model

Issue - State: closed - Opened by entn-at over 1 year ago - 2 comments

#30 - SpeechT5 pretrain

Issue - State: open - Opened by benyang0506 over 1 year ago - 5 comments

#29 - About the SpeechT5 pre-training curve

Issue - State: closed - Opened by benyang0506 over 1 year ago - 4 comments

#28 - SpeechT5 Pretrain ERROR

Issue - State: closed - Opened by benyang0506 over 1 year ago - 1 comment

#27 - Whether fp16 is enabled in VATLM during pre-training

Issue - State: closed - Opened by xiabingquan almost 2 years ago - 2 comments

#26 - SpeechLM:KeyError: 'text_transformer' while initing the SpeechLMConfig

Issue - State: closed - Opened by JunZhan2000 almost 2 years ago - 2 comments

#24 - Same benchmark, same architecture, but the WER is different, why?

Issue - State: closed - Opened by splinter21 almost 2 years ago - 2 comments

#23 - SpeechLM: How to train 'Phone-unit tokenizer for speech' using kaldi?

Issue - State: closed - Opened by YWMditto almost 2 years ago - 7 comments

#22 - Speech2C "Inf detected in output" while training

Issue - State: closed - Opened by Sreyan88 almost 2 years ago - 4 comments

#21 - Speech2C training error

Issue - State: closed - Opened by Sreyan88 almost 2 years ago - 6 comments

#20 - Missing SPM and Vocabulary files

Issue - State: closed - Opened by sumanthd17 almost 2 years ago - 2 comments

#19 - Port to Huggingface

Issue - State: closed - Opened by StephennFernandes almost 2 years ago - 1 comment

#18 - SpeechLM: How to resample phonemes' frame rate from 30ms to 20ms?

Issue - State: closed - Opened by Arrivederci almost 2 years ago - 3 comments

#17 - SpeechLM: how to prepare phoneme sequence for T2U generator

Issue - State: closed - Opened by cwang621 almost 2 years ago - 5 comments

#16 - SpeechT5: How to get speaker embeddings ?

Issue - State: closed - Opened by Arrivederci almost 2 years ago - 12 comments

#15 - Example values for finetuning asr

Issue - State: closed - Opened by YWMditto almost 2 years ago - 18 comments

#14 - Sample Rates are different between speech pre-training dataset and tts dataset

Issue - State: closed - Opened by Maggione about 2 years ago - 1 comment

#13 - Combining speech and text in the encoder

Issue - State: closed - Opened by jacqle about 2 years ago - 1 comment

#12 - Can you provide a voice conversion finetune recipe?

Issue - State: closed - Opened by hpjang about 2 years ago - 2 comments

#10 - Adding Microsoft SECURITY.MD

Pull Request - State: closed - Opened by microsoft-github-policy-service[bot] about 2 years ago

#9 - Text data preparation

Issue - State: closed - Opened by tskim9439 about 2 years ago - 3 comments

#8 - No code for Speech Synthesis

Issue - State: closed - Opened by petervickers about 2 years ago - 4 comments

#7 - ArgumentError in SpeechT5Task.add_args() when running fairseq-generate

Issue - State: closed - Opened by busukxuan about 2 years ago - 1 comment

#5 - Difficulties loading pre-trained weights!

Issue - State: closed - Opened by sanchit-gandhi over 2 years ago - 2 comments

#4 - Missing text_to_speech_dataset.py in speecht5/data

Issue - State: closed - Opened by ayushtues over 2 years ago - 1 comment

#3 - How to load the pretrained models in pytorch

Issue - State: closed - Opened by ayushtues over 2 years ago - 5 comments

#1 - how to pre-train on a custom dataset ?

Issue - State: closed - Opened by StephennFernandes over 2 years ago - 16 comments