Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / facebookresearch/LASER issues and pull requests
#283 - weights_only=True warning from pytorch
Issue -
State: open - Opened by bbetters 13 days ago
#282 - Fix join of base url for windows machines
Pull Request -
State: closed - Opened by TheHappyLemon 5 months ago
- 5 comments
Labels: CLA Signed
#281 - Similarity calculation takes too long with LASER3 model, compared to XLM-R/LaBSE
Issue -
State: closed - Opened by aloka-fernando 6 months ago
- 1 comment
#280 - Unable to import LaserEncoderPipeline
Issue -
State: closed - Opened by sumedhan-r 6 months ago
- 10 comments
#280 - Unable to import LaserEncoderPipeline
Issue -
State: closed - Opened by sumedhan-r 6 months ago
- 10 comments
#279 - Can't download: 403 error on some CC segments.
Issue -
State: open - Opened by enn-nafnlaus 8 months ago
- 1 comment
#278 - How can I parallelize LASER embeddings?
Issue -
State: closed - Opened by prasunshrestha 8 months ago
- 3 comments
#277 - do it support qwen model?
Issue -
State: closed - Opened by xxm1668 9 months ago
- 1 comment
#276 - Sentiment Analysis Tutorial using laser
Pull Request -
State: closed - Opened by NIXBLACK11 10 months ago
Labels: CLA Signed
#275 - Add laser clustering example.
Pull Request -
State: closed - Opened by Paulooh007 10 months ago
- 2 comments
Labels: CLA Signed
#274 - Sentiment analysis laser
Pull Request -
State: closed - Opened by NIXBLACK11 10 months ago
- 1 comment
Labels: CLA Signed
#273 - Add 2-letter codes to the `laser_encoders` language list
Issue -
State: open - Opened by avidale 10 months ago
Labels: enhancement, MLH
#272 - Support for saving embeddings bin file in laser_encoder
Issue -
State: closed - Opened by vmenan 10 months ago
- 4 comments
#271 - Create a tutorial showing how LASER can be applied for downstream practical tasks (e.g. multilingual sentiment analysis)
Issue -
State: closed - Opened by avidale 10 months ago
- 1 comment
Labels: enhancement, MLH
#270 - Publish the performance of LASER2 and LASER3 encoders on each of the FLORES-200 languages
Issue -
State: open - Opened by avidale 10 months ago
Labels: enhancement
#269 - Update language_list.py
Pull Request -
State: closed - Opened by NIXBLACK11 11 months ago
- 4 comments
Labels: CLA Signed
#268 - Ensure `laser_encoders` has parity with existing LASER inference code for release
Pull Request -
State: closed - Opened by heffernankevin 11 months ago
Labels: CLA Signed
#267 - Reorganize the lists of languages
Issue -
State: closed - Opened by avidale 11 months ago
Labels: enhancement, MLH
#266 - Update the main README file with a mention of `laser_encoders`
Pull Request -
State: closed - Opened by avidale 11 months ago
Labels: CLA Signed
#265 - Added Contributers in the readme.
Pull Request -
State: closed - Opened by NIXBLACK11 11 months ago
Labels: CLA Signed
#264 - Decrease versions of numpy and torch required by laser-encoders
Pull Request -
State: closed - Opened by Paulooh007 11 months ago
Labels: CLA Signed
#263 - Try decreasing the minimal versions of numpy and torch required by laser-encoders
Issue -
State: closed - Opened by avidale 11 months ago
Labels: enhancement, MLH
#262 - Enhance LaserTokenizer with Perl Parity, Optional Punctuation Normalization, and Embedding Normalization
Pull Request -
State: closed - Opened by Paulooh007 11 months ago
- 2 comments
Labels: CLA Signed
#261 - Suggestion: add normalization option in the
Issue -
State: closed - Opened by avidale 11 months ago
Labels: enhancement, MLH
#260 - Suggestion: allow turning off punctuation normalization
Issue -
State: closed - Opened by avidale 11 months ago
Labels: enhancement, MLH
#259 - An error initializing English pipeline (on the MLH-dev branch)
Issue -
State: closed - Opened by avidale 11 months ago
Labels: bug, MLH
#258 - Extend Tokenizer to Support Single Strings and Lists of Strings
Pull Request -
State: closed - Opened by Paulooh007 11 months ago
Labels: CLA Signed
#257 - Adding Language Validation Test
Pull Request -
State: closed - Opened by NIXBLACK11 11 months ago
- 4 comments
Labels: CLA Signed
#256 - Refactor `initialize_encoder` to `LaserEncoderPipeline`
Pull Request -
State: closed - Opened by Paulooh007 12 months ago
Labels: CLA Signed
#255 - Downloading models and tokenizers does not work on Windows
Issue -
State: closed - Opened by avidale 12 months ago
- 1 comment
Labels: bug, MLH
#254 - Update laser_encoders README
Pull Request -
State: closed - Opened by Paulooh007 12 months ago
- 2 comments
Labels: CLA Signed
#253 - Handle Interrupted Model Weight Downloads
Pull Request -
State: closed - Opened by Paulooh007 12 months ago
- 3 comments
Labels: CLA Signed
#252 - Sacremoses install script
Pull Request -
State: closed - Opened by NIXBLACK11 12 months ago
- 2 comments
Labels: CLA Signed
#251 - Fix outdated Dockerfile and Flask app
Pull Request -
State: closed - Opened by Paulooh007 almost 1 year ago
- 2 comments
Labels: CLA Signed
#250 - SPM models not available
Issue -
State: closed - Opened by tbst730 about 1 year ago
- 2 comments
#249 - MLH fellowship contribution: adding the `laser_encoders` module
Pull Request -
State: closed - Opened by avidale about 1 year ago
- 1 comment
Labels: CLA Signed
#248 - refactor: modified the sentence encoder to tokenize a text before encoding it
Pull Request -
State: closed - Opened by CaptainVee about 1 year ago
Labels: CLA Signed
#247 - LASER3 encoders for English? (and other high res langs)
Issue -
State: closed - Opened by gordicaleksa about 1 year ago
- 2 comments
#246 - docs: Readme documentation for the laser_encoder package
Pull Request -
State: closed - Opened by CaptainVee about 1 year ago
Labels: CLA Signed
#245 - The included Dockerfile and associated Flask app is out of date in multiple ways.
Issue -
State: closed - Opened by andy-weinstein about 1 year ago
- 2 comments
#244 - feat: Add Python function to download LASER models
Pull Request -
State: closed - Opened by CaptainVee about 1 year ago
- 1 comment
Labels: CLA Signed
#243 - Change default tokenizer for sentence embeddings
Issue -
State: closed - Opened by julianpollmann about 1 year ago
- 1 comment
#242 - Containerized Application, Corrupted file for bilstm.93langs.2018-12-26.pt
Issue -
State: closed - Opened by celcof about 1 year ago
- 2 comments
#241 - Refactor embedder
Pull Request -
State: closed - Opened by CaptainVee about 1 year ago
Labels: CLA Signed
#240 - Bug Report: Issue when embedding in docker container
Issue -
State: closed - Opened by FavorMylikes about 1 year ago
- 1 comment
#239 - feat: make LASER pip installable
Pull Request -
State: closed - Opened by CaptainVee about 1 year ago
Labels: CLA Signed
#238 - feat: converted SPMapply function to use python script
Pull Request -
State: closed - Opened by CaptainVee about 1 year ago
- 1 comment
Labels: CLA Signed
#237 - Python interface for downloading and choosing models
Issue -
State: closed - Opened by avidale over 1 year ago
- 1 comment
#236 - Port LASER-2,3 text preprocessing into Python
Issue -
State: closed - Opened by avidale over 1 year ago
- 1 comment
#235 - Make LASER pip-installable
Issue -
State: closed - Opened by avidale over 1 year ago
- 1 comment
#234 - Laser v2, set Language to compute Embeddings on
Issue -
State: closed - Opened by celcof over 1 year ago
- 3 comments
#233 - CCMatrix Data download
Issue -
State: closed - Opened by yangyang0202 over 1 year ago
- 3 comments
#232 - Fix #231 Embed Sents Path parameter
Pull Request -
State: closed - Opened by julianpollmann over 1 year ago
Labels: CLA Signed
#231 - embed_sentences Path parameters
Issue -
State: closed - Opened by julianpollmann over 1 year ago
- 1 comment
#230 - mine_bitexts.py: Zero division RuntimeWarning with margin "ratio"
Issue -
State: closed - Opened by OrianeN over 1 year ago
- 2 comments
#229 - Using LASER 2 and LASER 3 to filter good quality sentence pairs from a webcrawled parallel dataset
Issue -
State: closed - Opened by vmenan over 1 year ago
- 8 comments
#228 - Problem with wet_lines
Issue -
State: closed - Opened by vmenan over 1 year ago
- 4 comments
#227 - Is there a limit of sentence length for LASER?
Issue -
State: closed - Opened by maohbao over 1 year ago
- 2 comments
#226 - Text preprocessing before embedding
Issue -
State: closed - Opened by mrkcdl over 1 year ago
- 2 comments
#225 - Clarification for Chinese language variants
Issue -
State: closed - Opened by raunaksinhacisco over 1 year ago
- 1 comment
#224 - how to get the sentence embedding using Librivox S2S?
Issue -
State: closed - Opened by zhhao1 over 1 year ago
- 2 comments
#223 - Laser support for non-official data
Issue -
State: closed - Opened by joshdehlong over 1 year ago
- 2 comments
#222 - spm_encode: No such file or directory
Issue -
State: closed - Opened by zzzul9 over 1 year ago
- 9 comments
#221 - Sentence Embedding Reads No Input
Issue -
State: closed - Opened by prasunshrestha over 1 year ago
- 2 comments
#220 - LASER2 vocab contains upper case characters
Issue -
State: closed - Opened by ZJaume over 1 year ago
- 3 comments
#219 - Add requirements.txt with dependencies
Pull Request -
State: open - Opened by jordimas almost 2 years ago
- 4 comments
Labels: CLA Signed
#218 - add padded indices in max_token count in embed.py
Pull Request -
State: open - Opened by jasonmusespresso almost 2 years ago
Labels: CLA Signed
#217 - requirements.txt with compatible third-party versions
Issue -
State: closed - Opened by senisioi almost 2 years ago
- 3 comments
#216 - LASER 3 eng_Latn encoder model checkpoint link is broken
Issue -
State: closed - Opened by jaygala24 almost 2 years ago
- 2 comments
#215 - Embedding tasks stop at pre-processing phase.
Issue -
State: closed - Opened by Vietdung113 about 2 years ago
- 5 comments
#214 - error occurs when using my own trained LASER model
Issue -
State: closed - Opened by weitaizhang about 2 years ago
- 4 comments
#213 - embed task stops at preprocessing
Issue -
State: closed - Opened by ruoyuxie about 2 years ago
- 2 comments
#212 - when run bash ./eval.sh & python eval.py & Call functions embed_sentences show error // spm_encode:No such file or directory
Issue -
State: closed - Opened by dcmouth about 2 years ago
- 38 comments
#211 - embed models
Issue -
State: closed - Opened by guangyuli-uoe about 2 years ago
- 10 comments
#210 - Segmentation fault
Issue -
State: closed - Opened by guangyuli-uoe about 2 years ago
- 1 comment
#209 - .enc and .candidates.tsv
Issue -
State: closed - Opened by guangyuli-uoe about 2 years ago
- 3 comments
#208 - cannot install sentence-transformers
Issue -
State: closed - Opened by guangyuli-uoe about 2 years ago
- 3 comments
#207 - two errors in embed.py and one error in indexing.py
Issue -
State: closed - Opened by guangyuli-uoe about 2 years ago
- 1 comment
#206 - Update README.md
Pull Request -
State: closed - Opened by Harrry538 about 2 years ago
- 3 comments
Labels: CLA Signed
#205 - cannot install fastbpe
Issue -
State: closed - Opened by guangyuli-uoe about 2 years ago
- 7 comments
#204 - include text data in NLLB200 dataset
Issue -
State: closed - Opened by huseinzol05 about 2 years ago
- 1 comment
#203 - Embedder is not working with LASER3.
Issue -
State: closed - Opened by mahendra-gehlot about 2 years ago
- 2 comments
#202 - How to set everything up for a translation from a given language ex: Albanian to English
Issue -
State: closed - Opened by fatjoni about 2 years ago
- 1 comment
#201 - Question related to paper cc-matrix + code
Issue -
State: closed - Opened by vince62s about 2 years ago
- 1 comment
#200 - Sentences Couldn't Be Embedded by `embed.sh` (Encoder: 0 sentences in 0s)
Issue -
State: closed - Opened by ShunchiZhang about 2 years ago
- 5 comments
#199 - embed README: note about wmt22 model download
Pull Request -
State: closed - Opened by patrick-wilken over 2 years ago
- 2 comments
Labels: CLA Signed
#198 - mine_bitexts results
Issue -
State: closed - Opened by vince62s over 2 years ago
- 7 comments
#196 - Finetune LASER
Issue -
State: closed - Opened by mersad-esalati over 2 years ago
- 4 comments
#195 - https://data.statmt.org/cc-matrix/ is down?
Issue -
State: closed - Opened by alirezamshi-zz over 2 years ago
- 2 comments
#192 - [Help Wanted] Generate LASER embeddings for a large number of sentences (15.7 million)
Issue -
State: open - Opened by NomadXD about 3 years ago
- 1 comment
#189 - pip3 install cc_net not working
Issue -
State: closed - Opened by BeaLove over 3 years ago
- 3 comments
#188 - Docker API not handling batch requests
Issue -
State: open - Opened by vintrocode over 3 years ago
- 2 comments
#187 - Utility of the lower_case parameter in the TokenLine function
Issue -
State: closed - Opened by codedecde over 3 years ago
- 1 comment
#186 - fastBPE installation failure
Issue -
State: closed - Opened by parthe over 3 years ago
- 1 comment
#171 - failed to download tokenizer/detokenizer.perl & normalize-punctuation.perl
Issue -
State: closed - Opened by dcmouth over 3 years ago
- 2 comments
#169 - How is language id embedding is learned?
Issue -
State: open - Opened by mani-rai almost 4 years ago
- 2 comments
#168 - Can you please upload English Nepali model as well?
Issue -
State: closed - Opened by mani-rai almost 4 years ago
- 2 comments
#166 - joint multilingual space min/max dimensions
Issue -
State: closed - Opened by curryprogrammer almost 4 years ago
- 1 comment
#161 - similarity/wmt.sh example does not work
Issue -
State: closed - Opened by rabeehk almost 4 years ago
- 3 comments
#159 - added multi-core funtionality
Pull Request -
State: closed - Opened by vigneshmj1997 almost 4 years ago
- 3 comments
Labels: CLA Signed