Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / Helsinki-NLP/OpusTools issues and pull requests

#43 - Fix opusfilter interface

Pull Request - State: closed - Opened by svirpioj about 2 months ago

#42 - Problems in OpusRead interface with moses preprocessing

Issue - State: closed - Opened by svirpioj 2 months ago

#41 - Cannot download resource due to `DH_KEY_TOO_SMALL`

Issue - State: open - Opened by cgbahk 8 months ago

#40 - change in OPUS yaml files

Issue - State: closed - Opened by jorgtied about 1 year ago - 1 comment
Labels: bug

#39 - Spaces before punctuation marks on opus_read output

Issue - State: open - Opened by keith555 over 1 year ago - 1 comment

#38 - Recreate sample files shown in OpenSubtitles corpus

Issue - State: closed - Opened by keith555 over 1 year ago - 1 comment

#37 - Add yield tuple write mode

Pull Request - State: open - Opened by larrylawl almost 2 years ago

#36 - support search with 3-letter language codes or BCP-47

Issue - State: open - Opened by jorgtied almost 2 years ago
Labels: enhancement

#35 - DB for off-line search

Issue - State: closed - Opened by jorgtied almost 2 years ago - 1 comment
Labels: enhancement

#34 - fix ZeroDivisionError bug in progress printing

Pull Request - State: closed - Opened by svirpioj almost 2 years ago

#33 - Bump numpy from 1.16.4 to 1.22.0

Pull Request - State: open - Opened by dependabot[bot] over 2 years ago
Labels: dependencies

#32 - opus_read fails to extract CCMatrix

Issue - State: open - Opened by Waino almost 3 years ago - 3 comments

#31 - opus_get downloads *all* corpora with just the -s switch

Issue - State: closed - Opened by dumitrescustefan almost 3 years ago - 1 comment

#30 - convert newlines to spaces when outputting to moses formats

Pull Request - State: closed - Opened by svirpioj almost 3 years ago

#29 - Is it possible to download all corpus associate with the given language pair?

Issue - State: closed - Opened by BrightXiaoHan almost 3 years ago - 1 comment

#28 - Misleading logging information in opus_express

Issue - State: closed - Opened by aarnetalman almost 3 years ago - 1 comment

#27 - Add progress indicator to opus_express

Issue - State: closed - Opened by aarnetalman almost 3 years ago - 2 comments

#26 - Using opus_read with -az, -sz, -tz options

Issue - State: closed - Opened by pluiez about 3 years ago - 1 comment

#25 - malformed tmx from opus_read

Issue - State: closed - Opened by keith555 about 3 years ago - 1 comment

#24 - Format of downloaded files does not match the format expected by opus_read

Issue - State: closed - Opened by keith555 about 3 years ago - 1 comment

#23 - What is the tokenizer for all languages?

Issue - State: open - Opened by SefaZeng over 3 years ago

#22 - Missing alignment data for English(en) - Oromo(om)?

Issue - State: closed - Opened by ashaltu over 3 years ago - 1 comment

#21 - Memory Issue: opus_read fails to extract MultiCCAligned

Issue - State: closed - Opened by aflueckiger over 3 years ago - 1 comment

#20 - Where are the missing language pairs?

Issue - State: open - Opened by icaswell over 3 years ago - 1 comment

#19 - PyPI wheel includes old files

Issue - State: closed - Opened by compwiztobe almost 4 years ago - 2 comments

#18 - Alignment problem with JW300 corpora?

Issue - State: open - Opened by sklampfl almost 4 years ago - 1 comment

#17 - opus_express without confirmation?

Issue - State: closed - Opened by ZJaume almost 4 years ago - 1 comment

#16 - opus_express not checking correctly root directory

Issue - State: closed - Opened by ZJaume almost 4 years ago - 1 comment

#15 - fix opus_express shuffle broken due to missing import

Pull Request - State: closed - Opened by ymyt almost 4 years ago

#14 - create_bash_script.py : this file enable to create command lines for …

Pull Request - State: closed - Opened by Sohyo almost 4 years ago

#13 - List of datasets | Monolingual raw files

Issue - State: closed - Opened by jchwenger over 4 years ago - 5 comments

#12 - Opus_read: SentenceParserError

Issue - State: closed - Opened by Stamenov over 4 years ago - 5 comments

#11 - Query to get list of existing corpora (by language)

Issue - State: closed - Opened by dumitrescustefan over 4 years ago - 1 comment

#10 - Automatically switch token delimiter for languages not using whitespace

Pull Request - State: closed - Opened by pks over 4 years ago - 3 comments

#9 - problem with os.rename in opus_langid

Issue - State: closed - Opened by jorgtied almost 5 years ago - 1 comment
Labels: bug

#8 - OPUS returns no data

Issue - State: closed - Opened by george-roussos about 5 years ago - 2 comments

#7 - Question: monolingual dialogs (Finnish language)

Issue - State: closed - Opened by remotejob about 5 years ago - 2 comments

#6 - preserve inline tags

Issue - State: closed - Opened by jorgtied about 5 years ago - 1 comment
Labels: enhancement

#3 - implement opus-filter

Issue - State: closed - Opened by jorgtied over 5 years ago - 5 comments
Labels: enhancement

#2 - option for adding document boundaries

Issue - State: closed - Opened by jorgtied over 5 years ago - 1 comment
Labels: enhancement

#1 - tool for language identification

Issue - State: closed - Opened by jorgtied over 5 years ago - 3 comments
Labels: enhancement