Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / microsoft/BlingFire issues and pull requests
#181 - In some cases, blingfire models created with the new vocab.txt produce different results.
Issue -
State: open - Opened by springkim 18 days ago
#180 - Unable to load DLL 'blingfiretokdll' or one of its dependencies: 找不到指定的模块。 (0x8007007E) System.DllNotFoundException: Unable to load DLL 'blingfiretokdll' or one of its dependencies: 找不到指定的模块。 (0x8007007E)
Issue -
State: closed - Opened by gpww about 1 year ago
- 1 comment
#179 - replace nuget project
Pull Request -
State: closed - Opened by yhj9999999 about 1 year ago
#178 - Loading the bert_base_tok.bin model sometimes throws an exception
Issue -
State: open - Opened by jw444 about 1 year ago
#177 - c# example negative offset for Starts
Issue -
State: open - Opened by dbalikhin over 1 year ago
#176 - Build_Dll_For_Linux_ARM64 job fails
Issue -
State: open - Opened by Miloslav over 1 year ago
#175 - [MINOR:TYPO] Update README.md
Pull Request -
State: open - Opened by cakiki over 1 year ago
- 2 comments
#174 - Adding Microsoft SECURITY.MD
Pull Request -
State: closed - Opened by microsoft-github-policy-service[bot] over 1 year ago
#173 - Support for CLIP tokenizers from Hugging Face
Issue -
State: open - Opened by dkalinowski over 1 year ago
#172 - Fix memory leak when loading non-existent file
Pull Request -
State: open - Opened by dkalinowski over 1 year ago
#171 - Import issue on MacOS M1
Issue -
State: open - Opened by dariolamaj over 1 year ago
- 6 comments
#170 - /O2 in CMakeLists.txt is incompatible with vcpkg using Ninja
Issue -
State: open - Opened by davidmatson over 1 year ago
#169 - Unable to Modify Tokenization Logic
Issue -
State: open - Opened by Andrew03 over 1 year ago
#168 - Issues building on Mac OSX M2
Issue -
State: open - Opened by Kevdome3000 over 1 year ago
- 1 comment
#167 - Fix Unicode output on Windows.
Pull Request -
State: open - Opened by davidmatson over 1 year ago
#166 - "terminate called after throwing an instance of 'std::runtime_error'"
Issue -
State: open - Opened by michieal over 1 year ago
- 1 comment
#165 - BlingFire fails with all-lowercase text
Issue -
State: open - Opened by mhillebrand over 1 year ago
#164 - Fix Assignment to $[ is no longer supported errors
Pull Request -
State: open - Opened by twwhatever over 1 year ago
#163 - Fix 'Assigning non-zero to $[ is no longer possible' errors.
Pull Request -
State: closed - Opened by twwhatever over 1 year ago
- 1 comment
#161 - Removing extraneous cmake commands
Pull Request -
State: closed - Opened by TomasMorris almost 2 years ago
- 1 comment
#160 - Updating header for general consumption
Pull Request -
State: closed - Opened by TomasMorris almost 2 years ago
- 2 comments
#159 - Adding support for CMake Install target and c++ consumers
Pull Request -
State: closed - Opened by TomasMorris about 2 years ago
#158 - Add xlm-roberta-large tokenization support
Issue -
State: open - Opened by palenshus about 2 years ago
#157 - Could java call the tokenizer of bin file
Issue -
State: open - Opened by MichaleDong about 2 years ago
#156 - Missing vcruntime140.dll and vcruntime140_1.dll dependencies
Issue -
State: open - Opened by Joshua-Seekout over 2 years ago
- 4 comments
#155 - Missing numpy dependency on setup.py
Issue -
State: open - Opened by combacsa over 2 years ago
#154 - How to create i2w model
Issue -
State: open - Opened by MikiHunnter2 over 2 years ago
- 1 comment
#153 - Fix issues when loadmodel from model file which contains Unicode
Pull Request -
State: open - Opened by leti367 over 2 years ago
#152 - Add pdb files to nuget package
Pull Request -
State: open - Opened by leti367 over 2 years ago
#151 - use malloc instead of stackAlloc
Pull Request -
State: closed - Opened by leti367 over 2 years ago
- 2 comments
#150 - what is the last char of the last word from GetWords?
Issue -
State: open - Opened by tiagu-sh over 2 years ago
#149 - Update azure-pipelines.ym to be able to build for osx-arm64 and linux-arm64
Pull Request -
State: closed - Opened by leti367 over 2 years ago
#148 - Added: international models for URL tokenization.
Pull Request -
State: closed - Opened by SergeiAlonichau over 2 years ago
#147 - Support win-arm64 in NuGet package
Pull Request -
State: closed - Opened by leti367 over 2 years ago
#146 - Support M1 Macs in python bindings
Pull Request -
State: closed - Opened by pypae over 2 years ago
- 4 comments
#145 - M2M100 Marianmt tokenizers
Issue -
State: open - Opened by RaulRoPer over 2 years ago
#144 - Trouble installing for custom model creation
Issue -
State: open - Opened by Darth-Carrotpie over 2 years ago
- 2 comments
#143 - Byte offsets for original input bytes to allow non-destructive tokenization
Issue -
State: open - Opened by ELind77 over 2 years ago
#142 - Running in .NET within Unity crashes
Issue -
State: closed - Opened by Darth-Carrotpie over 2 years ago
- 6 comments
#141 - Sentence boundary for other languages
Issue -
State: open - Opened by mmehdig almost 3 years ago
#140 - libblingfiretokdll.dylib arm64e support
Issue -
State: open - Opened by youyinnn almost 3 years ago
- 1 comment
#139 - Word Tokenization - Unexpected Output
Issue -
State: open - Opened by albertnanda almost 3 years ago
#138 - Invalid parameters error FAMultiMapPack_fixed.cpp
Issue -
State: open - Opened by Kaelorn almost 3 years ago
#137 - Support for ARM64 architecture
Issue -
State: open - Opened by DavidObando almost 3 years ago
- 5 comments
#136 - Error loading files with paths with non-ANSI characters
Issue -
State: closed - Opened by DavidObando about 3 years ago
- 2 comments
#135 - Add centos7 binaries
Pull Request -
State: open - Opened by ben-childs-docusign about 3 years ago
- 1 comment
#134 - BlingFire Nuget Package does not work on CentOS7
Issue -
State: open - Opened by ben-childs-docusign about 3 years ago
- 2 comments
#133 - Fix a typo Sentence to Word
Pull Request -
State: closed - Opened by hcyang about 3 years ago
- 1 comment
#132 - removed requirements, since most of them have nothing to do with the p…
Pull Request -
State: closed - Opened by SergeiAlonichau about 3 years ago
#131 - BlingFire installation: fa_build_lex missing!?
Issue -
State: closed - Opened by Kaelorn about 3 years ago
- 2 comments
#130 - Blingfire in longrunning process
Issue -
State: closed - Opened by ELind77 about 3 years ago
- 2 comments
#129 - A list of feature requests for BlingFire
Issue -
State: open - Opened by GeorgeS2019 about 3 years ago
- 3 comments
#128 - Size of models?
Issue -
State: closed - Opened by Lelelo1 about 3 years ago
- 1 comment
#127 - Cannot find "no_padding" option in C# ?
Issue -
State: closed - Opened by myl1ne about 3 years ago
- 2 comments
#126 - updated Windows binaries
Pull Request -
State: closed - Opened by SergeiAlonichau about 3 years ago
#125 - added Mac OS binaries
Pull Request -
State: closed - Opened by SergeiAlonichau about 3 years ago
#124 - added: ids_to_text API for all models
Pull Request -
State: closed - Opened by SergeiAlonichau about 3 years ago
#123 - correcting linker flags for Xcode
Pull Request -
State: closed - Opened by gowthamrang-ds over 3 years ago
- 1 comment
#122 - [Bug] Broken master build on mac
Issue -
State: closed - Opened by gowthamrang-ds over 3 years ago
- 1 comment
#121 - [Bug] Input/Output Error compiling linguistic sources into automata on Mac
Issue -
State: open - Opened by gowthamrang-ds over 3 years ago
- 1 comment
#120 - Update readme for wasm cmake build
Pull Request -
State: closed - Opened by Toudsour over 3 years ago
#119 - Add size optimization building
Pull Request -
State: closed - Opened by Toudsour over 3 years ago
#118 - fixed special tokens parsing for case-insensitive BERT model, e.g. [C…
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#117 - Bump urllib3 from 1.24.2 to 1.26.5 in /scripts
Pull Request -
State: closed - Opened by dependabot[bot] over 3 years ago
Labels: dependencies
#116 - updated macOS binaries
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#115 - updated nuget and pypi packages with windows binaries, fixed some bugs
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#114 - increment versions, some clean up
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#113 - Roberta tokenizer - first word in sentence doesn't match huggingface tokenizer
Issue -
State: open - Opened by tomateb over 3 years ago
- 1 comment
#112 - Broken link in README
Issue -
State: open - Opened by abcdenis over 3 years ago
#111 - added no-dummy-prefix ldb.conf parameter and added API to change it f…
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#110 - Error during compile new BERT tokenizer model on Windows
Issue -
State: closed - Opened by Distancehe over 3 years ago
#109 - How to generate bin model file based on specified vocab file for wordpiece tokenizer ?
Issue -
State: closed - Opened by Distancehe over 3 years ago
#108 - Which vocab file you are using for xlm_roberta_base
Issue -
State: closed - Opened by hhdd92 over 3 years ago
- 3 comments
#107 - returned byte[] interface back, Span<T> is also avaible via BlingFire…
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#106 - Where is spm_export_vocab?
Issue -
State: closed - Opened by theoqian over 3 years ago
- 1 comment
#105 - smaller fixes to the fasttext hash code
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#104 - fixed ngram hash computation function
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#103 - Fix style cop errors: SA1200, SA1636
Pull Request -
State: closed - Opened by sanketshah11 over 3 years ago
#102 - recompiled and updated the windows dlls
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#101 - Added syllabification API, fixed nuget
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#100 - Fix Roslyn errors SA1512, CA1401, SA1633, SA1507, SA1309, SA1508, SA1210, SA1312, SA1111, SA1313
Pull Request -
State: closed - Opened by sanketshah11 over 3 years ago
- 1 comment
#99 - quick fix for earlier changes in C# code
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#98 - Fix typo in README
Pull Request -
State: closed - Opened by palerdot over 3 years ago
#97 - Some code refactoring to support Roberta algorithm, they ids and merg…
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
- 1 comment
#96 - added byte level bpe for gpt2 (initial version)
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
- 1 comment
#95 - Update CSharp bindings to support Span<T> and stackalloc
Pull Request -
State: closed - Opened by shahidkhuram over 3 years ago
#94 - Gpt2
Pull Request -
State: closed - Opened by SergeiAlonichau over 3 years ago
#93 - Bump cryptography from 3.2 to 3.3.2 in /scripts
Pull Request -
State: closed - Opened by dependabot[bot] almost 4 years ago
Labels: dependencies
#92 - Bert-like tokenization model compilation not finishing
Issue -
State: closed - Opened by RobertGengiu almost 4 years ago
- 2 comments
#91 - HuggingFace export
Issue -
State: closed - Opened by GeorgeS2019 almost 4 years ago
- 1 comment
#90 - updated helper scripts
Pull Request -
State: closed - Opened by SergeiAlonichau almost 4 years ago
#89 - Bump nltk from 3.3 to 3.4.5 in /scripts
Pull Request -
State: closed - Opened by dependabot[bot] almost 4 years ago
Labels: dependencies
#88 - Bump urllib3 from 1.24.1 to 1.24.2 in /scripts
Pull Request -
State: closed - Opened by dependabot[bot] almost 4 years ago
Labels: dependencies
#87 - added URL segmentation models
Pull Request -
State: closed - Opened by SergeiAlonichau almost 4 years ago
#86 - Calling LoadModel from .net core WebAPI throws SEHException
Issue -
State: closed - Opened by gluckez almost 4 years ago
- 2 comments
#85 - Convert SentencePiece Model to BlingFire.
Issue -
State: closed - Opened by overwindows almost 4 years ago
- 1 comment
#84 - Bump cryptography from 2.4.1 to 3.2 in /scripts
Pull Request -
State: closed - Opened by dependabot[bot] about 4 years ago
Labels: dependencies
#83 - Feature Request: ids_to_text method
Issue -
State: closed - Opened by ankane about 4 years ago
- 5 comments
#82 - Support for GPT-2, RoBERTa, and general Hugging Face models
Issue -
State: closed - Opened by ankane about 4 years ago
- 16 comments
#81 - Why personal title abbreviation split
Issue -
State: open - Opened by chunggeonlee about 4 years ago
- 1 comment