Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / google/sentencepiece issues and pull requests
#1088 - fix crash in unigram model training
Pull Request -
State: open - Opened by pilleye 4 days ago
- 1 comment
#1087 - Fix ci qemu
Pull Request -
State: closed - Opened by dbowring 5 days ago
#1086 - Bump cryptography from 43.0.0 to 44.0.1 in /.github/workflows/requirements in the pip group
Pull Request -
State: open - Opened by dependabot[bot] 6 days ago
Labels: dependencies, python
#1085 - Fix CI build issues
Pull Request -
State: open - Opened by dbowring 9 days ago
- 1 comment
#1084 - Upgrade cibuildwheel to v2.20.0 (build for python3.13)
Pull Request -
State: open - Opened by dbowring 12 days ago
- 2 comments
#1083 - Unable to install sentencepiece on Python 3.13
Issue -
State: open - Opened by glxay 13 days ago
- 5 comments
#1082 - Unable to install sentencepiece on Python 3.13
Issue -
State: closed - Opened by glxay 13 days ago
#1081 - Bump the github-actions group across 1 directory with 4 updates
Pull Request -
State: open - Opened by dependabot[bot] 17 days ago
Labels: dependencies, github_actions
#1080 - Bump the build-time-deps group across 1 directory with 6 updates
Pull Request -
State: open - Opened by dependabot[bot] 17 days ago
Labels: dependencies, python
#1079 - Vocabulary size too high
Issue -
State: closed - Opened by jixiedaima 27 days ago
- 1 comment
#1078 - model issue:
Issue -
State: closed - Opened by jixiedaima about 1 month ago
- 2 comments
#1077 - `split_by_number` only splits Western Arabic Numerals and Full-Width Digits
Issue -
State: closed - Opened by Numeri about 1 month ago
- 1 comment
#1076 - [Question] Could not install via pip or build from source code as a model of python
Issue -
State: open - Opened by Mikachu2333 about 1 month ago
- 2 comments
#1075 - is possible convert hugging face tokenizers in sentence piece .model?
Issue -
State: closed - Opened by Caio-lima-santos about 1 month ago
#1074 - Update python readme prerequisites
Pull Request -
State: open - Opened by Chad-007 about 2 months ago
#1073 - Add dummy prefix from args
Pull Request -
State: closed - Opened by yiyangh-ps 3 months ago
- 1 comment
#1072 - Bump the build-time-deps group across 1 directory with 6 updates
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
- 1 comment
Labels: dependencies, python
#1071 - BPE Dropout tokenizer generates unk at the beginning of sequence
Issue -
State: open - Opened by AnnaLebedeva 3 months ago
#1070 - The pip command to install the SentencePiece Python module fails.
Issue -
State: open - Opened by tprrt 3 months ago
- 4 comments
#1069 - Doesn't seem to work with Python 3.13
Issue -
State: open - Opened by tprrt 3 months ago
- 9 comments
#1068 - Initialized number of seed sentencepieces too low
Issue -
State: open - Opened by DmitriiP20 3 months ago
- 1 comment
#1067 - subprocess-exited-with-error
Issue -
State: closed - Opened by Nana-kwame-junior 3 months ago
- 2 comments
#1066 - Update artifact actions from v3 to v4
Pull Request -
State: open - Opened by kasinadhsarma 3 months ago
- 1 comment
#1065 - Asan detects memory leak in sentencepiece/_sentencepiece.cpython-312-x86_64-linux-gnu.so+0x6f7f4
Issue -
State: open - Opened by renxida 4 months ago
#1064 - Bump the build-time-deps group across 1 directory with 4 updates
Pull Request -
State: closed - Opened by dependabot[bot] 4 months ago
- 1 comment
Labels: dependencies, python
#1063 - Bump the github-actions group across 1 directory with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] 4 months ago
- 1 comment
Labels: dependencies, github_actions
#1062 - Distributed implementation of the unsupervised unigram word segmentizer possible?
Issue -
State: closed - Opened by y-he2 4 months ago
#1061 - Enhancements to CI Workflows and Python Module Initialization with Minor Fixes
Pull Request -
State: open - Opened by kasinadhsarma 4 months ago
- 1 comment
#1060 - Compatibility Issue when using v0.2.0 with transformers and tensorflow
Issue -
State: open - Opened by aws-tianquaw 5 months ago
- 3 comments
#1059 - "space must not be included in normalized string" when training with a sentence iterator
Issue -
State: closed - Opened by bauwenst 5 months ago
- 1 comment
#1058 - Bump the github-actions group across 1 directory with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] 5 months ago
- 1 comment
Labels: dependencies, github_actions
#1057 - Bump the build-time-deps group across 1 directory with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] 5 months ago
- 1 comment
Labels: dependencies, python
#1056 - Use libsentencepiece.0.dylib in macos. When load model, model_factory.cc(43) LOG(ERROR) Unknown model_type: 16
Issue -
State: closed - Opened by codeAndxv 5 months ago
- 2 comments
#1055 - Manually modify the vocabulary list
Issue -
State: open - Opened by lileica 5 months ago
- 1 comment
#1054 - Can't load Llama3 tokenizer.model
Issue -
State: closed - Opened by fabriceyhc 5 months ago
- 1 comment
#1053 - Is it possible to add normalization rules into a trained sentence piece model?
Issue -
State: closed - Opened by lost-libra 5 months ago
- 2 comments
#1052 - Training with a custom base vocabulary and handling reserved tokens
Issue -
State: closed - Opened by rteehas 5 months ago
- 1 comment
#1051 - Crashes on out of range inputs depending on other inputs
Issue -
State: open - Opened by colehaus 5 months ago
- 1 comment
Labels: bug
#1050 - logprobs in the vocabulary file do not match the values computed from the tokenized training document
Issue -
State: closed - Opened by pnugues 5 months ago
- 2 comments
#1049 - Bump the build-time-deps group in /.github/workflows/requirements with 2 updates
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
- 1 comment
Labels: dependencies, python
#1048 - Bump the github-actions group with 2 updates
Pull Request -
State: closed - Opened by dependabot[bot] 6 months ago
- 1 comment
Labels: dependencies, github_actions
#1047 - With unigram algorithm, constant piece at end of each sentences does not become a token
Issue -
State: open - Opened by jogardi 6 months ago
Labels: bug
#1046 - Error Attribute Error: type object 'SentencePieceTrainer' has no attribute 'train'. Did you mean: 'Train'?
Issue -
State: open - Opened by bop578530 6 months ago
- 1 comment
#1045 - builds for android devices
Issue -
State: closed - Opened by RaoufiTech 6 months ago
#1044 - decode token one by one
Issue -
State: closed - Opened by nigelzzz 6 months ago
- 1 comment
#1043 - decode one by one can't show space
Issue -
State: closed - Opened by nigelzzz 6 months ago
- 3 comments
#1042 - Why is the Hugging Face encoding 1 greater compared to the Google SentencePiece encoding when using the XLM-RoBERTa SentencePiece tokenizer?
Issue -
State: closed - Opened by RaoufiTech 6 months ago
- 1 comment
#1041 - Bump the build-time-deps group across 1 directory with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] 7 months ago
Labels: dependencies, python
#1040 - Bump the github-actions group with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] 7 months ago
Labels: dependencies, github_actions
#1039 - multi-thread batch encode seems slower than list comprehension
Issue -
State: closed - Opened by Mr-Grin 7 months ago
- 1 comment
#1038 - Update setup.py
Pull Request -
State: closed - Opened by raushanksec 7 months ago
- 1 comment
#1037 - Add support for windows arm64
Pull Request -
State: closed - Opened by Nagico2 7 months ago
- 1 comment
#1036 - Bump the build-time-deps group across 1 directory with 4 updates
Pull Request -
State: closed - Opened by dependabot[bot] 7 months ago
- 1 comment
Labels: dependencies, python
#1035 - Bump the pip group in /.github/workflows/requirements with 2 updates
Pull Request -
State: closed - Opened by dependabot[bot] 7 months ago
Labels: dependencies, python
#1034 - Bump certifi from 2023.11.17 to 2024.7.4 in /.github/workflows/requirements in the pip group
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies, python
#1033 - Bump the github-actions group across 1 directory with 6 updates
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies, github_actions
#1032 - Bump the build-time-deps group in /.github/workflows/requirements with 4 updates
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies, python
#1031 - Zero Width Joiner issue for Sinhala Language
Issue -
State: open - Opened by Nadil-K 8 months ago
#1030 - No typings in Python package
Issue -
State: open - Opened by marcospgp 8 months ago
- 1 comment
Labels: enhancement
#1029 - When I set SPM_PROTOBUF_PROVIDER to "package" in CMakeLists.txt, the compilation fails.
Issue -
State: open - Opened by hhxdestiny 8 months ago
#1028 - trainer_interface.cc: Integer value -1 is outside the valid range of values [0, 255] for the enumeration type 'ScriptType'
Issue -
State: open - Opened by kcoul 8 months ago
- 1 comment
Labels: enhancement
#1027 - Bump urllib3 from 2.1.0 to 2.2.2 in /.github/workflows/requirements in the pip group
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies, python
#1026 - Error
Issue -
State: closed - Opened by silentghost1412 8 months ago
- 1 comment
#1025 - install command line tools without sudo
Issue -
State: closed - Opened by zjesko 8 months ago
- 1 comment
#1024 - Wrong calculation of max_score in unigram_model.cc
Issue -
State: open - Opened by fairydreaming 8 months ago
#1023 - How to deal with id
Issue -
State: open - Opened by 980202006 8 months ago
- 3 comments
#1022 - Parameterize lattice node allocator size to optimize chunk allocation performance
Pull Request -
State: closed - Opened by PriyankaRanganath 9 months ago
- 3 comments
#1021 - How long does it take to train 31.2GB text data?
Issue -
State: closed - Opened by Mintchocolater 9 months ago
- 1 comment
#1020 - Bump the build-time-deps group in /.github/workflows/requirements with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] 9 months ago
Labels: dependencies, python
#1019 - Bump the github-actions group across 1 directory with 6 updates
Pull Request -
State: closed - Opened by dependabot[bot] 9 months ago
- 1 comment
Labels: dependencies, github_actions
#1018 - resume/restart training of tokenizer
Issue -
State: closed - Opened by ganeshkrishnan1 9 months ago
- 3 comments
#1017 - I want to obtain a model file using my vocab!
Issue -
State: closed - Opened by scj0709 9 months ago
- 1 comment
#1016 - Convert SentencePiece .vocab format to OpenNMT-py .onmt_vocab format
Issue -
State: closed - Opened by HURIMOZ 9 months ago
- 1 comment
#1015 - fixing minor typos in the API.md
Pull Request -
State: closed - Opened by Cassini-chris 9 months ago
#1014 - debloat the cmakelists.txt and add a bunch of customization for building
Pull Request -
State: closed - Opened by alexlnkp 9 months ago
#1013 - bump CMake minimum required version to avoid warnings
Pull Request -
State: closed - Opened by alexlnkp 9 months ago
- 2 comments
#1012 - adding vocab_size consistency
Pull Request -
State: closed - Opened by Cassini-chris 9 months ago
#1011 - Bump requests from 2.31.0 to 2.32.0 in /.github/workflows/requirements in the pip group
Pull Request -
State: closed - Opened by dependabot[bot] 9 months ago
Labels: dependencies, python
#1010 - Runtime error on iOS
Issue -
State: closed - Opened by l3utterfly 9 months ago
- 11 comments
#1009 - Tokenization for phonetic languages
Issue -
State: closed - Opened by divyeshrajpura4114 9 months ago
- 3 comments
#1008 - make it more friendly for mingw enviroments
Pull Request -
State: closed - Opened by Kreijstal 10 months ago
- 1 comment
#1007 - Fix typo
Pull Request -
State: closed - Opened by xu-song 10 months ago
#1006 - Build sentencepiece with mingw
Issue -
State: closed - Opened by Kreijstal 10 months ago
- 1 comment
#1005 - Fixing issues with the normalizer.cc (typo, type safety, cast fucn)
Pull Request -
State: closed - Opened by Cassini-chris 10 months ago
#1004 - Bump setuptools from 69.2.0 to 69.5.1 in /.github/workflows/requirements in the build-time-deps group
Pull Request -
State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies, python
#1003 - Bump the github-actions group with 6 updates
Pull Request -
State: closed - Opened by dependabot[bot] 10 months ago
- 1 comment
Labels: dependencies, github_actions
#1002 - Add missing output formats to spm_encode flag documentation
Pull Request -
State: closed - Opened by mcognetta 10 months ago
#1001 - Tokenize at the word level without spacers nor joiners
Issue -
State: closed - Opened by HURIMOZ 10 months ago
- 2 comments
#1000 - No make file found while build and install the Python wrapper
Issue -
State: closed - Opened by NickStrain 10 months ago
- 2 comments
#999 - Treat Hawaiian Glottal stop as consonant, not punctuation
Issue -
State: closed - Opened by HURIMOZ 10 months ago
- 4 comments
#998 - Bump idna from 3.6 to 3.7 in /.github/workflows/requirements
Pull Request -
State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies, python
#997 - Is GGUF supported?
Issue -
State: closed - Opened by micheledellaguardia 10 months ago
- 1 comment
#996 - Bump the github-actions group with 6 updates
Pull Request -
State: closed - Opened by dependabot[bot] 11 months ago
- 1 comment
Labels: dependencies, github_actions
#995 - Bump the build-time-deps group in /.github/workflows/requirements with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies, python
#994 - Support for Windows Python 3.12.2
Issue -
State: closed - Opened by Nick- 11 months ago
#993 - Error when running this command: pip install 'transformers[tf-cpu]' on mac
Issue -
State: closed - Opened by ambadumbuya 11 months ago
- 1 comment
#992 - Inconsistent result between py and cpp
Issue -
State: closed - Opened by Lewis-Lu 11 months ago
- 1 comment