OpenNMT/CTranslate2 issues and pull requests

#1395 - Fix relative positions shape when forwarding a sequence in the decoder

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1394 - Error in translating with alternatives

Issue - State: closed - Opened by pdakwal about 1 year ago - 4 comments
Labels: bug

#1393 - Fix typo in llama 2 example

Pull Request - State: closed - Opened by vadi2 about 1 year ago

#1392 - how to obtain get past_kv_cache values?

Issue - State: open - Opened by arunpatro about 1 year ago - 3 comments

#1391 - Provide HF AutoModel interface

Issue - State: open - Opened by QuietRocket about 1 year ago - 4 comments

#1390 - AttributeError: module 'ctranslate2' has no attribute 'StorageView'

Issue - State: closed - Opened by Geremia about 1 year ago - 5 comments

#1389 - By default, keep the same FP precision when converting models

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1388 - Extremely slow generation speed for llama 2 70B chat model

Issue - State: open - Opened by k21993 about 1 year ago - 2 comments

#1387 - How to sample multiple completions?

Issue - State: closed - Opened by arunpatro about 1 year ago - 3 comments

#1386 - How to extract the logits from using `forward_batch` in cpu?

Issue - State: closed - Opened by arunpatro about 1 year ago - 2 comments

#1385 - Bigger models unproportionally slower

Issue - State: closed - Opened by NeonBohdan about 1 year ago - 3 comments

#1384 - Question about characters not allowed

Issue - State: closed - Opened by homink about 1 year ago - 2 comments

#1383 - token_type_ids for BERT models

Issue - State: closed - Opened by hachall about 1 year ago - 1 comment
Labels: enhancement

#1382 - Fixed a small error for model conversion

Pull Request - State: closed - Opened by RistoAle97 about 1 year ago

#1381 - Conversion of models does not work with torch <=1.12.1

Issue - State: closed - Opened by RistoAle97 about 1 year ago - 2 comments

#1380 - Update Llama converter to accept extra tokens in the vocabulary

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1379 - Question on CTranslate2 logprob precision of Llama model and missing bos_id in sp.decode()

Issue - State: closed - Opened by ziyuwan about 1 year ago - 7 comments

#1378 - Build error with CUDA 10.2 Jetson Nano

Issue - State: open - Opened by alexismailov2 about 1 year ago - 24 comments

#1377 - CT2 a custom fine-tuned LLAMA2 model?

Issue - State: closed - Opened by salahzoubi about 1 year ago - 8 comments

#1376 - The accuracy of model improved after quantized with ct2 in 8bit

Issue - State: closed - Opened by curname about 1 year ago - 4 comments

#1375 - Marian converter and SPM Vocabulary

Issue - State: closed - Opened by m-resta about 1 year ago - 4 comments

#1374 - Accept left offsets when applying position encodings

Pull Request - State: open - Opened by guillaumekln about 1 year ago

#1373 - distilbert-base-uncased-mnli

Issue - State: closed - Opened by AparnaAgrawal02 about 1 year ago - 1 comment
Labels: enhancement, help wanted

#1372 - Accept left offsets in the rotary embeddings layer

Pull Request - State: open - Opened by guillaumekln about 1 year ago

#1371 - Enable GPU tests for Alibi and RotaryEmbeddings layers

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1370 - Accept left offsets in the masked softmax operator

Pull Request - State: open - Opened by guillaumekln about 1 year ago

#1369 - Remove vocabulary workaround in the Llama converter

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1368 - ct2-transformers-converter errror for --model meta-llama/Llama-2-70b-chat-hf

Issue - State: closed - Opened by silvacarl2 about 1 year ago - 3 comments

#1367 - Support dtype conversion for the StorageView Python class

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1366 - Fix bfloat16 dispatch in logits processor ApplyTimestampRules

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1365 - Some questions

Issue - State: closed - Opened by breaddance about 1 year ago - 8 comments

#1364 - Pass the system prompt in static_prompt in the Llama 2 example

Pull Request - State: closed - Opened by guillaumekln about 1 year ago

#1363 - Llama 2 fails with context length >> 2000

Issue - State: closed - Opened by Joemgu7 about 1 year ago - 7 comments

#1362 - Converted Falcon 40B weights can not loaded

Issue - State: closed - Opened by mallorbc about 1 year ago - 4 comments

#1361 - Request for Implementing Support for wav2vec2, MMS, and XLS-R Models

Issue - State: open - Opened by nabil6391 about 1 year ago - 1 comment
Labels: enhancement

#1360 - Representing non-ASCII characters using ASCII

Pull Request - State: closed - Opened by BrightXiaoHan about 1 year ago - 2 comments

#1355 - LLAMA 2 support [Question] [Enhancement]

Issue - State: closed - Opened by trholding about 1 year ago - 4 comments

#1351 - While using the latest meta model (LLAMA-2-7b-chat-hf) converted with ctranslate2 ,getting ValueError: DequantizeGemmOutput: output should have a float type

Issue - State: closed - Opened by Apoorv7092 about 1 year ago - 5 comments

#1349 - Support left padding to forward batch prompts in a single step

Issue - State: open - Opened by guillaumekln about 1 year ago
Labels: enhancement

#1348 - A keyerror is raised when using the FALCON 40B model converted by ctranslate2

Issue - State: closed - Opened by srimouli04 about 1 year ago - 9 comments

#1343 - CPP inference Error. ** Error in `./run': double free or corruption (!prev):

Issue - State: closed - Opened by ustcdane about 1 year ago - 7 comments

#1337 - Get encoding from flan T5

Issue - State: open - Opened by Alexander-Jin about 1 year ago - 1 comment

#1333 - Continuous batching

Issue - State: open - Opened by andreapiso about 1 year ago - 6 comments
Labels: enhancement

#1330 - CMake error: CUDA_cublas_LIBRARY set to NOTFOUND

Issue - State: closed - Opened by Geremia about 1 year ago - 4 comments

#1329 - Code for chat inference server

Issue - State: closed - Opened by hobodrifterdavid about 1 year ago - 19 comments

#1324 - Exception when exporting bloomz model

Issue - State: open - Opened by jordimas about 1 year ago - 2 comments
Labels: bug

#1322 - How to use custom stopping criteria with the parameter callback in generate_batch() function

Issue - State: closed - Opened by curname about 1 year ago - 6 comments

#1320 - ct2-transformers-converter fails on falcon-rw-1b

Issue - State: closed - Opened by julianmukaj over 1 year ago - 3 comments
Labels: bug

#1306 - This CTranslate2 package was not compiled with CUDA support

Issue - State: closed - Opened by ciayomin over 1 year ago - 15 comments

#1300 - Request to support FlashAttention in cuda attention.cc

Issue - State: closed - Opened by nemoramo over 1 year ago - 23 comments
Labels: enhancement

#1296 - BERT Models: Huge difference in last hidden states of similar examples

Issue - State: closed - Opened by vakkov over 1 year ago - 5 comments

#1285 - Question for asynchronous in generate_batch

Issue - State: closed - Opened by Snowdar over 1 year ago - 2 comments

#1283 - can't convert opennmt.py model with alibi or rotary embeddings to ctranslate2

Issue - State: open - Opened by totaltube over 1 year ago - 8 comments
Labels: enhancement

#1250 - CUDA 12 support (libcublas.so.11 is not found)

Issue - State: closed - Opened by digitalsignalperson over 1 year ago - 25 comments

#1239 - Keep FFN output layer in float32 for T5 models

Pull Request - State: open - Opened by guillaumekln over 1 year ago - 3 comments

#1238 - Do not hardcode the library major version in CMakeLists.txt

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1237 - same GPU memory between GPT2-13B-fp16 to GPT2-13B-int8 in CTranslate2

Issue - State: open - Opened by vicwer over 1 year ago - 2 comments

#1236 - Repeated text with Marian Model generation

Issue - State: open - Opened by zzgsty over 1 year ago - 1 comment

#1235 - Fix compilation with BUILD_SHARED_LIBS=OFF

Pull Request - State: closed - Opened by panosk over 1 year ago - 3 comments

#1234 - Assisted Generation feature

Issue - State: open - Opened by wsxiaoys over 1 year ago

#1233 - Quantized RedPajama responds in chinese

Issue - State: closed - Opened by NeonBohdan over 1 year ago - 1 comment

#1232 - ValueError: Tokenizer class BloomTokenizer does not exist or is not currently imported.

Issue - State: open - Opened by moseshu over 1 year ago - 1 comment

#1231 - Support for GPTBigCodeForCausalLM (StarCoder/ SantaCoder)

Issue - State: open - Opened by michaelfeil over 1 year ago - 4 comments

#1230 - Adding support for transformers - Salesforce/CodeGen architecture

Pull Request - State: closed - Opened by michaelfeil over 1 year ago - 3 comments

#1229 - Support for CodeT5pConfig

Issue - State: open - Opened by ferboz over 1 year ago - 1 comment

#1228 - Support the MPT model from MosaicML

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1227 - Support paths with Unicode characters on Windows

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1226 - Generalize conversion of encoder-decoder models from OpenNMT-tf

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1225 - Raise asynchronous exception from generate_tokens method

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1224 - Invalid opcode on older AMD opteron cpu (avx support?)

Issue - State: closed - Opened by agittins over 1 year ago - 3 comments

#1223 - add fairseq nllb insttructions

Issue - State: open - Opened by Omicronlawful over 1 year ago - 1 comment

#1222 - Fix installation of Intel MKL package in manylinux2014

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1221 - add onmt-py converter for llama-onmt mpt-onmt

Pull Request - State: closed - Opened by vince62s over 1 year ago

#1220 - lmsys/fastchat-t5-3b-v1.0: inconsistent generated output with converted model

Issue - State: closed - Opened by Matthieu-Tinycoaching over 1 year ago - 12 comments

#1219 - CTranslate2 can support Llama?

Issue - State: closed - Opened by moseshu over 1 year ago - 1 comment

#1218 - binary version v67324752 load problem

Issue - State: closed - Opened by syngokhan over 1 year ago - 3 comments

#1217 - Extract last hidden state

Issue - State: open - Opened by dathudeptrai over 1 year ago - 4 comments
Labels: enhancement

#1216 - ct2-fairseq-converter --vocab_mapping

Issue - State: closed - Opened by Omicronlawful over 1 year ago - 1 comment

#1215 - ct2-fairseq-converter

Issue - State: closed - Opened by Omicronlawful over 1 year ago

#1214 - It works fine, but gives an error.

Issue - State: closed - Opened by mayjack0312 over 1 year ago - 2 comments

#1213 - How to add context to translation models?

Issue - State: closed - Opened by eyalmazuz over 1 year ago - 11 comments

#1212 - Support for Mosaic ML MPT 7B

Issue - State: closed - Opened by praneetreddy017 over 1 year ago - 4 comments

#1211 - How to trans a model with Parallel encoder

Issue - State: open - Opened by wangshauitj over 1 year ago - 1 comment

#1210 - Resume model execution from where it stopped

Issue - State: closed - Opened by NeonBohdan over 1 year ago - 1 comment
Labels: enhancement

#1209 - Different generation parameters in the same batch

Issue - State: open - Opened by juliensalinas over 1 year ago

#1208 - Python Interface AutoModelConvert for Huggingface Transformers

Issue - State: closed - Opened by michaelfeil over 1 year ago - 3 comments

#1207 - Model running fine on cpu but not on gpu

Issue - State: closed - Opened by mayanksinha900 over 1 year ago - 4 comments

#1206 - GPT-NeoX

Issue - State: open - Opened by palladium123 over 1 year ago - 3 comments

#1205 - MKL not used when static linking with WHOLE_ARCHIVE

Issue - State: open - Opened by panosk over 1 year ago - 4 comments

#1204 - Optimize rotary embedding recreation

Issue - State: closed - Opened by janekb04 over 1 year ago - 2 comments

#1203 - How to build the prebuild binaries?

Issue - State: closed - Opened by JustFrederik over 1 year ago - 7 comments

#1202 - support ChatGLM

Issue - State: open - Opened by nghuyong over 1 year ago - 6 comments

#1201 - Manually destroy cuBLAS and cuDNN handles before threads exit

Pull Request - State: open - Opened by guillaumekln over 1 year ago

#1200 - GPT-J on Tesla T4: the target device or backend do not support efficient float16 computation

Issue - State: closed - Opened by juliensalinas over 1 year ago - 2 comments

#1199 - Does CT2 support loading of two GPUs

Issue - State: open - Opened by lx0126z over 1 year ago - 1 comment

#1198 - Update docstring for end_token argument

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1197 - Older architectures are not inserted to CUDA_ARCH_LIST

Issue - State: closed - Opened by panosk over 1 year ago - 2 comments

#1196 - CMake errors when using -DBUILD_SHARED_LIBS=OFF after #1178

Issue - State: closed - Opened by panosk over 1 year ago - 14 comments

#1195 - Add option to keep the end token in the results

Pull Request - State: closed - Opened by guillaumekln over 1 year ago

#1194 - Centos 8 support

Issue - State: closed - Opened by kolserdav over 1 year ago - 1 comment
