b4rtaz/distributed-llama issues and pull requests

#234 - The test of windows 10 failed during "make dllema"

Issue - State: open - Opened by Tuhy97 23 days ago - 1 comment

#233 - Implement docker

Pull Request - State: open - Opened by enigodupont 24 days ago

#232 - Adding support for hostnames on worker connection

Pull Request - State: closed - Opened by enigodupont 25 days ago - 4 comments

#231 - specify binding ip

Issue - State: open - Opened by withinboredom 26 days ago

#230 - Critical error: Cannot read magic value when first tries to inference

Issue - State: open - Opened by Tuhy97 about 1 month ago - 3 comments

#229 - Chatting in CJK language lead to crash

Issue - State: closed - Opened by omegacoleman about 2 months ago - 1 comment

#228 - fix: Tokenizer utf-8 handling

Pull Request - State: closed - Opened by omegacoleman about 2 months ago

#227 - feat: cors support.

Pull Request - State: closed - Opened by b4rtaz about 2 months ago

#226 - fix: memory leak in Tokenizer class

Pull Request - State: closed - Opened by omegacoleman about 2 months ago - 1 comment

#225 - bugs found using open-webui 'Stop Thinking' button with dllama-api

Issue - State: closed - Opened by omegacoleman about 2 months ago - 1 comment

#224 - fix: reset EOS detector across requests

Pull Request - State: closed - Opened by omegacoleman about 2 months ago - 1 comment

#223 - fix: make api server ignore SIGPIPE if needed

Pull Request - State: closed - Opened by omegacoleman about 2 months ago - 2 comments

#222 - feat: multiple eos tokens.

Pull Request - State: closed - Opened by b4rtaz about 2 months ago

#221 - feat: support more than 2 eos tokens

Pull Request - State: closed - Opened by omegacoleman about 2 months ago - 3 comments

#220 - fix: bad bound check in shiftForward_F32_F32

Pull Request - State: closed - Opened by omegacoleman about 2 months ago - 1 comment

#219 - Assert failure in shiftForward_F32_F32

Issue - State: closed - Opened by omegacoleman about 2 months ago

#218 - Access to web interface API

Issue - State: closed - Opened by DDJBR about 2 months ago

#217 - error: ‘_MM_FROUND_TO_NEAREST_INT’ was not declared in this scope

Issue - State: open - Opened by squidKid-deluxe about 2 months ago - 2 comments

#216 - Support for YOLO 11 OBB

Issue - State: open - Opened by ng-druid 2 months ago - 1 comment

#214 - Does distributed-llama currently support multimodal models?

Issue - State: open - Opened by SherronBurtint 2 months ago - 2 comments

#213 - build vulkan failed using mingw

Issue - State: open - Opened by zhangddjs 3 months ago - 4 comments

#212 - segmentation fault

Issue - State: open - Opened by zhangddjs 3 months ago - 4 comments

#211 - How about Model Context Protocol support ?

Issue - State: open - Opened by zhangddjs 3 months ago

#210 - Python pinning in converter/requirements.txt breaks pip

Issue - State: open - Opened by notfol 3 months ago

#209 - "Network is closed" error after a few interactions in multi-node setup (2× RTX 4070)

Issue - State: closed - Opened by tsa3 3 months ago - 4 comments

#208 - Support for Qwen3 30ba3b

Issue - State: open - Opened by smpurkis 3 months ago - 1 comment

#207 - Feature Request: Enable caching

Issue - State: closed - Opened by D-i-t-gh 3 months ago - 1 comment

#206 - Transfer weights and the tokenizer file to the root computer.

Issue - State: closed - Opened by evcharger 3 months ago - 1 comment

#199 - Open WebUI support

Issue - State: closed - Opened by Sboshoff76 3 months ago - 2 comments

#198 - Feature Request execute in parallel

Issue - State: open - Opened by MichaelFomenko 4 months ago - 1 comment

#197 - Error with the "make dllama"

Issue - State: closed - Opened by rportojr 4 months ago - 4 comments

#196 - feat: vulkan optimization.

Pull Request - State: closed - Opened by b4rtaz 4 months ago

#195 - fix: vulkan memory type.

Pull Request - State: closed - Opened by b4rtaz 4 months ago

#193 - Feature Request: Multiple Workers on one Machine

Issue - State: open - Opened by MichaelFomenko 4 months ago - 4 comments

#192 - feat: vulkan matmul optimization.

Pull Request - State: closed - Opened by b4rtaz 4 months ago

#191 - Network is in non-blocking mode

Issue - State: closed - Opened by LJ-Hao 4 months ago - 1 comment

#190 - Inconsistent struct layouts can break cross-architecture usage

Issue - State: open - Opened by antoine-sac 4 months ago

#189 - fix: converter hf now handles byte characters. Closes #188

Pull Request - State: closed - Opened by antoine-sac 4 months ago - 1 comment

#188 - Vocabulary containing special "byte tokens" not converted correctly

Issue - State: closed - Opened by antoine-sac 4 months ago

#187 - PORT Number

Issue - State: closed - Opened by greytery 5 months ago - 1 comment

#186 - In chat mode, the LLM agent seems to keep talking to itself without stopping.

Issue - State: closed - Opened by Marvin-BW 5 months ago - 3 comments

#185 - Error when converting tokenizer from Mistral Large Instruct 2411: "Exception: Cannot resolve bosId or eosIds"

Issue - State: open - Opened by philigrale 5 months ago - 8 comments

#184 - README.md: add model/buffer-float-type limitations

Pull Request - State: closed - Opened by lemmi 5 months ago - 1 comment

#183 - q80 and f16 models fail with Critical error: Unsupported ...

Issue - State: closed - Opened by lemmi 5 months ago - 5 comments

#182 - Converting a DeepSeek-R1 model

Issue - State: closed - Opened by D-i-t-gh 5 months ago - 4 comments

#181 - feat: benchmark.

Pull Request - State: closed - Opened by b4rtaz 5 months ago

#179 - Not able see Scaling performance with NuC (12th Gen) with deepseek_r1_distill_llama_8b_q40

Issue - State: open - Opened by deepaks2 5 months ago - 7 comments

#178 - Downloads wrong model

Issue - State: closed - Opened by greytery 5 months ago - 1 comment

#177 - Convert Hugging Face model with multiple eos tokens

Issue - State: closed - Opened by IterableTrucks 5 months ago - 1 comment

#176 - feat: vulkan.

Pull Request - State: closed - Opened by b4rtaz 5 months ago

#175 - Branching out on model support

Issue - State: open - Opened by pcfreak30 5 months ago

#174 - fix: nnuint.

Pull Request - State: closed - Opened by b4rtaz 5 months ago

#173 - Update README.md

Pull Request - State: closed - Opened by brettp 5 months ago - 1 comment