Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / llmariner/inference-manager issues and pull requests
#543 - feat(engine): add ollama tempalte for Phi-4
Pull Request -
State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement
#542 - fix(engine): ignore ErrRequestCanceled from Preloader.pullModel
Pull Request -
State: closed - Opened by kkaneda about 1 month ago
Labels: bug
#541 - fix(server): fix a data race in TestSendAndProcessTask
Pull Request -
State: closed - Opened by kkaneda about 1 month ago
Labels: bug
#540 - chore(engine): add a vlog on model pull error
Pull Request -
State: closed - Opened by kkaneda about 1 month ago
Labels: skip-changelog
#539 - feat(engine): add a template for phi-4
Pull Request -
State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement
#538 - feat: bump vllm version to support Phi-4
Pull Request -
State: closed - Opened by kkaneda about 1 month ago
- 1 comment
Labels: enhancement
#537 - chore: fix a comment in findLeastLoadedEngine
Pull Request -
State: closed - Opened by kkaneda about 1 month ago
Labels: skip-changelog
#536 - feat: route request to another engine when the runtime fail to schedule
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: enhancement
#535 - feat: cancel an inference request when an error occurs
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: enhancement
#534 - Release v1.12.0
Pull Request -
State: closed - Opened by github-actions[bot] about 2 months ago
Labels: skip-changelog
#533 - fix(engine): make special_tokens_map.json optional
Pull Request -
State: closed - Opened by kkaneda about 2 months ago
Labels: bug
#532 - feat(engine): make --chat-template optional
Pull Request -
State: closed - Opened by kkaneda about 2 months ago
Labels: enhancement
#531 - Release v1.11.0
Pull Request -
State: closed - Opened by github-actions[bot] about 2 months ago
Labels: skip-changelog
#530 - fix(engine): set the default context length for deepseek
Pull Request -
State: closed - Opened by kkaneda about 2 months ago
Labels: bug
#529 - fix(engine): add an Ollama template for deepseek-ai-DeepSeek-Coder-V2-Lite-Instruct
Pull Request -
State: closed - Opened by kkaneda about 2 months ago
Labels: bug
#528 - chore: remove the legacyContent field
Pull Request -
State: closed - Opened by kkaneda about 2 months ago
Labels: skip-changelog
#527 - feat(engine): be able to configure runtime class from Helm values
Pull Request -
State: closed - Opened by kkaneda about 2 months ago
Labels: enhancement
#526 - ci(local): disable component-status-sender in the local dev env
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: skip-changelog
#525 - feat(engine): be able to specify a scheduler name
Pull Request -
State: closed - Opened by kkaneda about 2 months ago
Labels: enhancement
#524 - fix(engine): add a retry delay for the reachable check to the runtime
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
#523 - feat(engine): update ScaledObjects during startup
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: enhancement
#522 - feat(engine): expose inference engine metrics
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: enhancement
#521 - fix(script): fix helmfile selecting flag
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: bug
#520 - refactor(engine): rename metrics files to client
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: skip-changelog
#519 - fix(engine): disable leader election when the autoscaler type is keda
Pull Request -
State: closed - Opened by Ladicle about 2 months ago
Labels: bug
#518 - feat(engine): set --tensor-paralle-size for Inferentia nodes
Pull Request -
State: closed - Opened by kkaneda 2 months ago
Labels: enhancement
#517 - feat(engine)!: add KEDA integration for vLLM
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: enhancement, breaking-change
#516 - refactor(engine): move scaler registerer from runtime to autoscaler pkg
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: skip-changelog
#515 - refactor(engine): rename files for the builtin scaler
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: skip-changelog
#514 - refactor(engine): rename Multiautoscaler to BuiltinScaler
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: skip-changelog
#513 - feat: add a template for Llama3.3
Pull Request -
State: closed - Opened by kkaneda 2 months ago
Labels: enhancement
#512 - chore(chart): remove unnecessary file
Pull Request -
State: closed - Opened by Ladicle 2 months ago
- 1 comment
Labels: skip-changelog
#511 - Release v1.10.1
Pull Request -
State: closed - Opened by github-actions[bot] 2 months ago
Labels: skip-changelog
#510 - chore: create a PodStatusSender only when enabled
Pull Request -
State: closed - Opened by kkaneda 2 months ago
Labels: skip-changelog
#509 - fix: bump cluster-manager dep to fix health reporting
Pull Request -
State: closed - Opened by kkaneda 2 months ago
Labels: bug
#508 - Release v1.10.0
Pull Request -
State: closed - Opened by github-actions[bot] 2 months ago
Labels: skip-changelog
#507 - ci(apply-dep): support for installing extra dependency apps
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: skip-changelog
#506 - feat(runtime): set runtime/model annotations to the pod template
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: enhancement
#505 - feat(engine): send component status message
Pull Request -
State: closed - Opened by guangrui-cloudnatix 2 months ago
Labels: enhancement
#504 - refactor(chart): wrap optional values with `with`
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: skip-changelog
#503 - feat: support vLLM on macOS for development
Pull Request -
State: closed - Opened by Ladicle 2 months ago
Labels: enhancement
#502 - ci: use memory as a rate limit backend in the local dev env
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#502 - ci: use memory as a rate limit backend in the local dev env
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#501 - chore(hack): implicitly specify the dependency components for test
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#501 - chore(hack): implicitly specify the dependency components for test
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#500 - Release v1.9.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#500 - Release v1.9.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#499 - ci(release): add bump option
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#499 - ci(release): add bump option
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#498 - fix(chart): unset the default value of `enable`
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: bug
#498 - fix(chart): unset the default value of `enable`
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: bug
#497 - feat: support request-based rate-limiter
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#497 - feat: support request-based rate-limiter
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#496 - fix(engine): unset legacyContent
Pull Request -
State: closed - Opened by guangrui-cloudnatix 3 months ago
- 4 comments
Labels: bug
#496 - fix(engine): unset legacyContent
Pull Request -
State: closed - Opened by guangrui-cloudnatix 3 months ago
- 4 comments
Labels: bug
#495 - fix(engine): correctly set pod annotations for runtimes
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: bug
#495 - fix(engine): correctly set pod annotations for runtimes
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: bug
#494 - test(server): fix the flaky integration test
Pull Request -
State: closed - Opened by kkaneda 3 months ago
#494 - test(server): fix the flaky integration test
Pull Request -
State: closed - Opened by kkaneda 3 months ago
#493 - fix(trition-proxy): remove an unneeded char in the prompt template
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: bug
#493 - fix(trition-proxy): remove an unneeded char in the prompt template
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: bug
#492 - feat(api): support both array and string for the "content" field
Pull Request -
State: closed - Opened by kkaneda 3 months ago
- 1 comment
Labels: enhancement
#492 - feat(api): support both array and string for the "content" field
Pull Request -
State: closed - Opened by kkaneda 3 months ago
- 1 comment
Labels: enhancement
#491 - ci(chart): cleanup LLMariner Chart.lock for test
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#491 - ci(chart): cleanup LLMariner Chart.lock for test
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#490 - feat(server): support an enable flag for the usage sender
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#490 - feat(server): support an enable flag for the usage sender
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#489 - Release v1.8.1
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#489 - Release v1.8.1
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#488 - Release v1.9.0
Pull Request -
State: closed - Opened by kkaneda 3 months ago
#488 - Release v1.9.0
Pull Request -
State: closed - Opened by kkaneda 3 months ago
#487 - Release v1.9.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#487 - Release v1.9.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#486 - fix(chart): update the schema of the engine chart
Pull Request -
State: closed - Opened by kkaneda 3 months ago
#486 - fix(chart): update the schema of the engine chart
Pull Request -
State: closed - Opened by kkaneda 3 months ago
#485 - feat(chart): add an enable option for the dependency condition
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#484 - Release v1.8.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#483 - fix(engine/config): restore an unreferenced volume validation
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: bug
#482 - feat(runtime): set an engine Deployment as an owner of runtime StatefulSet
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#481 - ci(make): add target to re-apply and restarts components
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#480 - fix(engine): fix leader-election and runtime deletion handling
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: bug
#479 - feat(api/engine): support audio chat completion
Pull Request -
State: closed - Opened by guangrui-cloudnatix 3 months ago
- 1 comment
Labels: enhancement
#478 - fix(server): set StreamOptions only for streaming requests
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: bug
#477 - ci: add scripts for testing local inference server/engine
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#476 - Release v1.7.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#475 - refactor(log): remove unused struct
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#474 - ci(make): fix git-clean-check to validate the preceding target results
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#473 - feat(engine): propagate pod annotations to runtime pods
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: enhancement
#472 - fix(chart): do not set the default value of s3.endpoinrUrl
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: bug
#471 - feat(chart): add cronjob for regular runtime shutdown
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#470 - fix(values.yaml): fix typos
Pull Request -
State: closed - Opened by takeshi-cloudnatix 3 months ago
- 2 comments
#469 - Release v1.6.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#468 - feat(engine): change the vLLM version back to 0.6.2
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: enhancement
#467 - feat(engine): set VLLM_RPC_TIMEOUT to a larger value
Pull Request -
State: closed - Opened by kkaneda 3 months ago
Labels: enhancement
#466 - chore(chart): fix a comment for PrometheusRule
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#465 - feat(server): add an option to enable/disable dynamic model loading
Pull Request -
State: closed - Opened by kkaneda 3 months ago
- 3 comments
Labels: enhancement
#464 - Release v1.5.0
Pull Request -
State: closed - Opened by github-actions[bot] 3 months ago
Labels: skip-changelog
#463 - feat(chart): productionize helm chart
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: enhancement
#462 - ci(pr-labeler): change trigger to pull_request_target
Pull Request -
State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog
#461 - feat(api): support multiple content types in chat completion request
Pull Request -
State: closed - Opened by guangrui-cloudnatix 3 months ago
Labels: enhancement