Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / llmariner/job-manager issues and pull requests

#331 - ci(hack): disable redis secret for dev environment

Pull Request - State: closed - Opened by Ladicle about 1 month ago
Labels: skip-changelog

#330 - chore(chart): remove an unnecessary space in values.yaml

Pull Request - State: closed - Opened by Ladicle about 1 month ago
Labels: skip-changelog

#329 - ci: add scripts for setup a job-manager test environment

Pull Request - State: closed - Opened by Ladicle about 1 month ago
Labels: skip-changelog

#328 - chore(server): cleanup indices and remove unused columns

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: skip-changelog

#327 - feat(server): add index to the cluster_id column of clusters

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#326 - fix(server): sort the cluster list in ListClusters

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: bug

#325 - feat(dispatcher): pull queued workloads only from an assigned cluster

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#324 - fix(server): add ingress path for /llmariner.jobs.server.v1.JobWorkerService`

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: bug

#323 - feat(dispatcher): be able to configure cluster status update interval

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#322 - feat(engine): ignore cordoned GPU nodes from cluster status

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#321 - feat(server): Ignore stale clusters from scheduling candidates

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#320 - feat(server): implement the scheduler

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#319 - feat(dispatcher): send cluster info to server

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#318 - feat(api): add JobService and ListClusters

Pull Request - State: closed - Opened by kkaneda about 1 month ago - 1 comment
Labels: enhancement

#317 - feat(api): add the "gpu_nodes" field in ClusterStatus

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: enhancement

#316 - chore(server): introduce Scheduler

Pull Request - State: closed - Opened by kkaneda about 1 month ago
Labels: skip-changelog

#315 - chore(dispatcher): remove the GetSelfCluster call

Pull Request - State: closed - Opened by kkaneda about 2 months ago
Labels: skip-changelog

#314 - feat: implement UpdateClusterStatus

Pull Request - State: closed - Opened by kkaneda about 2 months ago
Labels: enhancement

#313 - feat(api): add an API for sending the cluster status

Pull Request - State: closed - Opened by kkaneda about 2 months ago
Labels: enhancement

#312 - Revert "feat: persist schedulable envs to database (#311)"

Pull Request - State: closed - Opened by kkaneda about 2 months ago

#311 - feat: persist schedulable envs to database

Pull Request - State: closed - Opened by kkaneda about 2 months ago
Labels: enhancement

#310 - chore: bump rbac-server dep from 0.3.0 to 0.6.0

Pull Request - State: closed - Opened by kkaneda about 2 months ago
Labels: skip-changelog

#309 - Release v1.4.1

Pull Request - State: closed - Opened by github-actions[bot] 2 months ago

#308 - fix: use a beacon health check

Pull Request - State: closed - Opened by kkaneda 2 months ago
Labels: bug

#307 - fix: bump cluster-manager dep to fix health reporting

Pull Request - State: closed - Opened by kkaneda 2 months ago
Labels: bug

#306 - fix(dispatcher): set distinct names to the controllers

Pull Request - State: closed - Opened by kkaneda 2 months ago
Labels: bug

#305 - Release v1.4.0

Pull Request - State: closed - Opened by github-actions[bot] 2 months ago

#304 - feat(dispatcher): report component status to cluster manager

Pull Request - State: closed - Opened by guangrui-cloudnatix 2 months ago
Labels: enhancement

#303 - Release v1.3.0

Pull Request - State: closed - Opened by github-actions[bot] 3 months ago

#302 - ci(release): add bump option

Pull Request - State: closed - Opened by Ladicle 3 months ago
Labels: skip-changelog

#301 - fix(chart): unset the default value of `enable`

Pull Request - State: closed - Opened by Ladicle 3 months ago
Labels: bug

#300 - feat(fine-tuning): make BitsAndBytesQuantization quantization optional

Pull Request - State: closed - Opened by kkaneda 3 months ago
Labels: enhancement

#299 - fix(dispatcher): do not pull models that have the same prefix

Pull Request - State: closed - Opened by kkaneda 3 months ago
Labels: bug

#298 - feat(fine-tuning): install autoawq

Pull Request - State: closed - Opened by kkaneda 3 months ago
Labels: enhancement

#297 - feat(fine-tuning): bump the transformer to the latest version

Pull Request - State: closed - Opened by kkaneda 3 months ago
Labels: enhancement

#296 - feat: bump the image of fine-tuning jobs

Pull Request - State: open - Opened by kkaneda 3 months ago
Labels: enhancement

#295 - Release v1.2.0

Pull Request - State: closed - Opened by github-actions[bot] 3 months ago

#294 - feat(mod): upgrade llmariner modules

Pull Request - State: closed - Opened by Ladicle 3 months ago
Labels: enhancement

#293 - feat(server): support an enable flag for the usage sender

Pull Request - State: closed - Opened by Ladicle 3 months ago
Labels: enhancement

#100 - Fix typo

Pull Request - State: closed - Opened by Ladicle 10 months ago

#100 - Fix typo

Pull Request - State: closed - Opened by Ladicle 10 months ago

#99 - Be able to specify S3 region

Pull Request - State: closed - Opened by kkaneda 10 months ago

#99 - Be able to specify S3 region

Pull Request - State: closed - Opened by kkaneda 10 months ago

#98 - Use global Helm values

Pull Request - State: closed - Opened by kkaneda 10 months ago

#97 - Be able to specify the image pull policy of fine-tuning jobs

Pull Request - State: closed - Opened by kkaneda 10 months ago

#96 - Pass a validation file to a fine-tuning job

Pull Request - State: closed - Opened by kkaneda 10 months ago

#95 - Set validation file in job proto

Pull Request - State: closed - Opened by kkaneda 10 months ago

#95 - Set validation file in job proto

Pull Request - State: closed - Opened by kkaneda 10 months ago

#94 - Check if a base model exists at job creation time

Pull Request - State: closed - Opened by kkaneda 10 months ago

#94 - Check if a base model exists at job creation time

Pull Request - State: closed - Opened by kkaneda 10 months ago

#93 - Add arm64 support and latest tag.

Pull Request - State: closed - Opened by jmuk 10 months ago

#93 - Add arm64 support and latest tag.

Pull Request - State: closed - Opened by jmuk 10 months ago

#92 - Be able to configure job container image

Pull Request - State: closed - Opened by kkaneda 10 months ago

#91 - Update ECR repository names.

Pull Request - State: closed - Opened by jmuk 10 months ago

#91 - Update ECR repository names.

Pull Request - State: closed - Opened by jmuk 10 months ago

#90 - Use the public ECR for dispatcher.

Pull Request - State: closed - Opened by jmuk 10 months ago - 4 comments

#90 - Use the public ECR for dispatcher.

Pull Request - State: closed - Opened by jmuk 10 months ago - 4 comments

#89 - Allow uploading fake-job as a manual step.

Pull Request - State: closed - Opened by jmuk 10 months ago

#88 - Verify authorization token

Pull Request - State: closed - Opened by Ladicle 10 months ago

#87 - Upload generated models

Pull Request - State: closed - Opened by kkaneda 10 months ago

#87 - Upload generated models

Pull Request - State: closed - Opened by kkaneda 10 months ago

#86 - Fix service account label

Pull Request - State: closed - Opened by kkaneda 10 months ago

#86 - Fix service account label

Pull Request - State: closed - Opened by kkaneda 10 months ago

#85 - Add service account to server

Pull Request - State: closed - Opened by kkaneda 10 months ago

#84 - Make job correctly load files

Pull Request - State: closed - Opened by kkaneda 10 months ago

#84 - Make job correctly load files

Pull Request - State: closed - Opened by kkaneda 10 months ago

#83 - Create Role & RoleBinding in the job namespace

Pull Request - State: closed - Opened by kkaneda 10 months ago - 1 comment

#82 - Skip build and push of fine-tuning as post-merge.

Pull Request - State: closed - Opened by jmuk 10 months ago

#81 - Fix a typo

Pull Request - State: closed - Opened by jmuk 10 months ago

#80 - Fix Role and RoleBindings

Pull Request - State: closed - Opened by kkaneda 10 months ago - 1 comment

#80 - Fix Role and RoleBindings

Pull Request - State: closed - Opened by kkaneda 10 months ago - 1 comment

#79 - Remove volume mount for model store

Pull Request - State: closed - Opened by kkaneda 10 months ago

#79 - Remove volume mount for model store

Pull Request - State: closed - Opened by kkaneda 10 months ago

#78 - Refactor post-merge script.

Pull Request - State: closed - Opened by jmuk 10 months ago

#78 - Refactor post-merge script.

Pull Request - State: closed - Opened by jmuk 10 months ago

#77 - Make the fine-tuning job download the base model

Pull Request - State: closed - Opened by kkaneda 10 months ago

#77 - Make the fine-tuning job download the base model

Pull Request - State: closed - Opened by kkaneda 10 months ago

#76 - Fix a typo.

Pull Request - State: closed - Opened by jmuk 10 months ago

#76 - Fix a typo.

Pull Request - State: closed - Opened by jmuk 10 months ago

#75 - Set TrainingFile

Pull Request - State: closed - Opened by kkaneda 10 months ago

#75 - Set TrainingFile

Pull Request - State: closed - Opened by kkaneda 10 months ago

#74 - Publish helm charts through s3.

Pull Request - State: closed - Opened by jmuk 10 months ago

#73 - Exit(1) on error

Pull Request - State: closed - Opened by kkaneda 10 months ago

#73 - Exit(1) on error

Pull Request - State: closed - Opened by kkaneda 10 months ago

#72 - Package and push helm chart to ECR.

Pull Request - State: closed - Opened by jmuk 10 months ago - 1 comment

#72 - Package and push helm chart to ECR.

Pull Request - State: closed - Opened by jmuk 10 months ago - 1 comment

#71 - Fix parameter for fine-tuning image.

Pull Request - State: closed - Opened by jmuk 10 months ago

#71 - Fix parameter for fine-tuning image.

Pull Request - State: closed - Opened by jmuk 10 months ago

#70 - Fix ecr-login action parameters.

Pull Request - State: closed - Opened by jmuk 10 months ago

#70 - Fix ecr-login action parameters.

Pull Request - State: closed - Opened by jmuk 10 months ago

#69 - Fix the docker command parameter.

Pull Request - State: closed - Opened by jmuk 10 months ago

#69 - Fix the docker command parameter.

Pull Request - State: closed - Opened by jmuk 10 months ago

#68 - Update a job message field on job completion

Pull Request - State: closed - Opened by Ladicle 10 months ago - 1 comment

#68 - Update a job message field on job completion

Pull Request - State: closed - Opened by Ladicle 10 months ago - 1 comment

#67 - Use GetBaseModelPath

Pull Request - State: closed - Opened by kkaneda 10 months ago

#67 - Use GetBaseModelPath

Pull Request - State: closed - Opened by kkaneda 10 months ago

#66 - Implement all OpenAI job API status values

Pull Request - State: closed - Opened by Ladicle 10 months ago

#66 - Implement all OpenAI job API status values

Pull Request - State: closed - Opened by Ladicle 10 months ago

#65 - Make minor refactoring to run.go

Pull Request - State: closed - Opened by kkaneda 10 months ago

#65 - Make minor refactoring to run.go

Pull Request - State: closed - Opened by kkaneda 10 months ago