Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / NVIDIA/k8s-device-plugin issues and pull requests
#966 - Does time-slicing or MPS GPU-sharing supports a mode for processe to exclusively use GPU DRAM?
Issue -
State: open - Opened by so2bin 5 days ago
#965 - Enable labels for ClusterUUID and CliqueId
Pull Request -
State: open - Opened by ArangoGutierrez 6 days ago
#964 - Bump google.golang.org/grpc from 1.63.2 to 1.67.0
Pull Request -
State: open - Opened by dependabot[bot] 8 days ago
Labels: dependencies
#963 - Bump google.golang.org/grpc from 1.63.2 to 1.67.0
Pull Request -
State: open - Opened by dependabot[bot] 8 days ago
Labels: maintenance, dependencies
#962 - Enable hostNetwork = true
Issue -
State: open - Opened by jihuiyang-x 10 days ago
- 2 comments
#961 - Bump nvidia/cuda from 12.6.0-base-ubi9 to 12.6.1-base-ubi9 in /deployments/container
Pull Request -
State: open - Opened by dependabot[bot] 13 days ago
- 1 comment
Labels: dependencies, docker
#960 - How to Select a Specific MIG Instance in a Container
Issue -
State: closed - Opened by HdSedighi 15 days ago
- 1 comment
#959 - Bump the k8sio group across 1 directory with 3 updates
Pull Request -
State: open - Opened by dependabot[bot] 15 days ago
Labels: testing, dependencies
#958 - Bump the k8sio group across 1 directory with 4 updates
Pull Request -
State: open - Opened by dependabot[bot] 15 days ago
Labels: maintenance, dependencies
#957 - Bump google.golang.org/grpc from 1.63.2 to 1.66.2
Pull Request -
State: closed - Opened by dependabot[bot] 15 days ago
- 1 comment
Labels: maintenance, dependencies
#956 - Bump the k8sio group across 1 directory with 4 updates
Pull Request -
State: open - Opened by dependabot[bot] 15 days ago
Labels: dependencies
#955 - Bump google.golang.org/grpc from 1.63.2 to 1.66.2
Pull Request -
State: closed - Opened by dependabot[bot] 15 days ago
- 1 comment
Labels: dependencies
#954 - Containers that use cuda images in k8s do not have gpu resources, but the process id can be seen using nvidia-smi
Issue -
State: open - Opened by ZYWNB666 18 days ago
- 2 comments
#952 - Remove namespace field from cluster-scoped `ClusterRole` and `ClusterRoleBinding` resources
Pull Request -
State: open - Opened by alfredkrohmer 21 days ago
#951 - Bump golang from 1.22.6 to 1.23.1 in /deployments/devel
Pull Request -
State: open - Opened by dependabot[bot] 21 days ago
Labels: maintenance, dependencies
#950 - Bump golang from 1.22.6 to 1.23.1 in /deployments/devel
Pull Request -
State: open - Opened by dependabot[bot] 21 days ago
Labels: dependencies, docker
#949 - Bump golang.org/x/net from 0.27.0 to 0.29.0
Pull Request -
State: open - Opened by dependabot[bot] 22 days ago
Labels: maintenance, dependencies
#948 - Bump golang.org/x/mod from 0.19.0 to 0.21.0
Pull Request -
State: open - Opened by dependabot[bot] 22 days ago
Labels: maintenance, dependencies
#947 - Bump golang.org/x/mod from 0.20.0 to 0.21.0
Pull Request -
State: open - Opened by dependabot[bot] 22 days ago
Labels: dependencies
#946 - Bump golang.org/x/net from 0.27.0 to 0.29.0
Pull Request -
State: closed - Opened by dependabot[bot] 22 days ago
Labels: dependencies
#945 - Xid 68 marked as use app error, different than official NVIDIA Xid doc
Issue -
State: open - Opened by gyuho 24 days ago
#944 - Bump nvidia/cuda from 12.6.0-base-ubuntu22.04 to 12.6.1-base-ubuntu22.04 in /deployments/container
Pull Request -
State: open - Opened by dependabot[bot] 24 days ago
- 1 comment
Labels: maintenance, dependencies
#943 - Wrong family type detected
Issue -
State: open - Opened by Madfish5415 27 days ago
#942 - Bump NVIDIA/holodeck from 0.2.3 to 0.2.4
Pull Request -
State: open - Opened by dependabot[bot] 27 days ago
Labels: dependencies, github_actions
#941 - general protection fault, probably for non-canonical address 0x25b5f6bb1a24827e: 0000 [#1] SMP NOPTI
Issue -
State: open - Opened by zsksy123 27 days ago
- 1 comment
#940 - Bump google.golang.org/grpc from 1.63.2 to 1.66.0
Pull Request -
State: closed - Opened by dependabot[bot] 29 days ago
- 1 comment
Labels: maintenance, dependencies
#939 - Bump google.golang.org/grpc from 1.63.2 to 1.66.0
Pull Request -
State: closed - Opened by dependabot[bot] 29 days ago
- 1 comment
Labels: dependencies
#938 - Bump github.com/onsi/ginkgo/v2 from 2.20.1 to 2.20.2 in /tests
Pull Request -
State: open - Opened by dependabot[bot] 29 days ago
Labels: testing, dependencies
#937 - Bump github.com/onsi/gomega from 1.34.1 to 1.34.2 in /tests
Pull Request -
State: open - Opened by dependabot[bot] 29 days ago
Labels: testing, dependencies
#936 - Bump github.com/gruntwork-io/terratest from 0.47.0 to 0.47.1 in /tests
Pull Request -
State: open - Opened by dependabot[bot] 29 days ago
Labels: testing, dependencies
#935 - How to exclude some specific GPUs?
Issue -
State: open - Opened by manhtukhang about 1 month ago
#933 - Bump slackapi/slack-github-action from 1.26.0 to 1.27.0
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies, github_actions
#932 - Bump github.com/mittwald/go-helm-client from 0.12.10 to 0.12.13 in /tests
Pull Request -
State: open - Opened by dependabot[bot] about 1 month ago
Labels: testing, dependencies
#931 - [no-relnote] Use nbody as test job
Pull Request -
State: closed - Opened by elezar about 1 month ago
#930 - Switch to CUDA ubi9 base image
Pull Request -
State: closed - Opened by elezar about 1 month ago
#929 - Is there any way in the meantime to request more than 1 replica from each GPU in my node?
Issue -
State: open - Opened by wei1793786487 about 1 month ago
- 2 comments
#928 - [Docs] Add catalog of labels to README
Pull Request -
State: open - Opened by chipzoller about 1 month ago
- 9 comments
#927 - [no-relnote] Don't use animated emoji for slack notification
Pull Request -
State: closed - Opened by elezar about 1 month ago
#926 - [no-relnote] Address integer overflow linting errors
Pull Request -
State: closed - Opened by elezar about 1 month ago
Labels: maintenance
#925 - [no-relnote] Use ubi8 image in tests
Pull Request -
State: closed - Opened by elezar about 1 month ago
#924 - [no-relnote] Fix typo in log collection tooling
Pull Request -
State: closed - Opened by elezar about 1 month ago
#923 - [no-relnote] Fix typo in log collection tooling
Pull Request -
State: closed - Opened by elezar about 1 month ago
#922 - Bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.1 in /tests
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: testing, dependencies
#921 - Bump github.com/matryer/moq from 0.3.4 to 0.5.0 in /deployments/devel
Pull Request -
State: open - Opened by dependabot[bot] about 1 month ago
Labels: maintenance, dependencies
#920 - Bump github.com/matryer/moq from 0.3.4 to 0.5.0 in /deployments/devel
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies, go
#917 - Addressing several security vulnerabilities in the version v0.16.2
Issue -
State: open - Opened by thle40 about 1 month ago
- 1 comment
#916 - Running device plugin with mixed mode MIG without SYS_ADMIN
Issue -
State: open - Opened by vishnukarthikl about 1 month ago
- 2 comments
#915 - [no-relnote] Add Slack notification for e2e test failures
Pull Request -
State: closed - Opened by ArangoGutierrez about 1 month ago
#911 - Bump the k8sio group with 4 updates
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
- 1 comment
Labels: dependencies
#910 - Bump github.com/urfave/cli/v2 from 2.27.3 to 2.27.4
Pull Request -
State: open - Opened by dependabot[bot] about 1 month ago
Labels: maintenance, dependencies
#909 - Bump the k8sio group with 4 updates
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
- 1 comment
Labels: maintenance, dependencies
#908 - Bump the k8sio group in /tests with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
- 1 comment
Labels: testing, dependencies
#907 - Use SELinux package to apply context
Pull Request -
State: closed - Opened by empovit about 1 month ago
- 1 comment
#904 - Bump golang from 1.22.6 to 1.23.0 in /deployments/devel
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: maintenance, dependencies
#903 - Bump golang from 1.22.6 to 1.23.0 in /deployments/devel
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: dependencies, docker
#899 - Add --no-cleanup-on-exit option to GFD
Pull Request -
State: open - Opened by elezar about 2 months ago
#898 - modify the log level of some errors
Pull Request -
State: open - Opened by googs1025 about 2 months ago
- 1 comment
#897 - Bump github.com/mittwald/go-helm-client from 0.12.10 to 0.12.12 in /tests
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: testing, dependencies
#896 - Bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.0 in /tests
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: testing, dependencies
#895 - Bump golang.org/x/net from 0.27.0 to 0.28.0
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: dependencies
#894 - Bump golang.org/x/net from 0.27.0 to 0.28.0
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: maintenance, dependencies
#893 - Bump golang.org/x/mod from 0.19.0 to 0.20.0
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: maintenance, dependencies
#834 - Why there is no GPU resource allocatable on a GPU cloud instance
Issue -
State: open - Opened by shizhouhu 2 months ago
- 5 comments
#833 - Docker image tag v0.9.0-ubuntu20.04
Issue -
State: open - Opened by yuliyan-valchev-ft 2 months ago
- 3 comments
#832 - README fixes/enhancements
Pull Request -
State: closed - Opened by chipzoller 2 months ago
- 11 comments
#831 - Bump google.golang.org/grpc from 1.63.2 to 1.65.0
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
- 1 comment
Labels: maintenance, dependencies
#813 - Use reduced ubi8 base image
Pull Request -
State: closed - Opened by elezar 3 months ago
- 2 comments
#810 - Bump google.golang.org/grpc from 1.63.2 to 1.65.0
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
- 3 comments
Labels: dependencies
#808 - nvidia-device-plugin-daemonset CrashLoopBackoff in Truenas Scale Dragonfish
Issue -
State: open - Opened by jmkgreen 3 months ago
- 8 comments
#769 - I want to deploy three models, one large language model occupying one GPU, one embedding model and one re-ranking model sharing one GPU, How can I do it?
Issue -
State: open - Opened by Flynn-Zh 4 months ago
- 4 comments
#764 - MPS Memory limits confusion
Issue -
State: open - Opened by RonanQuigley 4 months ago
- 1 comment
Labels: lifecycle/stale
#763 - Daemonset container and initContainer can run only in privleged mode for daemonset-mps-control-daemon
Issue -
State: open - Opened by kndoni 4 months ago
- 3 comments
Labels: lifecycle/stale
#762 - Failed to send command to MPS daemon
Issue -
State: open - Opened by RonanQuigley 4 months ago
- 4 comments
#746 - Change GFD repository image V0.15.0 Helm
Issue -
State: open - Opened by YFrendo 4 months ago
- 9 comments
Labels: lifecycle/stale
#745 - MicroShift 4.15+ support with helm charts
Pull Request -
State: open - Opened by arthur-r-oliveira 4 months ago
- 3 comments
#744 - “Wrong” units for `nvidia.com/gpu.memory`
Issue -
State: closed - Opened by sftim over 1 year ago
- 2 comments
Labels: lifecycle/stale
#743 - GFD timestamp in label
Issue -
State: closed - Opened by sftim over 1 year ago
- 2 comments
Labels: lifecycle/stale
#742 - `nvidia.com/gpu.memory` capacity
Issue -
State: closed - Opened by faust64 over 1 year ago
- 3 comments
Labels: lifecycle/stale
#741 - GKE support
Issue -
State: closed - Opened by lmyslinski over 1 year ago
- 5 comments
Labels: lifecycle/stale
#740 - How can I run gpu-feature-discovery without privileged mode?
Issue -
State: open - Opened by yangliping about 1 year ago
- 4 comments
Labels: lifecycle/stale
#739 - List of possible values for `nvidia.com/gpu.product`?
Issue -
State: closed - Opened by romilbhardwaj about 1 year ago
- 4 comments
Labels: lifecycle/stale
#738 - (Question) Value for `nvidia.com/gpu.count` when using MIG with mixed strategy
Issue -
State: closed - Opened by sia-hk about 1 year ago
- 2 comments
Labels: lifecycle/stale
#737 - High severity CVEs on latest gpu-feature-discovery(v0.8.2)
Issue -
State: closed - Opened by sakshisharma84 12 months ago
- 2 comments
Labels: lifecycle/stale
#736 - Multiple device types detected:
Issue -
State: open - Opened by sipvoip 11 months ago
- 6 comments
#735 - `nvidia-docker` update needed
Issue -
State: closed - Opened by ikad95 9 months ago
- 3 comments
Labels: lifecycle/stale
#734 - feature discovery worker pods scheduled on wrong nodes
Issue -
State: closed - Opened by garymm 7 months ago
- 5 comments
Labels: lifecycle/stale
#733 - cannot generate nvidia.com/gpu.xxx labels on node
Issue -
State: closed - Opened by double12gzh 7 months ago
- 7 comments
Labels: lifecycle/stale
#732 - Does not set the right value for nvidia.com/gpu.replicas label when timesharing is enabled
Issue -
State: closed - Opened by monirul 4 months ago
- 2 comments
Labels: lifecycle/stale
#731 - nvidia.com/gpu.product and nvidia.com/gpu.replicas does not reflect heterogeneous device setup
Issue -
State: closed - Opened by Suckzoo almost 2 years ago
- 10 comments
Labels: lifecycle/stale
#729 - Incorrect deviceClassWhitelist configuration is provided
Issue -
State: closed - Opened by fprzewozny 4 months ago
- 3 comments
Labels: lifecycle/stale
#712 - K3S - Failed to start plugin: error waiting for MPS daemon
Issue -
State: closed - Opened by FrsECM 5 months ago
- 8 comments
Labels: lifecycle/stale
#711 - The plugin has already support nvlink?
Issue -
State: open - Opened by baoervsdier 5 months ago
- 2 comments
Labels: lifecycle/stale
#701 - helm: can't upgrade to 0.15.0 in place due to daemonset label selector change
Issue -
State: closed - Opened by mrparkers 5 months ago
- 5 comments
Labels: lifecycle/stale
#699 - Allow /dev/shm size to be specified
Pull Request -
State: closed - Opened by elezar 5 months ago
- 2 comments
Labels: lifecycle/stale
#690 - Add podLabels variable for all daemonsets.
Pull Request -
State: closed - Opened by sidewinder12s 5 months ago
- 5 comments
#685 - mps server error Failed to start : invalid argument
Issue -
State: closed - Opened by aphrodite1028 5 months ago
- 2 comments
Labels: lifecycle/stale
#669 - update nodelabel for config-manger k8s-device-plugin continuing printing error msg, not stop
Issue -
State: closed - Opened by aphrodite1028 5 months ago
- 3 comments
Labels: lifecycle/stale
#652 - Back-off restarting failed container nvidia-device-plugin-ctr
Issue -
State: closed - Opened by A-Akhil 6 months ago
- 5 comments
Labels: lifecycle/stale
#605 - Access NVIDIA GPUs in K8s in a non-privileged container
Issue -
State: open - Opened by pintohutch 7 months ago
- 4 comments
#519 - GPU health status exposure and remediation methods
Issue -
State: open - Opened by aidan-canva 8 months ago
- 1 comment
Labels: question