Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / AliyunContainerService/gpushare-scheduler-extender issues and pull requests
#232 - [QUESTION] Does gpushare support deploying multiple different schedulers together?
Issue -
State: open - Opened by hijeffwu about 1 month ago
#231 - With multiple GPUs, how do I specify which GPU to virtualize?
Issue -
State: open - Opened by MoeXiaoHei 4 months ago
#230 - After a pod that requested a GPU has run for a while, the GPU can no longer be found inside the container
Issue -
State: open - Opened by wanghaowish 5 months ago
- 1 comment
#229 - After running for a year, creating a new pod fails with: failed bind with extender at URL http://127.0.0.1:32766/gpushare-scheduler/bind, code 500
Issue -
State: open - Opened by klvchen 6 months ago
#228 - GPUs cannot be allocated correctly when a node has multiple GPUs
Issue -
State: open - Opened by hotbaby 7 months ago
#227 - Replica issue
Issue -
State: open - Opened by AndrewOYLK 8 months ago
#226 - After installing the plugin on k8s, cluster GPU resources are not detected
Issue -
State: open - Opened by ferris-cx 8 months ago
- 1 comment
#225 - Version compatibility issue
Issue -
State: open - Opened by ferris-cx 8 months ago
- 1 comment
#224 - [AKS] kube-scheduler static POD not running for Aliyun GPU Scheduler Extender
Issue -
State: open - Opened by dsatizabal 9 months ago
- 1 comment
#223 - kubelet version issue
Issue -
State: open - Opened by longcheung123 9 months ago
#222 - Can this solution only be used on Alibaba Cloud machines?
Issue -
State: open - Opened by wolgod 9 months ago
#221 - Remove DeletionTimestamp!=nil condition in IsCompletePod function
Pull Request -
State: open - Opened by zhangbc97 about 1 year ago
#220 - ALIYUN_COM_GPU_MEM_IDX in the annotation is different than ALIYUN_COM_GPU_MEM_IDX inside the pod
Issue -
State: open - Opened by wokalski about 1 year ago
#219 - Problems currently encountered when using this project
Issue -
State: open - Opened by freelizhun over 1 year ago
#218 - Possible bug in the scheduling layer: a pod requesting 8G is created successfully even though the device has at most 7G
Issue -
State: open - Opened by hiahia121 over 1 year ago
#217 - Changing the base unit for GPU memory requests to MiB has no effect
Issue -
State: open - Opened by harrymore over 1 year ago
#216 - Is this project still maintained?
Issue -
State: open - Opened by zhaizhch over 1 year ago
- 1 comment
#215 - Support for Horizontal Pod Autoscaling (HPA) with GPU Pods?
Issue -
State: open - Opened by tobq over 1 year ago
- 1 comment
#214 - feat: adjust k8s to 1.28
Pull Request -
State: open - Opened by Yobol over 1 year ago
#213 - If a machine has two GPUs and the first GPU's memory is full, will subsequent tasks be scheduled onto the other GPU?
Issue -
State: open - Opened by vicmeng over 1 year ago
#212 - How do I specify two GPUs for multi-GPU training?
Issue -
State: open - Opened by vicmeng over 1 year ago
- 1 comment
#211 - Does this GPU sharing plugin support monitoring with dcgm-exporter?
Issue -
State: open - Opened by db-root over 1 year ago
- 5 comments
#210 - Back-off restarting failed container: gpushare-device-plugin-ds-xxxxx
Issue -
State: open - Opened by JiangLingJun over 1 year ago
- 1 comment
#209 - Hello, the kubectl logs command does not work on GPU containers but works on regular containers
Issue -
State: closed - Opened by 140ai almost 2 years ago
#208 - GPU cores scheduling
Issue -
State: open - Opened by valafon almost 2 years ago
#207 - plugin does not evenly distribute the pods.
Issue -
State: open - Opened by valafon almost 2 years ago
- 3 comments
#206 - docs: update install.md
Pull Request -
State: closed - Opened by KunWuLuan almost 2 years ago
#205 - Not able to use gpushare-scheduler-extender on EKS cluster with Kubernetes v1.24
Issue -
State: open - Opened by suchisur almost 2 years ago
- 2 comments
#204 - Optimize the loop that searches for available devices
Pull Request -
State: open - Opened by wangzhipeng almost 2 years ago
- 1 comment
#203 - Bump golang.org/x/net from 0.1.1-0.20221027164007-c63010009c80 to 0.7.0
Pull Request -
State: open - Opened by dependabot[bot] almost 2 years ago
Labels: dependencies
#202 - GPU memory does not match the actual amount
Issue -
State: open - Opened by SakuraAxy about 2 years ago
- 1 comment
#201 - After repeatedly deleting and creating Pods, newly created Pods get stuck in Pending state
Issue -
State: open - Opened by liufangpeng about 2 years ago
#200 - Question about the scheduler-policy-config.yaml file
Issue -
State: closed - Opened by liufangpeng about 2 years ago
#199 - Problem using a custom image with kubeflow 1.6.1
Issue -
State: open - Opened by 631068264 about 2 years ago
- 3 comments
#198 - nodeinfo.go allocateGPUID method optimization
Issue -
State: open - Opened by wangxiaoyang-dev about 2 years ago
#197 - fix: gpushare concurrent map read write
Pull Request -
State: open - Opened by swartz-k about 2 years ago
#196 - k3s services not started scheduler exited: stat /etc/kubernetes/scheduler.conf: no such file or directory
Issue -
State: open - Opened by RotemAmergi about 2 years ago
- 1 comment
#195 - fix: circleci version
Pull Request -
State: closed - Opened by swartz-k about 2 years ago
- 1 comment
#194 - feat: upgrade golang to 1.19 and k8s to 1.25
Pull Request -
State: closed - Opened by swartz-k about 2 years ago
#193 - fix: controller process item logic
Pull Request -
State: closed - Opened by swartz-k about 2 years ago
#192 - Controller processNextWorkItem return false when err == nil
Issue -
State: closed - Opened by swartz-k about 2 years ago
#191 - Wrong GPU ID
Issue -
State: closed - Opened by tintranvan over 2 years ago
- 4 comments
#190 - How to share arithmetical force of a gpu?
Issue -
State: open - Opened by joeevon over 2 years ago
- 1 comment
#189 - trivy image scan lists critical and high vulnerability against latest image k8s-gpushare-schd-extender:1.11-d170d8a
Issue -
State: open - Opened by carlwang87 over 2 years ago
#188 - gpu pods are in pending states despite of enough gpu resource
Issue -
State: closed - Opened by mf-giwoong-lee over 2 years ago
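#187 - After a pod completes, the plugin does not update the GPU pool promptly. When multiple pending pods are queued for resources, the last pod waits until flushUnschedulablePodsLeftover before resources are reallocated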
Issue -
State: open - Opened by huiyangz over 2 years ago
#186 - Two GPUs are detected, but after requests to /gpushare-scheduler/filter some containers are always scheduled onto only one of them
Issue -
State: closed - Opened by 1003111014 over 2 years ago
- 1 comment
#185 - Error when a pod contains multiple containers: "unknown device id: no-gpu-has-5MiB-to-run"
Issue -
State: open - Opened by serend1p1ty over 2 years ago
- 1 comment
#184 - On a single machine with two GPUs, the scheduler reports binding to different GPUs, but everything is actually scheduled onto one GPU
Issue -
State: open - Opened by 1003111014 over 2 years ago
- 1 comment
#183 - Update ADOPTERS.md
Pull Request -
State: closed - Opened by ftx0day over 2 years ago
- 1 comment
#182 - Any instruction/template to help define customized GPU scheduler policy?
Issue -
State: open - Opened by blackjack2015 over 2 years ago
#181 - Device list strategy - mounts
Issue -
State: open - Opened by xhejtman over 2 years ago
#180 - Microk8s installation instructions
Issue -
State: open - Opened by agnoam over 2 years ago
- 2 comments
#179 - How do I use ALIYUN_COM_GPU_SPECIAL_IDX to specify the host to run on? Using the hostname has no effect
Issue -
State: closed - Opened by 1003111014 over 2 years ago
#178 - gpushare detects two GPUs, but only one is used during allocation; the other cannot be used
Issue -
State: closed - Opened by 1003111014 over 2 years ago
#177 - gpushare detects two GPUs, but only one is used during allocation; the other cannot be used
Issue -
State: closed - Opened by 1003111014 over 2 years ago
- 2 comments
#176 - Create device-plugin-ds.yaml, Daemon don't create device-plugin-ds pod
Issue -
State: closed - Opened by echo3215987 almost 3 years ago
- 1 comment
#175 - not able to find dev with index
Issue -
State: open - Opened by southquist almost 3 years ago
- 1 comment
#174 - how to share multiple gpus?
Issue -
State: closed - Opened by mengwanguc almost 3 years ago
- 3 comments
#173 - gpushare-device-plugin pod fails to start
Issue -
State: closed - Opened by southquist almost 3 years ago
- 6 comments
#172 - Update install instructions for kubernetes v1.23+
Pull Request -
State: closed - Opened by noranraskin almost 3 years ago
- 1 comment
#171 - Pods FailedScheduling with Post "http://127.0.0.1:32766/gpushare-scheduler/filter": EOF
Issue -
State: open - Opened by noranraskin almost 3 years ago
- 5 comments
#170 - gpushare-schd-extender in Pending State
Issue -
State: open - Opened by m1nish1208 about 3 years ago
- 7 comments
#169 - gpushare-scheduler-extender http server panic
Issue -
State: closed - Opened by fullpolarfox about 3 years ago
- 2 comments
#168 - Is using this in k3s supported?
Issue -
State: open - Opened by jiaminxu about 3 years ago
- 3 comments
#167 - Why I can not use prioritise
Issue -
State: open - Opened by chenwenyan about 3 years ago
#166 - policy-config-file is no longer supported by kubernetes starting by v1.23
Issue -
State: closed - Opened by jeffguorg about 3 years ago
- 10 comments
#165 - Is the plugin compatible with new kubernetes version and new NVIDIA drivers versions ?
Issue -
State: open - Opened by zhangxingdeppon about 3 years ago
- 3 comments
#164 - Optimize the device selection strategy and the handling of user-specified device IDs
Pull Request -
State: open - Opened by chenrulongmaster about 3 years ago
- 1 comment
#163 - Integration with Rancher
Issue -
State: open - Opened by 201508876PMH over 3 years ago
- 8 comments
#162 - Add didiglobal to the list of adopters
Pull Request -
State: closed - Opened by tongchao199 over 3 years ago
#161 - gpushare-device-plugin STATUS is Pending
Issue -
State: open - Opened by Jacoobr over 3 years ago
- 2 comments
#160 - gpushare-device-plugin STATUS is Pending
Issue -
State: closed - Opened by Jacoobr over 3 years ago
- 1 comment
#159 - The number of GPUs is detected, but GPU memory is 0
Issue -
State: open - Opened by gxwangit over 3 years ago
- 1 comment
#158 - 8G is allocated, but actual usage is not limited to 8G
Issue -
State: open - Opened by momomobinx over 3 years ago
- 10 comments
#157 - How to use this project in AKS or other cloud providers
Issue -
State: open - Opened by 2811299 over 3 years ago
- 1 comment
#156 - listAndWatch ended unexpectedly for device plugin aliyun.com/gpu-mem with error rpc error: code = ResourceExhausted desc = grpc: received message larger than max (5547796 vs. 4194304)
Issue -
State: open - Opened by yingjunxu over 3 years ago
- 3 comments
#155 - gpu-operator support
Issue -
State: open - Opened by xhejtman over 3 years ago
#154 - How should I apply this plugin without docker layer but just use the containerd ?
Issue -
State: open - Opened by ddddddddlang over 3 years ago
- 2 comments
#153 - gpushare-device plugin daemonset is not working
Issue -
State: closed - Opened by riyasoni5990 over 3 years ago
- 9 comments
#152 - fix bug: bind error when some pod fields are unknown
Pull Request -
State: closed - Opened by happy2048 almost 4 years ago
#151 - What does `Pending(Allocated)` when using `kubectl inspect gpushare`?
Issue -
State: open - Opened by cailun01 almost 4 years ago
- 3 comments
#150 - Kubernetes removing docker as a runtime in late 2021
Issue -
State: open - Opened by mmenbawy almost 4 years ago
- 5 comments
#149 - Insufficient aliyun.com/gpu-mem.
Issue -
State: closed - Opened by 2811299 almost 4 years ago
- 2 comments
#148 - I CAN'T GET GPU INFORMATION. The pod gpushare-device-plugin-ds is pending.
Issue -
State: open - Opened by DoubleChen-cc almost 4 years ago
- 2 comments
#147 - Not showing the exact amount of memory
Issue -
State: open - Opened by rajitha1998 almost 4 years ago
#146 - Is there a GPU memory exporter that can be used for Prometheus monitoring?
Issue -
State: open - Opened by shfpflyawei almost 4 years ago
#145 - Support in EKS [Help]
Issue -
State: open - Opened by pen-pal almost 4 years ago
- 15 comments
#144 - Update ADOPTERS.md
Pull Request -
State: closed - Opened by 70data about 4 years ago
#143 - Device plugin matching fails when a single pod has multiple GPU containers
Issue -
State: open - Opened by baozhiming about 4 years ago
- 7 comments
#142 - How to modify scheduler configuration when I start k8s with binary system
Issue -
State: open - Opened by huangleilz about 4 years ago
- 1 comment
#141 - aliyun.com/gpu-mem is 0
Issue -
State: open - Opened by alanyanglong about 4 years ago
- 3 comments
#140 - Unable to schedule pod with: Insufficient aliyun.com/gpu-mem
Issue -
State: open - Opened by k0nstantinv about 4 years ago
- 3 comments
#139 - some questions about the file "kube-scheduler.yaml"
Issue -
State: closed - Opened by AlvL1225 about 4 years ago