Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / koordinator-sh/koordinator issues and pull requests

#2180 - scheduler: elastic quota ignore terminating pod immediately

Pull Request - State: closed - Opened by TaoYang526 3 months ago - 11 comments
Labels: lgtm, approved, size/M, ok-to-test

#2179 - [proposal] scheduler: elastic quota ignore terminating pod immediately

Issue - State: closed - Opened by TaoYang526 3 months ago - 1 comment
Labels: kind/proposal

#2178 - scheduler: assure only gpu will allocate by topology

Pull Request - State: closed - Opened by ZiMengSheng 3 months ago - 4 comments
Labels: lgtm, approved, size/S

#2177 - scheduler: fix pod update when pod creation is before quota creation

Pull Request - State: closed - Opened by shaloulcy 3 months ago - 3 comments
Labels: lgtm, approved, size/M

#2176 - scheduler: move gang OnceSatified to gangGroupInfo

Pull Request - State: closed - Opened by buptcozy 3 months ago - 6 comments
Labels: lgtm, approved, size/M

#2175 - scheduler: add e2e trace metrics

Pull Request - State: open - Opened by ZiMengSheng 3 months ago - 2 comments
Labels: size/L

#2174 - scheduler: support amd.com/gpu

Pull Request - State: closed - Opened by ZiMengSheng 3 months ago - 4 comments
Labels: lgtm, approved

#2173 - koordlet: fix unsafe conversion in net_cls

Pull Request - State: closed - Opened by saintube 3 months ago - 3 comments
Labels: lgtm, approved, size/M

#2172 - koord-manager: support resource amplification config, cpu, memory and other resource

Pull Request - State: closed - Opened by yangfeiyu20102011 3 months ago - 4 comments
Labels: lgtm, approved, size/XL

#2171 - [BUG] DCGM Metrics Not Supported When Use Koordinator GPU Allocate Logic

Issue - State: open - Opened by ZiMengSheng 3 months ago - 1 comment
Labels: area/koord-scheduler, kind/bug, lifecycle/stale

#2170 - [question] Recommendation controller has been open-source?

Issue - State: closed - Opened by slm940208 3 months ago - 2 comments
Labels: kind/question

#2169 - scheduler: remove deepCopy of reservationInfo

Pull Request - State: open - Opened by ZiMengSheng 4 months ago - 3 comments
Labels: size/L, lifecycle/stale

#2168 - koord-descheduler: support node selector for each descheduler profile

Pull Request - State: open - Opened by songtao98 4 months ago - 4 comments
Labels: do-not-merge/hold, size/L, lifecycle/stale

#2167 - scheduler: support scheduler config v1

Pull Request - State: closed - Opened by AdrianMachao 4 months ago - 8 comments
Labels: size/XXL, lgtm, approved

#2166 - [question] why function EstimateNode use node.Status.Allocatable by default

Issue - State: open - Opened by qwjhq 4 months ago - 2 comments
Labels: area/koord-scheduler, kind/question

#2165 - [BUG] v1beta3 doesn't support PreEnqueue extension point !!

Issue - State: closed - Opened by ZiMengSheng 4 months ago - 1 comment
Labels: area/koord-scheduler, kind/bug

#2164 - [proposal] cpu burst should also burst some pods when the node util nears threshold

Issue - State: open - Opened by zwzhang0107 4 months ago
Labels: help wanted, area/koordlet, kind/proposal

#2163 - scheduler: support reservation ignored and nodenumaresource preemption

Pull Request - State: closed - Opened by saintube 4 months ago - 5 comments
Labels: size/XXL, lgtm, approved

#2162 - koordlet: add record events

Pull Request - State: closed - Opened by kangclzjc 4 months ago - 3 comments
Labels: lgtm, approved, size/L

#2161 - [proposal] How to Better Integrate Mid-Tier Runtime Hooks with Batch Resources?

Issue - State: open - Opened by tan90github 4 months ago - 3 comments
Labels: area/koordlet, area/koord-manager, kind/proposal, area/api-machinery, lifecycle/stale

#2160 - koord-descheduler: fixes namespace object limiter

Pull Request - State: closed - Opened by songtao98 4 months ago - 4 comments
Labels: lgtm, approved, size/L

#2159 - [WIP] koordlet: perform operations only under the specified cpuburst strategy and improve config logging

Pull Request - State: closed - Opened by yangfeiyu20102011 4 months ago - 2 comments
Labels: do-not-merge/work-in-progress, size/S

#2158 - [BUG] NodeNUMAResource should support allocating reserved cpus when CPU amplification enabled

Issue - State: open - Opened by saintube 4 months ago
Labels: area/koord-scheduler, kind/bug

#2157 - ci: add disk GC for E2E jobs

Pull Request - State: closed - Opened by saintube 4 months ago - 2 comments
Labels: lgtm, approved, size/M

#2156 - koord-descheduler: update k8s descheduler to 0.28.0

Pull Request - State: closed - Opened by songtao98 4 months ago - 2 comments
Labels: lgtm, approved, size/S

#2155 - scheduler: add coscheduling preEnqueue

Pull Request - State: closed - Opened by AdrianMachao 4 months ago - 5 comments
Labels: lgtm, approved, size/L

#2154 - scheduler: fix too many ut log

Pull Request - State: closed - Opened by ZiMengSheng 4 months ago - 3 comments
Labels: lgtm, approved, size/S

#2153 - koordlet: add some information to improve log readability

Pull Request - State: closed - Opened by yangfeiyu20102011 4 months ago - 2 comments
Labels: size/XS, lgtm, approved

#2152 - koord-manager: add unallocated resource into mid resource.

Pull Request - State: closed - Opened by tan90github 4 months ago - 11 comments
Labels: lgtm, approved, size/L

#2151 - scheduler: fix panic when NUMANode equals -1

Pull Request - State: closed - Opened by ZiMengSheng 4 months ago - 5 comments
Labels: lgtm, approved, size/S

#2150 - [proposal] Reservation support binding to scheduled pods

Issue - State: open - Opened by saintube 4 months ago
Labels: area/koord-scheduler, area/koord-manager, kind/proposal

#2149 - koord-manager: fix invalid assignments in the resource amplification …

Pull Request - State: closed - Opened by yangfeiyu20102011 4 months ago - 5 comments
Labels: lgtm, approved, size/S

#2148 - [BUG] Pods been deleted from gang.Children when recreated with same name.

Issue - State: open - Opened by KunWuLuan 4 months ago - 3 comments
Labels: area/koord-scheduler, kind/bug, lifecycle/stale

#2147 - chore(deps): bump golangci/golangci-lint-action from 5.3.0 to 6.1.0

Pull Request - State: open - Opened by dependabot[bot] 4 months ago - 3 comments
Labels: size/XS, dependencies

#2145 - Update OWNERS_ALIASES

Pull Request - State: closed - Opened by hormes 4 months ago - 2 comments
Labels: size/XS

#2142 - koord-descheduler: LowNodeLoad check if evicted pod can cause new node over utilized

Pull Request - State: closed - Opened by songtao98 4 months ago - 3 comments
Labels: lgtm, approved, size/L

#2140 - [proposal] Coscheduling move some BeforePreFilter quick check into PreEnqueueExtension

Issue - State: open - Opened by ZiMengSheng 4 months ago - 5 comments
Labels: area/koord-scheduler, kind/proposal

#2139 - scheduler: add reservation preemption

Pull Request - State: closed - Opened by saintube 4 months ago - 8 comments
Labels: size/XXL, lgtm, approved

#2123 - koordlet: Add resctrl runtime hook for pod level

Pull Request - State: closed - Opened by kangclzjc 5 months ago - 4 comments
Labels: size/XXL, lgtm, approved

#2122 - feat(deps): bump go.opentelemetry.io/contrib/instrumentation/google.golang.org/grpc/otelgrpc from 0.35.0 to 0.46.0

Pull Request - State: open - Opened by dependabot[bot] 5 months ago - 1 comment
Labels: size/M, dependencies

#2119 - [proposal] koord-descheduler: Optimize the reassessment of single-node resource left

Issue - State: closed - Opened by zwForrest 5 months ago - 1 comment
Labels: kind/proposal, area/koord-descheduler

#2117 - [BUG] The parent queue has over the maximum

Issue - State: open - Opened by hiwangzhihui 5 months ago - 2 comments
Labels: area/koord-scheduler, kind/bug

#2116 - [BUG] EnqueueRequestForNode should not only watch update of node's allocatable

Issue - State: closed - Opened by songtao98 5 months ago - 5 comments
Labels: good first issue, help wanted, area/koord-manager, kind/bug

#2113 - [question] resource reservation does not consider scale up via cluster-autoscaler

Issue - State: open - Opened by lukasmrtvy 5 months ago - 7 comments
Labels: area/koord-scheduler, kind/question

#2108 - [question] Question about reservation objects and namespace quotas

Issue - State: closed - Opened by zj619 5 months ago - 3 comments
Labels: kind/question, lifecycle/stale

#2106 - chore(deps): bump docker/build-push-action from 5 to 6

Pull Request - State: open - Opened by dependabot[bot] 5 months ago - 2 comments
Labels: size/XS, dependencies, lifecycle/stale

#2104 - [question] elastic quota allow-lent-resource does not take effect

Issue - State: closed - Opened by zj619 5 months ago - 3 comments
Labels: area/koord-scheduler, kind/question, lifecycle/stale

#2082 - chore(deps): bump goreleaser/goreleaser-action from 5 to 6

Pull Request - State: open - Opened by dependabot[bot] 6 months ago - 2 comments
Labels: size/XS, dependencies, lifecycle/stale

#2078 - [BUG] Elastic Quota Management not working as expected

Issue - State: closed - Opened by taraszka 6 months ago - 4 comments
Labels: area/koord-scheduler, area/koord-manager, kind/question, kind/bug, lifecycle/stale

#2075 - Add license scan report and status

Pull Request - State: open - Opened by fossabot 6 months ago - 4 comments
Labels: size/XS, do-not-merge/hold

#2064 - scheduler: make gang quickCheck earlier

Pull Request - State: closed - Opened by ZiMengSheng 6 months ago - 4 comments
Labels: lgtm, approved, size/S

#2063 - scheduler: add reservation level event

Pull Request - State: closed - Opened by zwzhang0107 6 months ago - 4 comments
Labels: lgtm, approved, size/XL

#2062 - [proposal] CPUNormalization can amplify cpu via cfs_quota

Issue - State: closed - Opened by LGTH 6 months ago - 2 comments
Labels: area/koordlet, area/koord-manager, kind/proposal, lifecycle/stale

#2061 - koordlet: fix flaky test in pleg

Pull Request - State: closed - Opened by saintube 6 months ago - 4 comments
Labels: lgtm, approved, size/M

#2060 - koordlet: revise flaky test in pleg

Pull Request - State: closed - Opened by saintube 6 months ago - 3 comments
Labels: lgtm, approved, size/M

#2059 - scheduler: support empty reservation affinity

Pull Request - State: closed - Opened by ZiMengSheng 6 months ago - 3 comments
Labels: size/XS, lgtm, approved

#2058 - scheduler: enhance reservation error message

Pull Request - State: open - Opened by saintube 6 months ago - 2 comments
Labels: size/L

#2057 - koordlet: add taskids in statesinformer

Pull Request - State: open - Opened by kangclzjc 6 months ago - 2 comments
Labels: size/L

#2056 - koordlet: fix BlkioReconcile file close

Pull Request - State: closed - Opened by testwill 6 months ago - 4 comments
Labels: size/XS, lgtm, approved

#2055 - koordlet: skip the container which is not running in cpuBurst applyCFSQuotaBurst

Pull Request - State: closed - Opened by yangfeiyu20102011 6 months ago - 2 comments
Labels: size/XS, lgtm, approved

#2054 - koordlet: change CollectContainerThrottledMetric with duration=2*collectoInterval

Pull Request - State: closed - Opened by zwzhang0107 6 months ago - 2 comments
Labels: size/XS, lgtm, approved

#2053 - ci: fix scheduler e2e workflow after 1.28 upgradation

Pull Request - State: closed - Opened by saintube 6 months ago - 2 comments
Labels: lgtm, approved, size/M

#2052 - koordlet: add resctrl updater

Pull Request - State: open - Opened by kangclzjc 6 months ago - 3 comments
Labels: size/L

#2051 - [proposal] koord-descheduler support objectLimiter for namespace

Issue - State: open - Opened by songtao98 6 months ago - 2 comments
Labels: kind/proposal, area/koord-descheduler

#2049 - chore(deps): bump golangci/golangci-lint-action from 5.3.0 to 6.0.1

Pull Request - State: open - Opened by dependabot[bot] 6 months ago - 1 comment
Labels: size/XS, dependencies

#2048 - scheduler: justify device share err msg

Pull Request - State: closed - Opened by ZiMengSheng 6 months ago - 1 comment
Labels: size/M

#2047 - [BUG] Pod Cannot Start When CPUNormalization enable

Issue - State: closed - Opened by LGTH 7 months ago - 6 comments
Labels: area/koordlet, kind/bug

#2046 - koordlet: add nri remove

Pull Request - State: closed - Opened by kangclzjc 7 months ago - 4 comments
Labels: lgtm, approved, size/L

#2045 - Device topology awareness and co-scheduling of GPU and RDMA

Issue - State: closed - Opened by zhangQiWorr 7 months ago - 3 comments
Labels: area/koord-scheduler, kind/question, lifecycle/stale

#2044 - scheduler: make device topology alignment switchable

Pull Request - State: closed - Opened by ZiMengSheng 7 months ago - 5 comments
Labels: lgtm, approved, size/M

#2043 - [proposal] deschedule production pods between nodes

Issue - State: open - Opened by zwForrest 7 months ago - 5 comments
Labels: kind/proposal, area/koord-descheduler

#2042 - Continuously getting "no space left on Device" even though node has enough space.

Issue - State: closed - Opened by kavita1205 7 months ago - 3 comments
Labels: area/koordlet, kind/question, kind/bug, lifecycle/stale

#2041 - scheduler: support pod-level numaAllocateStrategy

Pull Request - State: open - Opened by ZiMengSheng 7 months ago - 3 comments
Labels: do-not-merge/hold, size/M

#2040 - [proposal] Report node allocatable RDT ctrl groups

Issue - State: open - Opened by saintube 7 months ago
Labels: area/koordlet, kind/proposal

#2039 - REQUEST: New membership for <Suozz>

Issue - State: closed - Opened by Suozz 7 months ago - 4 comments
Labels: kind/github-membership

#2038 - all: migrate to 1.28.7

Pull Request - State: closed - Opened by ZiMengSheng 7 months ago - 2 comments
Labels: size/XXL, lgtm, approved

#2036 - REQUEST: New membership for <lucming>

Issue - State: closed - Opened by lucming 7 months ago - 4 comments
Labels: kind/github-membership

#2035 - chore(deps): bump golangci/golangci-lint-action from 5 to 6

Pull Request - State: closed - Opened by dependabot[bot] 7 months ago - 2 comments
Labels: size/XS, dependencies

#2033 - scheduler: fix that cpu should be preferred if numa policy is restricted

Pull Request - State: closed - Opened by KunWuLuan 7 months ago - 3 comments
Labels: lgtm, approved, size/S

#2032 - scheduler: coscheduling plugin sync scheduled in controller

Pull Request - State: closed - Opened by xulinfei1996 7 months ago - 4 comments
Labels: lgtm, approved, size/L

#2031 - scheduler: fix coscheduling plugin podgroup status error

Pull Request - State: closed - Opened by xulinfei1996 7 months ago - 2 comments
Labels: size/S

#2030 - [PodGroup update phase Error] PodGroup cannot Update phase from Scheduling to Scheduled when gangMember larger than two

Issue - State: closed - Opened by PeterChg 7 months ago - 4 comments
Labels: area/koord-scheduler, kind/bug

#2029 - scheduler: coscheduling plugin only record gang OnceResourceSatisfied…

Pull Request - State: closed - Opened by xulinfei1996 7 months ago - 3 comments
Labels: size/XS, lgtm, approved

#2028 - [No space left on device Error] koordlet is throwing No space left

Issue - State: closed - Opened by kavita1205 7 months ago - 2 comments
Labels: kind/question, kind/bug

#2027 - chore: fix some nit error

Pull Request - State: closed - Opened by googs1025 7 months ago - 4 comments
Labels: size/XS, lgtm, approved

#2026 - [install error] koordlet panic

Issue - State: closed - Opened by googs1025 7 months ago - 4 comments
Labels: area/koordlet, kind/question, kind/bug, lifecycle/stale

#2025 - webhook: optimize webhook patchResponse function

Pull Request - State: closed - Opened by ZiMengSheng 7 months ago - 2 comments
Labels: lgtm, approved, size/S

#2024 - webhook: optimize webhook patchResponse function

Pull Request - State: closed - Opened by ZiMengSheng 7 months ago - 3 comments
Labels: size/S

#2023 - chores: add openssf best practices badge into README

Pull Request - State: closed - Opened by songtao98 7 months ago - 3 comments
Labels: size/XS, lgtm, approved

#2022 - chore(deps): bump golangci/golangci-lint-action from 3 to 5

Pull Request - State: closed - Opened by dependabot[bot] 7 months ago - 3 comments
Labels: size/XS, lgtm, approved, dependencies

#2021 - [BUG] The description of throttlingPercent in the documentation is incorrect

Issue - State: open - Opened by j4ckstraw 7 months ago - 3 comments
Labels: documentation, kind/bug

#2020 - chore(deps): bump helm/kind-action from 1.9.0 to 1.10.0

Pull Request - State: open - Opened by dependabot[bot] 7 months ago - 2 comments
Labels: size/XS, dependencies, lifecycle/stale

#2019 - [proposal] Support multi-queue mechanism, similar to what Volcano does.

Issue - State: open - Opened by PeterChg 7 months ago
Labels: kind/proposal

#2018 - scheduler: remove invalid hint in which some numaNode lack resource

Pull Request - State: closed - Opened by ZiMengSheng 7 months ago - 3 comments
Labels: lgtm, approved, size/M

#2017 - scheduler: try best to distribute cpu and memory evenly across numa

Pull Request - State: closed - Opened by ZiMengSheng 7 months ago - 3 comments
Labels: lgtm, approved, size/XL

#2016 - utils: change metrics server util pkg name

Pull Request - State: closed - Opened by zwzhang0107 7 months ago - 3 comments
Labels: size/XS, lgtm, approved

#2014 - feat(deps): bump golang.org/x/net from 0.16.0 to 0.23.0

Pull Request - State: closed - Opened by dependabot[bot] 7 months ago - 4 comments
Labels: size/S, dependencies, lifecycle/stale

#2009 - scheduler: improve gang log

Pull Request - State: closed - Opened by googs1025 8 months ago - 9 comments
Labels: lgtm, approved, size/S