Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / leptonai/gpud issues and pull requests

#261 - feat(nvidia/query): bump up nvidia-smi cmd timeout, better debugging info

Pull Request - State: closed - Opened by gyuho 2 months ago - 1 comment

#259 - nit(containerd/pod): use id package for state name

Pull Request - State: closed - Opened by gyuho 2 months ago

#258 - fix(containerd): use consistent state name

Pull Request - State: closed - Opened by cardyok 2 months ago

#256 - fix(disk): add retries for lsblk

Pull Request - State: closed - Opened by gyuho 2 months ago

#254 - nit(disk): rename state key to disk_ext_partition

Pull Request - State: closed - Opened by gyuho 2 months ago

#253 - fix(controller): only read stdout for run command

Pull Request - State: closed - Opened by gyuho 2 months ago

#252 - revert(package_controller): revert read all changes

Pull Request - State: closed - Opened by gyuho 2 months ago - 1 comment

#251 - feat(lsblk): add more test case, clarify parse error

Pull Request - State: closed - Opened by gyuho 2 months ago - 1 comment

#249 - feat(disk): use "findmnt --target" to find filesystem usage

Pull Request - State: closed - Opened by gyuho 2 months ago

#245 - feat(pci): move /states to /events for acs srv-valid checks

Pull Request - State: closed - Opened by gyuho 2 months ago

#244 - chore(deps): bump golang.org/x/crypto from 0.25.0 to 0.31.0

Pull Request - State: closed - Opened by dependabot[bot] 2 months ago
Labels: dependency-issue

#242 - feat(components/dmesg): catch EDAC correctable errorrs in dmesg

Pull Request - State: closed - Opened by gyuho 2 months ago

#242 - feat(components/dmesg): catch EDAC correctable errorrs in dmesg

Pull Request - State: closed - Opened by gyuho 2 months ago

#241 - feat(go.mod): upgrade go sqlite3

Pull Request - State: closed - Opened by gyuho 2 months ago

#241 - feat(go.mod): upgrade go sqlite3

Pull Request - State: closed - Opened by gyuho 2 months ago

#239 - nit(containerd/pod): rename state keys

Pull Request - State: closed - Opened by gyuho 2 months ago

#239 - nit(containerd/pod): rename state keys

Pull Request - State: closed - Opened by gyuho 2 months ago

#238 - nit(dmesg): add more regex OOM matcher test cases with timestamps

Pull Request - State: closed - Opened by gyuho 2 months ago - 1 comment

#238 - nit(dmesg): add more regex OOM matcher test cases with timestamps

Pull Request - State: closed - Opened by gyuho 2 months ago - 1 comment

#237 - nit(diagnose): print matched dmesg line in scan command

Pull Request - State: closed - Opened by gyuho 2 months ago

#237 - nit(diagnose): print matched dmesg line in scan command

Pull Request - State: closed - Opened by gyuho 2 months ago

#236 - feat(components/pci): check PCI access control services for baremetal systems

Pull Request - State: closed - Opened by gyuho 2 months ago - 3 comments

#236 - feat(components/pci): check PCI access control services for baremetal systems

Pull Request - State: closed - Opened by gyuho 2 months ago - 3 comments

#235 - feat(components/os): detect virt environment, system manufacturer

Pull Request - State: closed - Opened by gyuho 2 months ago - 1 comment

#235 - feat(components/os): detect virt environment, system manufacturer

Pull Request - State: closed - Opened by gyuho 2 months ago - 1 comment

#234 - feat(components/dmesg): simplify /events fields

Pull Request - State: closed - Opened by gyuho 2 months ago

#234 - feat(components/dmesg): simplify /events fields

Pull Request - State: closed - Opened by gyuho 2 months ago

#233 - feat(components): add missing event type in /events

Pull Request - State: closed - Opened by gyuho 3 months ago

#233 - feat(components): add missing event type in /events

Pull Request - State: closed - Opened by gyuho 3 months ago

#231 - nit(gpud): fix flag description --expected-port-states-nvidia-infiniband

Pull Request - State: closed - Opened by gyuho 3 months ago - 1 comment

#231 - nit(gpud): fix flag description --expected-port-states-nvidia-infiniband

Pull Request - State: closed - Opened by gyuho 3 months ago - 1 comment

#228 - fix(components): separate timeout for poller get function calls

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#228 - fix(components): separate timeout for poller get function calls

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#227 - feat(nvidia): set components/events timestamp in UTC explicitly

Pull Request - State: closed - Opened by gyuho 3 months ago

#227 - feat(nvidia): set components/events timestamp in UTC explicitly

Pull Request - State: closed - Opened by gyuho 3 months ago

#226 - feat(server): send components in gossip

Pull Request - State: closed - Opened by cardyok 3 months ago

#225 - fix(nvidia/hw-slowdown): rename from "clock" to only expose hardware slowdown issues, convert to events

Pull Request - State: closed - Opened by gyuho 3 months ago - 4 comments
Labels: bug

#225 - fix(nvidia/hw-slowdown): rename from "clock" to only expose hardware slowdown issues, convert to events

Pull Request - State: closed - Opened by gyuho 3 months ago - 4 comments
Labels: bug

#223 - feat(session): make context local to each session for flexibility

Pull Request - State: closed - Opened by cardyok 3 months ago

#223 - feat(session): make context local to each session for flexibility

Pull Request - State: closed - Opened by cardyok 3 months ago

#222 - fix(nvidia): derive product name using NVML results first

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#222 - fix(nvidia): derive product name using NVML results first

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#220 - fix(nvidia/clock): use nvml clock events, fall back to nvidia-smi parsing

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#220 - fix(nvidia/clock): use nvml clock events, fall back to nvidia-smi parsing

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#215 - fix(nvidia/nvml): correct boolean checks on whether clock events supported

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#215 - fix(nvidia/nvml): correct boolean checks on whether clock events supported

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#214 - fix(session): close reader channel on fast return

Pull Request - State: closed - Opened by cardyok 3 months ago

#213 - feat(components/memory): track current jit alloc buffer size, vm alloc status

Pull Request - State: closed - Opened by gyuho 3 months ago - 1 comment

#213 - feat(components/memory): track current jit alloc buffer size, vm alloc status

Pull Request - State: closed - Opened by gyuho 3 months ago - 1 comment

#212 - fix(cmd/gpud): handle "run --expected-port-states-nvidia-infiniband" flag

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#212 - fix(cmd/gpud): handle "run --expected-port-states-nvidia-infiniband" flag

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#210 - feat(session): optimize default transport config

Pull Request - State: closed - Opened by cardyok 3 months ago

#210 - feat(session): optimize default transport config

Pull Request - State: closed - Opened by cardyok 3 months ago

#209 - nit(k8s/pod): quote string node name in case it's empty

Pull Request - State: closed - Opened by gyuho 3 months ago

#209 - nit(k8s/pod): quote string node name in case it's empty

Pull Request - State: closed - Opened by gyuho 3 months ago

#207 - fix(pkg/systemd): handle "n/a" in uptime with trailing characters

Pull Request - State: closed - Opened by gyuho 3 months ago

#205 - fix(session): disable http keep alive

Pull Request - State: closed - Opened by cardyok 3 months ago - 2 comments

#204 - fix(nvidia/infiniband): match mellanox to count PCI devices

Pull Request - State: closed - Opened by gyuho 3 months ago

#203 - can't get gpu info with wsl platform

Issue - State: closed - Opened by zhuima 3 months ago - 11 comments
Labels: feature, awaiting feedback

#202 - feat(nvidia): re-order nvidia-smi collect after NVML calls

Pull Request - State: closed - Opened by gyuho 3 months ago

#199 - fix(nvidia/infiniband): use "<" to evaluate ip port rates

Pull Request - State: closed - Opened by gyuho 3 months ago
Labels: bug

#197 - fix(join): remove space in provider

Pull Request - State: closed - Opened by cardyok 3 months ago

#196 - feat(nvidia/infiniband): make port states configurable

Pull Request - State: closed - Opened by gyuho 3 months ago