Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / NVIDIA/deepops issues and pull requests

#550 - DeepOps.hosts role is fragile to inventory configuration

Issue - State: closed - Opened by ajdecon over 4 years ago - 6 comments
Labels: no-issue-activity

#100 - hosts: include NIC name bugfix

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#99 - Use separate dir for galaxy roles

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#98 - config - clean out unused variables

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago - 1 comment

#97 - gitignore cleanup

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#96 - slurm - refactor epilog+prolog

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago - 1 comment

#95 - Open sourcing of deepops container dependencies

Issue - State: closed - Opened by hightoxicity almost 6 years ago - 2 comments

#94 - Refactor virtual

Pull Request - State: closed - Opened by michael-balint almost 6 years ago - 1 comment

#93 - nvidia-docker - refactor for idempotence

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#92 - nvidia-driver - refactor for idempotence

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#91 - slurm - cleanup return_to_service variable

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#90 - reboot - use ansible task instead of custom role

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#89 - slurm - upgrade from heirloom-mailx to s-nail

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#88 - nvidia-driver: updates

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#87 - Cannot utilize more than the maximum GPUs in a node

Issue - State: closed - Opened by kmmanto almost 6 years ago - 1 comment

#86 - nvidia-driver playbook: run on all hosts with GPUs

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#85 - Create role for nvidia-docker

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#84 - Updated playbook+role for nvidia driver

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#83 - K8s cluster work

Pull Request - State: closed - Opened by dholt almost 6 years ago

#82 - Slurm prolog+epilog changes

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago - 1 comment

#81 - Slurm role - check for requirements

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#80 - Revive slurm

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#79 - Fix driver PPA package names

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#78 - nvidia-docker: reload docker service

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#77 - rename mgmt to management

Pull Request - State: closed - Opened by dholt almost 6 years ago

#76 - Fix nvidia-docker repo for xenial

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#75 - Fix driver package name

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago - 3 comments

#74 - Resurrect the hosts playbook

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#73 - Finish the migration from mgmt to management

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#72 - Docker playbook - run on tesla-servers, too

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago

#71 - Install slurm deb from github release

Pull Request - State: closed - Opened by lukeyeager almost 6 years ago - 1 comment

#70 - Pull DEPLOYMENT.md cleanup/streamlining changes in and supporting helper scripts

Pull Request - State: closed - Opened by supertetelman almost 6 years ago - 1 comment

#69 - Fixed nodeSelector(s) for elk, ingress and registry

Pull Request - State: closed - Opened by mfoco about 6 years ago

#68 - Grafana does not Display GPU Metrics (Ubuntu 18.04, GTX 1060)

Issue - State: closed - Opened by ekaakurniawan about 6 years ago - 3 comments

#67 - Re-order README to create an end-to-end flow

Pull Request - State: closed - Opened by supertetelman about 6 years ago

#66 - Fix warning about ssh role syntax

Pull Request - State: closed - Opened by lukeyeager about 6 years ago - 1 comment

#65 - Automatically generate machine file, dhcp file, and inventory file

Issue - State: closed - Opened by supertetelman about 6 years ago - 1 comment

#64 - Minor README formatting changes

Pull Request - State: closed - Opened by supertetelman about 6 years ago

#63 - Training

Pull Request - State: closed - Opened by nvhans about 6 years ago - 1 comment

#62 - Update docs to use bootstrap script for setting up provisioning box

Pull Request - State: closed - Opened by supertetelman about 6 years ago

#61 - Create an optional section to setup and install Kubeflow

Issue - State: closed - Opened by supertetelman about 6 years ago - 4 comments

#60 - Pull README changes with merge conflicts fixed

Pull Request - State: closed - Opened by supertetelman about 6 years ago - 1 comment

#59 - General updates to README and additional config comments (#1)

Pull Request - State: closed - Opened by supertetelman about 6 years ago

#58 - adding nfs prefix to the nfs role

Pull Request - State: closed - Opened by ryanolson about 6 years ago

#57 - Fix readme s/pkg/dev/

Pull Request - State: closed - Opened by lukeyeager about 6 years ago

#56 - Incorporate NV Kubernetes documentation examples

Issue - State: closed - Opened by dholt about 6 years ago - 1 comment
Labels: enhancement, help wanted, Documentation

#55 - Document Pod Security Policy

Issue - State: closed - Opened by dholt about 6 years ago - 1 comment
Labels: enhancement, help wanted

#54 - slurm role: append grub options

Pull Request - State: closed - Opened by lukeyeager about 6 years ago

#53 - Merge packaging changes into the dev

Pull Request - State: closed - Opened by dholt about 6 years ago

#52 - improve resolvconf deployment settings

Issue - State: closed - Opened by DataRacer11 about 6 years ago - 1 comment

#51 - General updates to README and additional config comments (#1)

Pull Request - State: closed - Opened by supertetelman about 6 years ago - 1 comment

#50 - Updates to the vagrant workflow

Pull Request - State: closed - Opened by lukeyeager about 6 years ago

#49 - master nodes labeled incorrectly

Issue - State: closed - Opened by dholt about 6 years ago - 1 comment

#48 - Permit to use an internal NTP server for node time syncing

Issue - State: closed - Opened by hightoxicity about 6 years ago - 2 comments

#47 - switch to prometheus operator

Issue - State: closed - Opened by deathowl about 6 years ago - 1 comment

#46 - Allow to push NTP server if defined through DHCP

Pull Request - State: closed - Opened by hightoxicity about 6 years ago

#45 - Allow to push NTP server if defined through DHCP

Pull Request - State: closed - Opened by hightoxicity about 6 years ago

#44 - dgxie HA

Pull Request - State: closed - Opened by hightoxicity about 6 years ago - 1 comment

#43 - dhcp ha by dhcp dyn range sharding

Pull Request - State: closed - Opened by hightoxicity about 6 years ago - 1 comment

#42 - Virtual DeepOps - dgx01 final reboot with no IP address

Issue - State: closed - Opened by yoeljacobsen about 6 years ago - 1 comment

#41 - Alert about spawning DHCP server into the cluster via helm (dgxie service)

Issue - State: closed - Opened by hightoxicity about 6 years ago - 8 comments

#40 - Deepops entirely proxy friendly

Issue - State: closed - Opened by hightoxicity about 6 years ago - 2 comments

#39 - Fix qwant

Pull Request - State: closed - Opened by hightoxicity about 6 years ago - 1 comment

#38 - Use proxies if set

Pull Request - State: closed - Opened by hightoxicity about 6 years ago - 1 comment

#37 - Fixed dgx-servers group and YML file name in README.md

Pull Request - State: closed - Opened by mfoco about 6 years ago

#36 - Permit to provide extra packages to install at preseed

Pull Request - State: closed - Opened by hightoxicity about 6 years ago - 1 comment

#35 - site.yml ansible playbook does not find aptitude package

Issue - State: closed - Opened by hightoxicity about 6 years ago - 2 comments

#34 - Make bootstrap works with proxies

Pull Request - State: closed - Opened by hightoxicity about 6 years ago - 1 comment

#33 - Kubespray useradd failes when home directory doesn't exist

Issue - State: closed - Opened by ryanolson about 6 years ago

#31 - Permit to not have a public interface

Pull Request - State: closed - Opened by hightoxicity about 6 years ago - 1 comment

#30 - https_proxy

Pull Request - State: closed - Opened by hightoxicity about 6 years ago

#28 - Set the istio_enabled config variable in the config example

Issue - State: closed - Opened by hightoxicity over 6 years ago - 3 comments

#27 - tiller init --upgrade is failing in airgapped cluster bootstrapping

Issue - State: closed - Opened by hightoxicity over 6 years ago - 4 comments

#26 - Virtual DeepOps - Minor Updates

Pull Request - State: closed - Opened by michael-balint over 6 years ago

#25 - Virtual DeepOps: deploy a virtual deepops cluster using libvirt

Pull Request - State: closed - Opened by michael-balint over 6 years ago

#24 - Fix typos in README.md

Pull Request - State: closed - Opened by weeellz over 6 years ago - 1 comment

#23 - Release 18.08

Pull Request - State: closed - Opened by dholt over 6 years ago - 1 comment

#21 - Add information for provisioning non-DGX Tesla GPU nodes

Issue - State: closed - Opened by dholt over 6 years ago - 5 comments
Labels: enhancement

#20 - Limit Rook to management nodes

Issue - State: closed - Opened by dholt over 6 years ago - 1 comment

#19 - Simplify monitoring deployment

Issue - State: closed - Opened by dholt over 6 years ago - 1 comment
Labels: enhancement

#18 - WIP: Release 18.07

Pull Request - State: closed - Opened by dholt over 6 years ago

#17 - Support RHEL 7.5 for current Ubuntu-only playbooks

Issue - State: closed - Opened by jlefman over 6 years ago

#16 - Incorporate Slurm build and update instructions

Issue - State: closed - Opened by dholt over 6 years ago - 1 comment
Labels: critical-bug

#15 - WIP: PXE

Pull Request - State: closed - Opened by dholt over 6 years ago

#14 - WIP: Kubespray

Pull Request - State: closed - Opened by dholt over 6 years ago - 1 comment

#13 - Down-rev rook version to fix OSD bug

Pull Request - State: closed - Opened by dholt over 6 years ago

#12 - Add information about SSH keys

Pull Request - State: closed - Opened by dholt over 6 years ago

#11 - Update docker version

Issue - State: closed - Opened by dholt over 6 years ago - 1 comment
Labels: enhancement

#10 - Convert dgxie config-map to inline config

Issue - State: closed - Opened by dholt over 6 years ago - 1 comment
Labels: enhancement

#9 - Clean up Helm instructions, configure Ingress and add container registry with Helm

Pull Request - State: closed - Opened by dholt over 6 years ago
Labels: enhancement

#8 - Generate and use certs for local docker registry

Issue - State: closed - Opened by dholt over 6 years ago - 1 comment
Labels: enhancement

#7 - Convert DGXie to Helm chart

Pull Request - State: closed - Opened by dholt over 6 years ago - 1 comment
Labels: enhancement

#6 - Update Kubespray version

Issue - State: closed - Opened by dholt over 6 years ago - 1 comment
Labels: enhancement

#5 - slack notice?

Issue - State: closed - Opened by jlefman over 6 years ago - 1 comment

#4 - Readme

Pull Request - State: closed - Opened by dholt over 6 years ago

#3 - add DGX POD whitepaper reference

Pull Request - State: closed - Opened by dholt over 6 years ago