Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / NVIDIA/pyxis issues and pull requests

#159 - Alow multiple invocations of --container-mounts in SBATCH header

Issue - State: open - Opened by cponder 10 days ago - 1 comment

#158 - Packaging: add a dependency on Slurm version for RPM packages

Pull Request - State: open - Opened by kcgthb 21 days ago

#157 - Container volumes fail with srun command

Issue - State: closed - Opened by Tanya1515 23 days ago - 2 comments

#154 - disable concurrent pull and keep squashfs

Pull Request - State: closed - Opened by itechdima about 1 month ago - 1 comment

#153 - Interoperability with cli_filter/user_defaults plugin

Issue - State: open - Opened by jfolz 3 months ago - 1 comment

#152 - user supplemental groups do not show up in session, in container

Issue - State: open - Opened by EternalTB 4 months ago - 5 comments

#150 - Unable to install correct version w/ Slurm installed from source

Issue - State: open - Opened by thsu-terraytx 5 months ago - 7 comments

#149 - Connect to Pyxis Container from VSCode

Issue - State: closed - Opened by ECMGit 6 months ago - 6 comments

#148 - --container-image behavior different between cmd line and batch file

Issue - State: closed - Opened by RHudsonH 6 months ago - 2 comments

#147 - Unpacking on compute node

Issue - State: closed - Opened by mmg10 7 months ago - 1 comment

#146 - Document --container-env with overriding and entrypoint details.

Pull Request - State: closed - Opened by skandermoalla 7 months ago - 3 comments

#145 - Can't pass environment variables to entrypoint

Issue - State: closed - Opened by skandermoalla 7 months ago - 2 comments

#143 - Support for fuse mounts

Issue - State: open - Opened by ocaisa 8 months ago

#142 - Pyxis fails with docker socket permission denied

Issue - State: closed - Opened by RamHPC 8 months ago - 4 comments

#141 - Is there a single container instance per allocated node?

Issue - State: closed - Opened by ocaisa 8 months ago - 5 comments

#140 - UCX fails when trying to run training across 2 nodes

Issue - State: closed - Opened by RamHPC 9 months ago - 1 comment

#139 - Slurm "run" is failing to find "munge" component when run with "container"

Issue - State: closed - Opened by RamHPC 9 months ago - 6 comments

#138 - mkdir Permission denied

Issue - State: open - Opened by proshir 10 months ago - 5 comments

#137 - Don't get stdout/stderr output from entry point script

Issue - State: closed - Opened by sphuber 10 months ago - 7 comments

#136 - Logging in as another user

Issue - State: open - Opened by calvinp0 10 months ago - 4 comments

#135 - Error upon invoking container image (failed with rc=-1)

Issue - State: open - Opened by as7a5 12 months ago - 5 comments

#134 - Version 0.17.0 breaks passing variables from systemd

Issue - State: closed - Opened by mathrock74 about 1 year ago - 2 comments

#133 - --container-mounts can't mount directory with spaces

Issue - State: open - Opened by mathrock74 about 1 year ago - 3 comments

#132 - debian: allow slurm-smd to provide libslurm

Pull Request - State: closed - Opened by lukeyeager about 1 year ago

#131 - cgroups mounted twice into a container, once rw once ro

Issue - State: closed - Opened by itzsimpl about 1 year ago - 2 comments

#129 - Running scontrol from within container

Issue - State: open - Opened by itzsimpl about 1 year ago - 4 comments

#128 - Pyxis Installation Error

Issue - State: closed - Opened by leela-uppuluri over 1 year ago - 2 comments

#127 - pyxis setup issue

Issue - State: closed - Opened by vinayburugu over 1 year ago - 7 comments

#126 - how to increase ulimit stack

Issue - State: closed - Opened by verdimrc over 1 year ago - 2 comments

#125 - Simple way to check installed pyxis version

Issue - State: closed - Opened by estepona over 1 year ago - 4 comments

#124 - Export all Environment Variables

Issue - State: closed - Opened by sean-smith over 1 year ago - 1 comment

#123 - epilog failures upgrading to v0.16

Issue - State: closed - Opened by twh over 1 year ago - 13 comments

#122 - Container does not start on a small set on cluster

Issue - State: open - Opened by lidavid88 over 1 year ago - 2 comments

#121 - `Permission denied` errors when importing Docker image

Issue - State: closed - Opened by javrtg over 1 year ago - 2 comments

#120 - mksquashfs unkillable by OOM killer during container export

Issue - State: closed - Opened by jfolz over 1 year ago - 5 comments

#119 - Updated Slurm + Pyxis, now entrypoint is `/` instead of current location.

Issue - State: open - Opened by crinavar over 1 year ago - 5 comments

#118 - Can't get image from local Docker registry

Issue - State: closed - Opened by rstober over 1 year ago - 1 comment

#117 - NCCL cannot use GPU RDMA inside pyxis-enroot container with Slurm

Issue - State: closed - Opened by szhengac over 1 year ago - 2 comments

#116 - Each process in nccl test can only see one GPU

Issue - State: closed - Opened by szhengac over 1 year ago - 4 comments

#115 - integration with slurmrestd

Issue - State: open - Opened by jamesbeedy over 1 year ago - 3 comments

#114 - OpenMPI mpirun received unexpected process identifier

Issue - State: closed - Opened by verdimrc over 1 year ago - 2 comments

#112 - pyxis with enroot using local docker registry images failing

Issue - State: closed - Opened by karanveersingh5623 almost 2 years ago - 13 comments

#111 - Mutli-GPU fails without adding node/gpu arguments to srun

Issue - State: closed - Opened by szhengac almost 2 years ago - 4 comments

#110 - Pyxis not picking up our GPUs

Issue - State: closed - Opened by slurmuser almost 2 years ago - 4 comments

#109 - Entrypoint hanging

Issue - State: closed - Opened by slurmuser almost 2 years ago - 9 comments

#108 - Questions on ENROOT_CACHE_PATH

Issue - State: closed - Opened by stephandooper almost 2 years ago - 2 comments

#107 - Multi-node jobs fail

Issue - State: closed - Opened by rormseth almost 2 years ago - 4 comments

#106 - Pyxis randomnly hangs on some imports

Issue - State: closed - Opened by slurmuser almost 2 years ago - 1 comment

#105 - New release, please?

Issue - State: closed - Opened by ltalirz almost 2 years ago - 2 comments

#104 - srun fails when command is not on host

Issue - State: closed - Opened by AlienYouth almost 2 years ago - 9 comments

#103 - Problems running OptiX job through Pyxis, what am I missing?

Issue - State: closed - Opened by crinavar about 2 years ago - 6 comments

#102 - Using a rankfile with srun from a container

Issue - State: closed - Opened by OguzPastirmaci about 2 years ago - 20 comments

#100 - added clang format settings for project

Pull Request - State: closed - Opened by fawzi about 2 years ago - 4 comments

#99 - Fail to srun NCG hpc-benchmarks docker

Issue - State: closed - Opened by inspurasc about 2 years ago - 1 comment

#98 - Fail to restart slurmd server after tweaking my PMIx configuration through systemd

Issue - State: closed - Opened by inspurasc about 2 years ago - 4 comments

#97 - Fail to make container image writable

Issue - State: closed - Opened by xinyx62 about 2 years ago - 1 comment

#96 - srun container (system PATH) issue?

Issue - State: closed - Opened by xihajun over 2 years ago - 2 comments

#95 - pyxis plug-in components do not effective

Issue - State: closed - Opened by xinyx62 over 2 years ago - 5 comments

#94 - numactl error enroot/pyxis running nvidia hpc-benchmark

Issue - State: closed - Opened by xinyx62 over 2 years ago - 7 comments

#93 - Can't srun with pyxis

Issue - State: closed - Opened by aboseria over 2 years ago - 10 comments

#92 - Installation issue

Issue - State: closed - Opened by BDHU over 2 years ago - 5 comments

#91 - ENROOT_DATA_PATH ignored

Issue - State: closed - Opened by matyro over 2 years ago - 9 comments

#90 - Enforcing CPU limits on containers

Issue - State: closed - Opened by abuettner93 over 2 years ago - 2 comments

#89 - document how to use enroot `.credentials`

Issue - State: closed - Opened by ltalirz over 2 years ago - 2 comments

#88 - Mpix error enroot/pyxis multinode nvidia hpc-benchmark

Issue - State: closed - Opened by Concluant over 2 years ago - 2 comments

#86 - specify the user in Docker container in --container-image while giving path

Issue - State: closed - Opened by praveen5733 over 2 years ago - 1 comment

#85 - The sshd service can not work correctly in the container

Issue - State: closed - Opened by kuangllbnu over 2 years ago - 4 comments

#84 - pyxis option "container-image" not found through slurm

Issue - State: closed - Opened by shubhammehta03 over 2 years ago - 11 comments

#83 - Task prolog script fails

Issue - State: closed - Opened by staeglis over 2 years ago - 2 comments

#82 - Dependency issue with nvcr.io/nvidia/tensorflow:22.05-tf1-py3

Issue - State: closed - Opened by karanveersingh5623 over 2 years ago - 1 comment

#81 - Cannot run nccl tests inside a container

Issue - State: closed - Opened by rvencu over 2 years ago - 11 comments

#80 - Not attaching to running container when using --container-name

Issue - State: closed - Opened by hendraet over 2 years ago - 5 comments

#79 - ParallelCluster: No error but test suite returns host OS instead of container OS

Issue - State: closed - Opened by rvencu over 2 years ago - 1 comment

#77 - RPM can no longer be build on RH/CentOS 7

Issue - State: closed - Opened by martialblog almost 3 years ago - 3 comments

#77 - RPM can no longer be build on RH/CentOS 7

Issue - State: closed - Opened by martialblog almost 3 years ago - 3 comments

#76 - srun exits while container export is still running

Issue - State: closed - Opened by jfolz almost 3 years ago - 4 comments

#75 - SLURM creating container not responding

Issue - State: closed - Opened by aviaisr almost 3 years ago - 1 comment

#74 - `nvslurm-plugin-pyxis` packages do not readily work with hardened enroot

Issue - State: closed - Opened by AlexTMjugador almost 3 years ago - 4 comments

#74 - `nvslurm-plugin-pyxis` packages do not readily work with hardened enroot

Issue - State: closed - Opened by AlexTMjugador almost 3 years ago - 4 comments

#73 - SLURM gpus-per-task issue

Issue - State: open - Opened by itzsimpl almost 3 years ago - 5 comments

#72 - Will containers launched by pyxis inherit the cgroup limitation in slurm?

Issue - State: closed - Opened by SolenoidWGT almost 3 years ago - 2 comments

#71 - ERROR while pulling images

Issue - State: closed - Opened by SolenoidWGT about 3 years ago - 2 comments

#70 - enroot import container_image

Issue - State: closed - Opened by cheyunfei about 3 years ago - 4 comments

#69 - ParallelCluster

Issue - State: closed - Opened by maziarraissi about 3 years ago - 5 comments

#68 - Unable to start container image from local registry using sbatch command

Issue - State: closed - Opened by gavillom about 3 years ago - 3 comments

#66 - --container-name command line argument wont save or load container on host

Issue - State: closed - Opened by arnoldas500 over 3 years ago - 7 comments

#65 - --container-writable is unrecognized option for srun

Issue - State: closed - Opened by arnoldas500 over 3 years ago - 3 comments

#64 - Should `--no-container-remap-root` be equivalent to `ENROOT_REMAP_ROOT no` ?

Issue - State: closed - Opened by vfdev-5 over 3 years ago - 2 comments

#62 - Error while trying to run Pyxies on AWS Parallel Cluster

Issue - State: closed - Opened by MrA2K2 over 3 years ago - 4 comments

#61 - When running srun job with pyxis, I get failed to execute: /bin/bash

Issue - State: closed - Opened by JonShelley over 3 years ago - 2 comments