Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / NVIDIA/pyxis issues and pull requests
#159 - Alow multiple invocations of --container-mounts in SBATCH header
Issue -
State: open - Opened by cponder 10 days ago
- 1 comment
#158 - Packaging: add a dependency on Slurm version for RPM packages
Pull Request -
State: open - Opened by kcgthb 21 days ago
#157 - Container volumes fail with srun command
Issue -
State: closed - Opened by Tanya1515 23 days ago
- 2 comments
#156 - question: how to see pyxis slurm_spank_log messages from job prolog context
Issue -
State: open - Opened by elelayan 24 days ago
- 2 comments
#155 - Add options to disable concurrent image pulling and saving squashfs files to the dedicated file/directory, config to expose enroot logs
Pull Request -
State: open - Opened by itechdima 28 days ago
- 3 comments
#154 - disable concurrent pull and keep squashfs
Pull Request -
State: closed - Opened by itechdima about 1 month ago
- 1 comment
#153 - Interoperability with cli_filter/user_defaults plugin
Issue -
State: open - Opened by jfolz 3 months ago
- 1 comment
#152 - user supplemental groups do not show up in session, in container
Issue -
State: open - Opened by EternalTB 4 months ago
- 5 comments
#150 - Unable to install correct version w/ Slurm installed from source
Issue -
State: open - Opened by thsu-terraytx 5 months ago
- 7 comments
#149 - Connect to Pyxis Container from VSCode
Issue -
State: closed - Opened by ECMGit 6 months ago
- 6 comments
#148 - --container-image behavior different between cmd line and batch file
Issue -
State: closed - Opened by RHudsonH 6 months ago
- 2 comments
#147 - Unpacking on compute node
Issue -
State: closed - Opened by mmg10 7 months ago
- 1 comment
#146 - Document --container-env with overriding and entrypoint details.
Pull Request -
State: closed - Opened by skandermoalla 7 months ago
- 3 comments
#145 - Can't pass environment variables to entrypoint
Issue -
State: closed - Opened by skandermoalla 7 months ago
- 2 comments
#144 - spank: required plugin spank_pyxis.so: task_init() failed with rc=-1
Issue -
State: closed - Opened by yangzhipeng1108 8 months ago
#143 - Support for fuse mounts
Issue -
State: open - Opened by ocaisa 8 months ago
#142 - Pyxis fails with docker socket permission denied
Issue -
State: closed - Opened by RamHPC 8 months ago
- 4 comments
#141 - Is there a single container instance per allocated node?
Issue -
State: closed - Opened by ocaisa 8 months ago
- 5 comments
#140 - UCX fails when trying to run training across 2 nodes
Issue -
State: closed - Opened by RamHPC 9 months ago
- 1 comment
#139 - Slurm "run" is failing to find "munge" component when run with "container"
Issue -
State: closed - Opened by RamHPC 9 months ago
- 6 comments
#138 - mkdir Permission denied
Issue -
State: open - Opened by proshir 10 months ago
- 5 comments
#137 - Don't get stdout/stderr output from entry point script
Issue -
State: closed - Opened by sphuber 10 months ago
- 7 comments
#136 - Logging in as another user
Issue -
State: open - Opened by calvinp0 10 months ago
- 4 comments
#135 - Error upon invoking container image (failed with rc=-1)
Issue -
State: open - Opened by as7a5 12 months ago
- 5 comments
#134 - Version 0.17.0 breaks passing variables from systemd
Issue -
State: closed - Opened by mathrock74 about 1 year ago
- 2 comments
#133 - --container-mounts can't mount directory with spaces
Issue -
State: open - Opened by mathrock74 about 1 year ago
- 3 comments
#132 - debian: allow slurm-smd to provide libslurm
Pull Request -
State: closed - Opened by lukeyeager about 1 year ago
#131 - cgroups mounted twice into a container, once rw once ro
Issue -
State: closed - Opened by itzsimpl about 1 year ago
- 2 comments
#130 - SLURM enter into a running container using overlap and container-name, mounted path is empty
Issue -
State: closed - Opened by itzsimpl about 1 year ago
- 2 comments
#129 - Running scontrol from within container
Issue -
State: open - Opened by itzsimpl about 1 year ago
- 4 comments
#128 - Pyxis Installation Error
Issue -
State: closed - Opened by leela-uppuluri over 1 year ago
- 2 comments
#127 - pyxis setup issue
Issue -
State: closed - Opened by vinayburugu over 1 year ago
- 7 comments
#126 - how to increase ulimit stack
Issue -
State: closed - Opened by verdimrc over 1 year ago
- 2 comments
#125 - Simple way to check installed pyxis version
Issue -
State: closed - Opened by estepona over 1 year ago
- 4 comments
#124 - Export all Environment Variables
Issue -
State: closed - Opened by sean-smith over 1 year ago
- 1 comment
#123 - epilog failures upgrading to v0.16
Issue -
State: closed - Opened by twh over 1 year ago
- 13 comments
#122 - Container does not start on a small set on cluster
Issue -
State: open - Opened by lidavid88 over 1 year ago
- 2 comments
#121 - `Permission denied` errors when importing Docker image
Issue -
State: closed - Opened by javrtg over 1 year ago
- 2 comments
#120 - mksquashfs unkillable by OOM killer during container export
Issue -
State: closed - Opened by jfolz over 1 year ago
- 5 comments
#119 - Updated Slurm + Pyxis, now entrypoint is `/` instead of current location.
Issue -
State: open - Opened by crinavar over 1 year ago
- 5 comments
#118 - Can't get image from local Docker registry
Issue -
State: closed - Opened by rstober over 1 year ago
- 1 comment
#117 - NCCL cannot use GPU RDMA inside pyxis-enroot container with Slurm
Issue -
State: closed - Opened by szhengac over 1 year ago
- 2 comments
#116 - Each process in nccl test can only see one GPU
Issue -
State: closed - Opened by szhengac over 1 year ago
- 4 comments
#115 - integration with slurmrestd
Issue -
State: open - Opened by jamesbeedy over 1 year ago
- 3 comments
#114 - OpenMPI mpirun received unexpected process identifier
Issue -
State: closed - Opened by verdimrc over 1 year ago
- 2 comments
#113 - Request for enhancement: provide a pyxis option to srun that can map ports from the "base os" to the container.
Issue -
State: closed - Opened by rennich almost 2 years ago
- 2 comments
#112 - pyxis with enroot using local docker registry images failing
Issue -
State: closed - Opened by karanveersingh5623 almost 2 years ago
- 13 comments
#111 - Mutli-GPU fails without adding node/gpu arguments to srun
Issue -
State: closed - Opened by szhengac almost 2 years ago
- 4 comments
#110 - Pyxis not picking up our GPUs
Issue -
State: closed - Opened by slurmuser almost 2 years ago
- 4 comments
#109 - Entrypoint hanging
Issue -
State: closed - Opened by slurmuser almost 2 years ago
- 9 comments
#108 - Questions on ENROOT_CACHE_PATH
Issue -
State: closed - Opened by stephandooper almost 2 years ago
- 2 comments
#107 - Multi-node jobs fail
Issue -
State: closed - Opened by rormseth almost 2 years ago
- 4 comments
#106 - Pyxis randomnly hangs on some imports
Issue -
State: closed - Opened by slurmuser almost 2 years ago
- 1 comment
#105 - New release, please?
Issue -
State: closed - Opened by ltalirz almost 2 years ago
- 2 comments
#104 - srun fails when command is not on host
Issue -
State: closed - Opened by AlienYouth almost 2 years ago
- 9 comments
#103 - Problems running OptiX job through Pyxis, what am I missing?
Issue -
State: closed - Opened by crinavar about 2 years ago
- 6 comments
#102 - Using a rankfile with srun from a container
Issue -
State: closed - Opened by OguzPastirmaci about 2 years ago
- 20 comments
#101 - slurmstepd: error: pyxis: [ERROR] /etc/enroot/hooks.d/98-nvidia.sh exited with return code 1
Issue -
State: closed - Opened by infokng about 2 years ago
- 5 comments
#100 - added clang format settings for project
Pull Request -
State: closed - Opened by fawzi about 2 years ago
- 4 comments
#99 - Fail to srun NCG hpc-benchmarks docker
Issue -
State: closed - Opened by inspurasc about 2 years ago
- 1 comment
#98 - Fail to restart slurmd server after tweaking my PMIx configuration through systemd
Issue -
State: closed - Opened by inspurasc about 2 years ago
- 4 comments
#97 - Fail to make container image writable
Issue -
State: closed - Opened by xinyx62 about 2 years ago
- 1 comment
#96 - srun container (system PATH) issue?
Issue -
State: closed - Opened by xihajun over 2 years ago
- 2 comments
#95 - pyxis plug-in components do not effective
Issue -
State: closed - Opened by xinyx62 over 2 years ago
- 5 comments
#94 - numactl error enroot/pyxis running nvidia hpc-benchmark
Issue -
State: closed - Opened by xinyx62 over 2 years ago
- 7 comments
#93 - Can't srun with pyxis
Issue -
State: closed - Opened by aboseria over 2 years ago
- 10 comments
#92 - Installation issue
Issue -
State: closed - Opened by BDHU over 2 years ago
- 5 comments
#91 - ENROOT_DATA_PATH ignored
Issue -
State: closed - Opened by matyro over 2 years ago
- 9 comments
#90 - Enforcing CPU limits on containers
Issue -
State: closed - Opened by abuettner93 over 2 years ago
- 2 comments
#89 - document how to use enroot `.credentials`
Issue -
State: closed - Opened by ltalirz over 2 years ago
- 2 comments
#88 - Mpix error enroot/pyxis multinode nvidia hpc-benchmark
Issue -
State: closed - Opened by Concluant over 2 years ago
- 2 comments
#87 - Pyxis failing on bright cluster manager , failed to create opaque ovlfs whiteout
Issue -
State: closed - Opened by karanveersingh5623 over 2 years ago
- 2 comments
#86 - specify the user in Docker container in --container-image while giving path
Issue -
State: closed - Opened by praveen5733 over 2 years ago
- 1 comment
#85 - The sshd service can not work correctly in the container
Issue -
State: closed - Opened by kuangllbnu over 2 years ago
- 4 comments
#84 - pyxis option "container-image" not found through slurm
Issue -
State: closed - Opened by shubhammehta03 over 2 years ago
- 11 comments
#83 - Task prolog script fails
Issue -
State: closed - Opened by staeglis over 2 years ago
- 2 comments
#82 - Dependency issue with nvcr.io/nvidia/tensorflow:22.05-tf1-py3
Issue -
State: closed - Opened by karanveersingh5623 over 2 years ago
- 1 comment
#81 - Cannot run nccl tests inside a container
Issue -
State: closed - Opened by rvencu over 2 years ago
- 11 comments
#80 - Not attaching to running container when using --container-name
Issue -
State: closed - Opened by hendraet over 2 years ago
- 5 comments
#79 - ParallelCluster: No error but test suite returns host OS instead of container OS
Issue -
State: closed - Opened by rvencu over 2 years ago
- 1 comment
#78 - slurmstepd: error: pyxis: [ERROR] URL https://registry-1.docker.io/v2/library/???
Issue -
State: closed - Opened by jieguolove over 2 years ago
- 6 comments
#77 - RPM can no longer be build on RH/CentOS 7
Issue -
State: closed - Opened by martialblog almost 3 years ago
- 3 comments
#76 - srun exits while container export is still running
Issue -
State: closed - Opened by jfolz almost 3 years ago
- 4 comments
#75 - SLURM creating container not responding
Issue -
State: closed - Opened by aviaisr almost 3 years ago
- 1 comment
#74 - `nvslurm-plugin-pyxis` packages do not readily work with hardened enroot
Issue -
State: closed - Opened by AlexTMjugador almost 3 years ago
- 4 comments
#73 - SLURM gpus-per-task issue
Issue -
State: open - Opened by itzsimpl almost 3 years ago
- 5 comments
#72 - Will containers launched by pyxis inherit the cgroup limitation in slurm?
Issue -
State: closed - Opened by SolenoidWGT almost 3 years ago
- 2 comments
#71 - ERROR while pulling images
Issue -
State: closed - Opened by SolenoidWGT about 3 years ago
- 2 comments
#70 - enroot import container_image
Issue -
State: closed - Opened by cheyunfei about 3 years ago
- 4 comments
#69 - ParallelCluster
Issue -
State: closed - Opened by maziarraissi about 3 years ago
- 5 comments
#68 - Unable to start container image from local registry using sbatch command
Issue -
State: closed - Opened by gavillom about 3 years ago
- 3 comments
#67 - 401 Unauthorized Error at Fetching image manifest when running `srun` with local sqsh image
Issue -
State: closed - Opened by crinavar about 3 years ago
- 6 comments
#66 - --container-name command line argument wont save or load container on host
Issue -
State: closed - Opened by arnoldas500 over 3 years ago
- 7 comments
#65 - --container-writable is unrecognized option for srun
Issue -
State: closed - Opened by arnoldas500 over 3 years ago
- 3 comments
#64 - Should `--no-container-remap-root` be equivalent to `ENROOT_REMAP_ROOT no` ?
Issue -
State: closed - Opened by vfdev-5 over 3 years ago
- 2 comments
#62 - Error while trying to run Pyxies on AWS Parallel Cluster
Issue -
State: closed - Opened by MrA2K2 over 3 years ago
- 4 comments
#61 - When running srun job with pyxis, I get failed to execute: /bin/bash
Issue -
State: closed - Opened by JonShelley over 3 years ago
- 2 comments
#60 - tf.distribute.MirroredStrategy fails when run with pyxis + slurm: causes NCCL error #51760
Issue -
State: closed - Opened by andrew-johnson-melb over 3 years ago
- 8 comments