Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / ROCm/rccl-tests issues and pull requests

#102 - [GIT] Add CODEOWNERS and PR Template

Pull Request - State: closed - Opened by nileshnegi 10 days ago
Labels: ci:docs-only

#101 - Remove precheckin steps from staticanalysis

Pull Request - State: closed - Opened by samjwu 12 days ago

#100 - hot fixing ncclMemFree for mscclpp

Pull Request - State: closed - Opened by AtlantaPepsi 13 days ago - 1 comment

#99 - [CI] Clone rccl and build from tip of develop

Pull Request - State: closed - Opened by samjwu 14 days ago

#98 - Capturing stderr in the AllReduceSingleProcess test.

Pull Request - State: closed - Opened by corey-derochie-amd 15 days ago

#97 - removing FP8 product from allReduce test cases

Pull Request - State: closed - Opened by mberenjk about 1 month ago

#96 - [Issue]: Some questions focus on the transports in RCCL, including P2P, SHM, and NET

Issue - State: open - Opened by Kyrienn about 1 month ago - 10 comments
Labels: Under Investigation

#95 - Updating to use hipDeviceMallocUncached

Pull Request - State: closed - Opened by saurabhAMD 2 months ago

#94 - Memset to fix inflated performance when GPU is reset

Pull Request - State: closed - Opened by mustafabar 2 months ago

#93 - Add option to output results to a file

Pull Request - State: closed - Opened by dsidler 3 months ago - 2 comments

#92 - Use find_package for MPI

Pull Request - State: closed - Opened by dsidler 3 months ago - 1 comment

#91 - Update Alltoallv test

Pull Request - State: open - Opened by wenkaidu 3 months ago

#90 - [Issue]: 2GPU per node hangs with rccl-tests's alltoall_perf and aws-ofi-rccl plugin

Issue - State: open - Opened by tmh97 4 months ago - 3 comments
Labels: Under Investigation

#89 - Deprecate alltoallv test

Pull Request - State: open - Opened by rahulvaidya20 5 months ago - 1 comment

#89 - Deprecate alltoallv test

Pull Request - State: open - Opened by rahulvaidya20 5 months ago - 1 comment

#88 - Remove CI precheckin script

Pull Request - State: open - Opened by samjwu 5 months ago - 1 comment
Labels: ci:testonly

#88 - Remove CI precheckin script

Pull Request - State: open - Opened by samjwu 5 months ago
Labels: ci:testonly

#87 - [Issue]: 'mpi.h' file not found during rccl-tests build

Issue - State: closed - Opened by jzhang82119 5 months ago - 2 comments
Labels: Under Investigation

#86 - Registered Buffer option from nccl-tests merged

Pull Request - State: closed - Opened by AtlantaPepsi 6 months ago - 3 comments

#85 - [CI] Add static analysis CI

Pull Request - State: closed - Opened by nileshnegi 6 months ago

#85 - [CI] Add static analysis CI

Pull Request - State: closed - Opened by nileshnegi 6 months ago

#84 - Clarification on the HSA_FORCE_FINE_GRAIN_PCIE requirement

Issue - State: closed - Opened by tmh97 7 months ago - 2 comments

#84 - Clarification on the HSA_FORCE_FINE_GRAIN_PCIE requirement

Issue - State: closed - Opened by tmh97 7 months ago - 2 comments

#83 - Fix --root all issue

Pull Request - State: closed - Opened by rahulvaidya20 7 months ago

#82 - Fixing make clean

Pull Request - State: open - Opened by AtlantaPepsi 7 months ago

#82 - Fixing make clean

Pull Request - State: open - Opened by AtlantaPepsi 7 months ago

#81 - Scaling tests to #ngpus

Pull Request - State: closed - Opened by AtlantaPepsi 7 months ago

#81 - Scaling tests to #ngpus

Pull Request - State: closed - Opened by AtlantaPepsi 7 months ago

#80 - Updating to using hipDeviceMallocUncached

Pull Request - State: closed - Opened by gilbertlee-amd 8 months ago - 2 comments

#79 - Rotating tensor -R (default:off)

Pull Request - State: closed - Opened by saurabhAMD 8 months ago - 1 comment

#78 - Rotating Tensor

Pull Request - State: closed - Opened by saurabhAMD 8 months ago - 1 comment

#77 - Rotating Tensor

Pull Request - State: closed - Opened by saurabhAMD 8 months ago

#74 - Fix incorrect device ordinal with limited device visibility

Pull Request - State: closed - Opened by wenkaidu 9 months ago

#73 - [DOCS] Update README for performance-oriented runs

Pull Request - State: closed - Opened by nileshnegi 9 months ago

#72 - All_reduce_perf segfaults with Custom Built RCCL

Issue - State: closed - Opened by tks2004 9 months ago - 3 comments
Labels: Under Investigation

#71 - Amend use of CUSTOM_RCCL_LIB to avoid build error

Pull Request - State: closed - Opened by nileshnegi 10 months ago

#70 - replacing rccl_bfloat16 with hip_bfloat16

Pull Request - State: closed - Opened by mberenjk 10 months ago

#69 - adding git version to rccl-tests

Pull Request - State: closed - Opened by mberenjk 10 months ago
Labels: gfx942

#68 - Revert "adding git version to rccl-test"

Pull Request - State: closed - Opened by akolliasAMD 10 months ago

#67 - printing the version for one rank only

Pull Request - State: closed - Opened by mberenjk 10 months ago
Labels: gfx942

#66 - adding git version to rccl-test

Pull Request - State: closed - Opened by mberenjk 10 months ago

#65 - update the fp8 header file name

Pull Request - State: closed - Opened by Andyli1007 11 months ago

#64 - Revert __nv_bfloat16 back to hip_bfloat16

Pull Request - State: closed - Opened by BertanDogancay 11 months ago

#63 - Enable fp8 support

Pull Request - State: closed - Opened by Andyli1007 11 months ago

#62 - Add hipify steps prior to build

Pull Request - State: closed - Opened by BertanDogancay 11 months ago

#61 - Nccl tests sync

Pull Request - State: closed - Opened by wenkaidu 11 months ago

#60 - Enable fp8 support

Pull Request - State: closed - Opened by Andyli1007 11 months ago

#59 - Fix typo in rank assignment

Pull Request - State: closed - Opened by wenkaidu 11 months ago

#58 - Develop branch merged into master

Pull Request - State: closed - Opened by akolliasAMD 12 months ago - 1 comment

#57 - Add option to disable out-of-place runs

Pull Request - State: closed - Opened by nusislam about 1 year ago

#56 - rccl-test get stuck on gfx1100

Issue - State: open - Opened by Frozenmad about 1 year ago - 5 comments
Labels: Under Investigation

#55 - Update default GPUs and build for AMDGPU_TARGETS

Pull Request - State: closed - Opened by nileshnegi about 1 year ago

#54 - Offload arch linking

Pull Request - State: closed - Opened by lawruble13 about 1 year ago

#52 - Fixing hipcc location for develop CI

Pull Request - State: closed - Opened by BertanDogancay over 1 year ago

#51 - Warm up both out-of-place and in-place collectives

Pull Request - State: closed - Opened by wenkaidu over 1 year ago

#50 - Test NCCL failure common.cu:1285 : internal error

Issue - State: closed - Opened by Eliasj42 over 1 year ago - 1 comment
Labels: Under Investigation

#49 - BUILD: Modified HIPCC path in src/Makefile

Pull Request - State: closed - Opened by nileshnegi over 1 year ago

#48 - Fixing hipcc location for CI (#47)

Pull Request - State: closed - Opened by gilbertlee-amd over 1 year ago

#47 - Fixing hipcc location for CI

Pull Request - State: closed - Opened by gilbertlee-amd over 1 year ago

#46 - Update Makefile - HIPCC Path Updated to latest

Pull Request - State: closed - Opened by arvindcheru over 1 year ago

#45 - hipcc path update

Pull Request - State: closed - Opened by arvindcheru over 1 year ago

#44 - Topic/master cmake sync

Pull Request - State: closed - Opened by edgargabriel over 1 year ago

#43 - search SLES install paths for MPI

Pull Request - State: closed - Opened by edgargabriel over 1 year ago

#42 - Multi-GPU Support with External Pinning

Issue - State: closed - Opened by frobnitzem over 1 year ago - 1 comment
Labels: Under Investigation

#41 - Remove hardcoded number of GPUs limit for alltoallv

Pull Request - State: closed - Opened by wenkaidu over 1 year ago

#40 - Fix merge error

Pull Request - State: closed - Opened by wenkaidu over 1 year ago - 4 comments

#39 - Merge with latest nccl-tests

Pull Request - State: closed - Opened by wenkaidu over 1 year ago

#38 - Merge master branch into develop

Pull Request - State: closed - Opened by wenkaidu over 1 year ago

#35 - fix stack-buffer-overflow reported by address sanitizer

Pull Request - State: open - Opened by jeffdaily almost 2 years ago
Labels: noCI

#34 - fix stack-buffer-overflow reported by address sanitizer

Pull Request - State: open - Opened by jeffdaily almost 2 years ago - 2 comments
Labels: noCI

#33 - fixing the error message for mpirun

Pull Request - State: closed - Opened by PedramAlizadeh almost 2 years ago

#32 - revamp cmake MPI detection

Pull Request - State: closed - Opened by edgargabriel almost 2 years ago - 1 comment

#31 - Adding -pthread flag for linking issues

Pull Request - State: closed - Opened by PedramAlizadeh almost 2 years ago

#30 - Adding -pthread flag for linking issues into src/Makefile

Pull Request - State: closed - Opened by PedramAlizadeh almost 2 years ago

#29 - Adding -pthread flag for linking issues into src/Makefile

Pull Request - State: closed - Opened by PedramAlizadeh almost 2 years ago

#28 - auto-detect and enable MPI

Pull Request - State: closed - Opened by edgargabriel almost 2 years ago

#27 - fix algorithm assigning values in testsuite

Pull Request - State: closed - Opened by edgargabriel about 2 years ago

#26 - Adding the script to build and run the rccl-tests for PTS

Pull Request - State: open - Opened by PedramAlizadeh about 2 years ago
Labels: noCI

#25 - added std::max to avoid buffer overflow

Pull Request - State: closed - Opened by akolliasAMD about 2 years ago

#24 - make cmake stage also pass in CI

Pull Request - State: closed - Opened by edgargabriel about 2 years ago - 1 comment

#23 - add the rccl/lib directory to the link path

Pull Request - State: closed - Opened by edgargabriel about 2 years ago

#22 - fix a messing endif statement

Pull Request - State: closed - Opened by edgargabriel about 2 years ago

#21 - Topic/v2.13.4 sync

Pull Request - State: closed - Opened by edgargabriel over 2 years ago

#20 - Allow more precise measurements of single operation

Pull Request - State: closed - Opened by wenkaidu over 2 years ago

#19 - removed hypercube from Makefile

Pull Request - State: closed - Opened by akolliasAMD over 2 years ago

#18 - Enabling hipGraph codepath for future support

Pull Request - State: closed - Opened by gilbertlee-amd over 2 years ago

#17 - Fix missing error checking for AllocateBuffs due to merge

Pull Request - State: closed - Opened by wenkaidu over 2 years ago

#16 - Test HIP failure common.cu:1129 'hipErrorInvalidDevice' while running test

Issue - State: closed - Opened by sheetalarkadam over 2 years ago - 1 comment

#15 - Add CMake files to build & package

Pull Request - State: closed - Opened by lawruble13 over 2 years ago

#14 - Allow gpu config override

Pull Request - State: closed - Opened by eidenyoshida over 2 years ago

#13 - Build rccl-tests for all supported GPUs

Pull Request - State: closed - Opened by wenkaidu over 2 years ago

#12 - updated alltoallV test to not have any zero values

Pull Request - State: closed - Opened by akolliasAMD over 2 years ago

#11 - update pytest before running CI

Pull Request - State: closed - Opened by edgargabriel over 2 years ago

#10 - Multi rank support

Pull Request - State: closed - Opened by edgargabriel over 2 years ago