An open API service for providing issue and pull request metadata for open source projects.

GitHub / pytorch/pytorch issues and pull requests

Labelled with: module: cuda

#165548 - [CUDA][cuBLASLt] addmm -- enable 2D bias in the Lt path

Pull Request - State: open - Opened by nikitaved about 1 month ago - 1 comment
Labels: module: cuda, open source, ciflow/trunk, topic: not user facing, matrix multiplication, ciflow/rocm, ciflow/rocm-mi300, ciflow/h100, ciflow/h100-symm-mem, ciflow/b200

#164586 - [CUDA][Muon] bump tolerances for Muon test

Pull Request - State: open - Opened by eqy about 2 months ago
Labels: module: optimizer, module: cuda, module: tests, open source, topic: not user facing, matrix multiplication

#164563 - [Blackwell][Inductor] Numerical mismatches in test_max_autotune.py::TestMaxAutotune::test_non_contiguous_input_mm_plus_mm

Issue - State: open - Opened by Aidyn-A about 2 months ago
Labels: module: cuda, module: inductor, Blackwell

#164480 - [CUDA][Inductor][B200] re-bump tolerances for `test_baddmm` in `test_max_autotune.py`

Pull Request - State: open - Opened by eqy about 2 months ago
Labels: module: cuda, open source, topic: not user facing, Blackwell, ciflow/b200

#164354 - Remove workaround to old CUDA bug

Pull Request - State: open - Opened by pearu about 2 months ago - 4 comments
Labels: module: cuda, module: cpu, open source, release notes: cpp, topic: not user facing

#164049 - [CUDA] fix indexing on large tensor causing invalid configuration argument

Pull Request - State: closed - Opened by Isalia20 about 2 months ago - 5 comments
Labels: module: cuda, open source, Merged, ciflow/trunk, release notes: cuda, topic: bug fixes, merging

#164048 - [CUDA] indexing on large tensor causes invalid configuration argument

Issue - State: closed - Opened by Isalia20 about 2 months ago
Labels: module: cuda

#163664 - [BE] Add Linux aarch64 CUDA install and test to validation framework

Issue - State: closed - Opened by atalman about 2 months ago
Labels: module: binaries, module: cuda, triaged, better-engineering, topic: binaries

#163658 - [CI] Downgrading CUDA driver results: No devices were found

Issue - State: closed - Opened by atalman about 2 months ago - 1 comment
Labels: module: cuda, module: ci, triaged, module: third_party, has workaround

#163581 - [cuDNN][Convolution] Disable cuDNN for 3D convolutions with kernel size != 1 for cuDNN 9.8+

Pull Request - State: closed - Opened by eqy about 2 months ago - 7 comments
Labels: module: cudnn, module: cuda, module: cpu, module: convolution, triaged, open source, ciflow/trunk, topic: bug fixes, release notes: cudnn, module: inductor, ciflow/inductor, merging

#163342 - [CD] - Manywheel CUDA builds failing since Sept 18

Issue - State: closed - Opened by robert-hardwick 2 months ago - 5 comments
Labels: high priority, triage review, module: binaries, module: cuda, triaged, module: regression

#163299 - [CUDA] Cleanup persistent cuBLASLt workspaces before compile-regions test

Pull Request - State: closed - Opened by eqy 2 months ago - 4 comments
Labels: module: cuda, module: tests, triaged, module: cublas, open source, Merged, ciflow/trunk, topic: not user facing, module: CUDACachingAllocator, matrix multiplication, merging

#163070 - [TEST][CUDA] Use proper dtype in test_cuda_tensor_pow_scalar_tensor_cuda

Pull Request - State: closed - Opened by Aidyn-A 2 months ago - 3 comments
Labels: module: cuda, open source, Merged, ciflow/trunk, topic: not user facing, merging

#163001 - [CUDA13][CUDA Graphs] Fix `test_graph_external_wait_and_record` to account for minimum supported compute-capability

Pull Request - State: open - Opened by eqy 2 months ago
Labels: module: cuda, open source, module: cuda graphs, topic: not user facing

#162995 - [CUDA] fix shared memory race in `reduce_kernel`

Pull Request - State: open - Opened by eqy 2 months ago
Labels: module: cuda, open source, module: reductions, topic: not user facing

#162626 - compile_kernel: Handle python floats as c double

Pull Request - State: closed - Opened by msaroufim 2 months ago - 17 comments
Labels: module: cuda, ciflow/trunk, release notes: cuda, topic: bug fixes, merging

#162578 - [BUG] Fix nonzero_static crash on CUDA when the input is an empty tensor

Pull Request - State: closed - Opened by can-gaa-hou 2 months ago - 10 comments
Labels: module: cuda, triaged, open source, Merged, ciflow/trunk, release notes: cuda, topic: bug fixes, merging

#162544 - Build fbgemm_gpu for TORCH_CUDA_ARCH_LIST=10.0 and CUDA 12.8 and 12.9

Pull Request - State: closed - Opened by danielvegamyhre 2 months ago - 4 comments
Labels: module: cuda, Merged, ciflow/trunk, topic: not user facing, merging

#162333 - [CD] Windows CUDA 13.0 binaries : Windows fatal exception: access violation

Issue - State: closed - Opened by atalman 2 months ago - 9 comments
Labels: high priority, module: binaries, module: windows, module: cuda, oncall: releng, triaged

#162322 - [CUDA 13][cuDNN][Windows] Roll back cuDNN upgrade from 9.13 to 9.12 on Windows

Pull Request - State: closed - Opened by eqy 2 months ago - 3 comments
Labels: module: windows, module: cudnn, module: cuda, open source, Merged, ciflow/trunk, topic: not user facing, merging

#162203 - Update round size with 1 division behavior

Pull Request - State: open - Opened by morrison-turnansky 3 months ago - 5 comments
Labels: oncall: distributed, module: cuda, module: cpu, triaged, open source, NNC, release notes: quantization, topic: not user facing, module: inductor, module: dynamo, release notes: distributed (checkpoint), module: compiled autograd, release notes: inductor (aoti)

#162186 - [CUDA][CUDAGraph] Reduce capture overhead in CUDA Graph memory reuse

Pull Request - State: closed - Opened by eee4017 3 months ago - 11 comments
Labels: module: cuda, open source, Merged, module: cuda graphs, ciflow/trunk, topic: not user facing, merging, ciflow/rocm

#162185 - [B200][NVFP4] Fix argument passing in `test_blockwise_mxfp8_nvfp4_mxfp4_numerics_`

Issue - State: closed - Opened by eqy 3 months ago - 3 comments
Labels: module: cuda, open source, Merged, ciflow/trunk, topic: not user facing, merging, module: floatx (formerly float8), ciflow/h100

#162180 - [B200][MXFP8] Fix regex in `test_blockwise_mxfp8_nvfp4_error_messages_recipe_mxfp8_cuda`

Pull Request - State: closed - Opened by eqy 3 months ago - 3 comments
Labels: module: cuda, open source, Merged, ciflow/trunk, topic: not user facing, merging, module: floatx (formerly float8)

#162073 - [cuDNN][SDPA] Enable cuDNN SDPA by default for SM 9.0, SM 10.0

Pull Request - State: open - Opened by eqy 3 months ago
Labels: module: cudnn, module: cuda, open source, topic: not user facing, module: sdpa

#161884 - IPC in the ExpandableSegment can not correctly match the handle from the map ipcMemHandle_to_devptr.

Issue - State: closed - Opened by mengph 3 months ago - 1 comment
Labels: module: cuda, triaged

#161822 - [CUDA][cuBLAS] #125888 introduces measurable CPU overhead for matmuls

Issue - State: closed - Opened by eqy 3 months ago - 4 comments
Labels: module: performance, module: cuda, triaged, module: python frontend

#161789 - "cudaErrorIllegalAddress" in unique_consecutive (with return_inverse and RMM)

Issue - State: closed - Opened by nboeschen 3 months ago - 1 comment
Labels: module: crash, module: cuda, triaged

#161749 - [cuBLAS] update cuBLAS determinism docs, remove workspace requirement checks

Pull Request - State: closed - Opened by eqy 3 months ago - 8 comments
Labels: module: cuda, triaged, module: cublas, module: determinism, open source, Merged, ciflow/trunk, release notes: cuda, merging, ciflow/h100

#161649 - Use vectorized stores for all dtypes in cat

Pull Request - State: closed - Opened by ngimel 3 months ago - 31 comments
Labels: module: cuda, Merged, Reverted, ciflow/trunk, release notes: cuda, ci-no-td

#161581 - DISABLED test_autocast_ignored_types (__main__.TestCudaAutocast)

Issue - State: open - Opened by pytorch-bot[bot] 3 months ago - 3 comments
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#161481 - About torch.UntypedStorage._new_shared_cuda

Issue - State: closed - Opened by Schnabel-8 3 months ago - 1 comment
Labels: module: multiprocessing, module: cuda, triaged

#161434 - [cuDNN][SDPA][Nested Tensor] add forward/backward caching support for cuDNN SDPA Nested tensor/varlen

Pull Request - State: open - Opened by eqy 3 months ago
Labels: module: cudnn, module: cuda, open source, module: nestedtensor, topic: not user facing, module: sdpa

#161399 - [ATen][CUDA] Add family conditional for CUTLASS matmuls

Pull Request - State: closed - Opened by Aidyn-A 3 months ago - 3 comments
Labels: module: cuda, triaged, open source, release notes: cuda, matrix multiplication, module: floatx (formerly float8), module: core aten

#161380 - CUDA 13 -- sm_120 -- Nvidia 5090 -- ptxas warning : Value of threads …

Pull Request - State: closed - Opened by DrStone71 3 months ago - 15 comments
Labels: module: cuda, triaged, open source, Merged, ciflow/trunk, topic: bug fixes, topic: not user facing, merging

#161305 - [cuBLASLt][FP8] `cuBLASLt` appears to support float8 rowwise-scaling on H100

Pull Request - State: open - Opened by eqy 3 months ago - 13 comments
Labels: module: cuda, module: cublas, open source, Merged, Reverted, ciflow/trunk, topic: not user facing, module: floatx (formerly float8), ci-no-td, ciflow/rocm-mi300

#161177 - [cuDNN][convolution] remove redundant conv3d 64bit test

Pull Request - State: closed - Opened by eqy 3 months ago - 15 comments
Labels: module: cudnn, module: cuda, open source, Merged, module: tf32, ciflow/trunk, topic: not user facing, merging

#161139 - `roundup_power2_divisions` ignores the value of 1

Issue - State: closed - Opened by nick-griaznov 3 months ago
Labels: module: cuda, triaged, module: CUDACachingAllocator

#161107 - [CUDA] Don't import hipify modules on CUDA

Pull Request - State: open - Opened by eqy 3 months ago
Labels: module: cuda, open source, topic: not user facing

#161046 - torch.ones(1, device=torch.cuda.current_device()) CUDA error: operation not supported

Issue - State: closed - Opened by EvilCalf 3 months ago - 2 comments
Labels: needs reproduction, module: cuda, triaged

#160992 - fix-unpin-memory-tensor-param

Pull Request - State: closed - Opened by ghostspiders 3 months ago - 9 comments
Labels: oncall: distributed, module: cuda, triaged, open source, Merged, ciflow/trunk, release notes: distributed (checkpoint), merging

#160960 - compile error: 'nvmlDeviceGetGpuFabricInfoV' was not declared in this scope

Issue - State: closed - Opened by can-gaa-hou 3 months ago - 1 comment
Labels: module: build, module: cuda, triaged, actionable

#160719 - DISABLED test_graph_partition_cpu_scalar_multiple (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 3 months ago - 8 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#160693 - [FP8][cuBLAS][SM100] cuBLAS doesn't support rowwise-scaling on `sm100`

Pull Request - State: open - Opened by eqy 3 months ago
Labels: module: cuda, module: cublas, open source, topic: not user facing, matrix multiplication, module: floatx (formerly float8)

#160598 - DISABLED test_autocast_custom_enabled (__main__.TestCudaAutocast)

Issue - State: closed - Opened by pytorch-bot[bot] 3 months ago - 6 comments
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#160554 - [ATen][CUDA] Use new CCCL API in v2.8

Pull Request - State: closed - Opened by Aidyn-A 3 months ago - 3 comments
Labels: module: cuda, open source, ciflow/trunk, release notes: cuda, topic: not user facing, merging, module: core aten

#160551 - DISABLED test_autocast_custom_cast_inputs (__main__.TestCudaAutocast)

Issue - State: closed - Opened by pytorch-bot[bot] 3 months ago - 6 comments
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#160507 - DISABLED test_autocast_checkpointing (__main__.TestCudaAutocast)

Issue - State: closed - Opened by pytorch-bot[bot] 3 months ago - 9 comments
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#160385 - TTGIR error for FlexAttention on B200

Issue - State: closed - Opened by drisspg 3 months ago - 2 comments
Labels: module: cuda, triaged, oncall: pt2, upstream triton, module: higher order operators, module: pt2-dispatcher, module: flex attention, Blackwell

#160192 - Flex Attention heuristics: a Blackwell config

Pull Request - State: open - Opened by Aidyn-A 3 months ago
Labels: module: cuda, topic: not user facing, module: inductor, module: flex attention

#159939 - [Test][Easy] Use float16 dtype in test_sort_large

Pull Request - State: closed - Opened by Aidyn-A 4 months ago - 7 comments
Labels: module: cuda, open source, ciflow/trunk, topic: not user facing, merging

#159892 - [RFC] CUDAPluggableAllocator receives malloc request of size zero.

Issue - State: closed - Opened by siyuanchai1999 4 months ago - 4 comments
Labels: module: cuda, triaged

#159802 - DISABLED test_pin_memory_no_cuda (__main__.TestDictDataLoader)

Issue - State: closed - Opened by izaitsevfb 4 months ago - 2 comments
Labels: module: dataloader, module: cuda, triaged, skipped

#159689 - [CUDA] Skip pynvml test on platforms that don't have complete support

Pull Request - State: open - Opened by eqy 4 months ago
Labels: module: cuda, open source, topic: not user facing

#159672 - [CUDA] Add some more missing `@serialTest` decorators

Pull Request - State: closed - Opened by eqy 4 months ago - 6 comments
Labels: module: cuda, open source, Merged, ciflow/trunk, topic: not user facing, merging

#159663 - DISABLED test_mempool_empty_cache_inactive (__main__.TestMemPool)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago - 2 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#159466 - [ATen][CUDA][cuFFT] Guard against deprecated error codes

Pull Request - State: closed - Opened by Aidyn-A 4 months ago - 3 comments
Labels: module: cuda, triaged, open source, module: fft, Merged, ciflow/trunk, topic: not user facing, merging, module: core aten

#159446 - not support nvidia 5060

Issue - State: closed - Opened by tuqizhao 4 months ago - 9 comments
Labels: module: windows, module: cuda, triaged

#159309 - torch.nn.InstanceNorm2d CPU/GPU Inconsistency

Issue - State: closed - Opened by ChaitanyaRS06 4 months ago - 1 comment
Labels: module: cuda

#159305 - [CUDA][CUDA Graphs] Move cuda graphs test to subprocess to avoid polluting mempool tests

Pull Request - State: open - Opened by eqy 4 months ago
Labels: module: cuda, open source, module: cuda graphs, topic: not user facing

#159286 - DISABLED test_graph_memory_stats_and_use_result_after_destroy_graph (__main__.TestCuda)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago
Labels: module: cuda, triaged, module: flaky-tests, skipped

#159285 - DISABLED test_cuda_kernel_loop_overflow_large (__main__.TestCuda)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago
Labels: module: cuda, triaged, module: flaky-tests, skipped

#159284 - DISABLED test_graph_memory_stats_and_use_result_after_destroy_graph (__main__.TestCuda)

Issue - State: closed - Opened by pytorch-bot[bot] 4 months ago - 1 comment
Labels: module: cuda, triaged, module: flaky-tests, skipped

#159283 - DISABLED test_cuda_kernel_loop_overflow_large (__main__.TestCuda)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago
Labels: module: cuda, triaged, module: flaky-tests, skipped

#159271 - [CUDA] Add `serialTest` decorator to `largeTensorTest` in `test_cuda.py`

Pull Request - State: closed - Opened by eqy 4 months ago - 9 comments
Labels: module: cuda, triaged, module: 64-bit, open source, Merged, ciflow/trunk, topic: not user facing, merging

#159265 - Unpin dependency version when possible

Issue - State: open - Opened by malfet 4 months ago
Labels: module: binaries, module: cuda

#159113 - DISABLED test_graph_two_successive (__main__.TestCuda)

Issue - State: closed - Opened by pytorch-bot[bot] 4 months ago - 2 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#159104 - [CUDA] Add experimental green context support for SM carveout

Pull Request - State: closed - Opened by eqy 4 months ago - 17 comments
Labels: module: cuda, triaged, open source, Merged, Reverted, ciflow/trunk, topic: not user facing, matrix multiplication, merging, ci-no-td, lint-all-files

#159068 - DISABLED test_cuda_kernel_loop_overflow (__main__.TestCuda)

Issue - State: closed - Opened by pytorch-bot[bot] 4 months ago - 1 comment
Labels: module: cuda, triaged, module: flaky-tests, skipped

#159038 - DISABLED test_cuda_memory_leak_detection_propagates_errors (__main__.TestCuda)

Issue - State: closed - Opened by pytorch-bot[bot] 4 months ago - 1 comment
Labels: module: cuda, triaged, module: flaky-tests, skipped

#158994 - [CUDA] Fix missing `__syncthreads` in MultiMarginLoss backward

Pull Request - State: closed - Opened by eqy 4 months ago - 3 comments
Labels: module: loss, module: cuda, open source, Merged, ciflow/trunk, release notes: nn, topic: bug fixes, merging

#158981 - [conv][cuDNN][64-bit indexing] reduce memory usage of depthwise conv 64-bit indexing test

Pull Request - State: closed - Opened by eqy 4 months ago - 4 comments
Labels: module: cudnn, module: cuda, module: convolution, triaged, module: 64-bit, open source, Merged, ciflow/trunk, topic: not user facing, merging

#158921 - Is it necessary to add a __syncthreads in the MultiMarginLoss_backward_kernel of file aten/src/ATen/native/cuda/MultiMarginLoss.cu

Issue - State: closed - Opened by zzppq 4 months ago - 1 comment
Labels: module: cuda, triaged, module: correctness (silent)

#158764 - DISABLED test_allocate_in_thread_to_pool (__main__.TestBlockStateAbsorption)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago - 10 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#158494 - [B200] Fix flex-attention heuristic for `test_tma_with_customer_kernel_options_cuda`

Pull Request - State: closed - Opened by eqy 4 months ago - 3 comments
Labels: module: cuda, open source, Merged, ciflow/trunk, topic: not user facing, module: inductor, ciflow/inductor, merging, module: flex attention

#158484 - [Fix] Rework CUDA error explanation framework to be less destructive …

Pull Request - State: closed - Opened by Raymo111 4 months ago - 6 comments
Labels: module: cuda, ciflow/trunk, topic: not user facing, merging

#158395 - Add framework for explanations for common CUDA errors

Pull Request - State: closed - Opened by Raymo111 4 months ago - 3 comments
Labels: module: cuda, Merged, ciflow/trunk, release notes: cuda, merging

#158334 - DISABLED test_live_outputs_multiple_graphs (__main__.CudaGraphTreeTests)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#158301 - Add warning about removed sm50 and sm60 arches

Pull Request - State: open - Opened by atalman 4 months ago - 1 comment
Labels: module: cuda, topic: not user facing

#158172 - `torch.fmin` has inconsistent overflow behavior on CPU and GPU

Issue - State: open - Opened by jiren-the-gray 4 months ago - 1 comment
Labels: module: cuda, triaged, module: edge cases

#158122 - [RFC] A Distributed CUDA Unified Memory Backend for PyTorch

Issue - State: open - Opened by matthewdcong 4 months ago
Labels: oncall: distributed, module: cuda, triaged, module: PrivateUse1

#158037 - Support DeepSeek-style blockwise scaling scaled-mm for fp8 on Hopper+

Pull Request - State: closed - Opened by lw 4 months ago - 23 comments
Labels: module: cuda, Merged, Reverted, ciflow/trunk, topic: not user facing, merging, module: floatx (formerly float8), ciflow/rocm, ci-no-td

#157999 - [CUDA] Support family-conditional compute capabilities in `TORCH_CUDA_ARCH_LIST`

Pull Request - State: closed - Opened by eqy 4 months ago - 3 comments
Labels: module: build, module: cuda, open source, ciflow/trunk, topic: not user facing, topic: build, merging

#157901 - DISABLED test_graph_partition_reorder_custom_op_with_no_dependency1 (__main__.CudaGraphTreeTests)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago - 3 comments
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#157791 - [BE]: Reduce binary size 40% using aggressive fatbin compression.

Pull Request - State: closed - Opened by Skylion007 4 months ago - 35 comments
Labels: module: cuda, triaged, open source, better-engineering, Merged, Reverted, Stale, ciflow/trunk, release notes: cuda, release notes: releng, release notes: build, ciflow/binaries_wheel, ciflow/binaries_libtorch, ci-no-td

#157761 - DISABLED test_graph_partition_reorder_cpu_and_gpu (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 4 months ago - 6 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#157725 - DISABLED test_resnet (__main__.TestBlockStateAbsorption)

Issue - State: open - Opened by pytorch-bot[bot] 4 months ago - 9 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#157723 - DISABLED test_graph_partition_forward_with_skipped_cudagraphed_backward (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 4 months ago - 8 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#157711 - [Easy][Profiler] Fix pattern matcher of profiler

Pull Request - State: closed - Opened by Aidyn-A 5 months ago - 14 comments
Labels: module: cuda, triaged, open source, oncall: profiler, Merged, ciflow/trunk, topic: not user facing, merging

#157668 - NCCL error caused due to use of NVLS in torch 2.7.1-cu128 on aarch64 gb200 cluster

Issue - State: open - Opened by mihirp1998 5 months ago
Labels: triage review, module: cuda, module: nccl, module: arm

#157643 - DISABLED test_graph_partition_forward_backward_not_called (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 5 months ago - 6 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#157616 - DISABLED test_graph_partition_forward_backward (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 5 months ago - 6 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#157586 - [CUDA][NVTX] use `pytorch` nvtx domain for pytorch ranges

Pull Request - State: closed - Opened by eqy 5 months ago - 3 comments
Labels: module: cuda, triaged, open source, Stale, topic: not user facing

#157535 - Error in Qwen inference: NVML_SUCCESS == r INTERNAL ASSERT FAILED at "/pytorch/c10/cuda/CUDACachingAllocator.cpp

Issue - State: closed - Opened by dineshsoudagar 5 months ago - 3 comments
Labels: high priority, needs reproduction, module: cuda, module: error checking, triaged

#157533 - DISABLED test_graph_partition_custom_op_no_split (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 5 months ago - 4 comments
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#157449 - DISABLED test_graph_partition_custom_op_mutation (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 5 months ago - 10 comments
Labels: module: cuda, triaged, module: flaky-tests, skipped

#157426 - DISABLED test_graph_partition_custom_op_dynamoc_shapes (__main__.CudaGraphTreeTests)

Issue - State: closed - Opened by pytorch-bot[bot] 5 months ago - 1 comment
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped

#157412 - DISABLED test_graph_partition_custom_op (__main__.CudaGraphTreeTests)

Issue - State: open - Opened by pytorch-bot[bot] 5 months ago
Labels: module: cuda, module: rocm, triaged, module: flaky-tests, skipped