Ecosyste.ms: Issues
An open API service that provides issue and pull request metadata for open source projects.
GitHub / rocm/composable_kernel issues and pull requests
#1643 - Add generic instances for two stage conv bwd wei
Pull Request -
State: open - Opened by bartekxk 8 days ago
#1642 - MoeSorting fuse set zero
Pull Request -
State: closed - Opened by dummycoderfe 8 days ago
#1641 - Compilation failure with modified tuning parameters on CK Tile GEMM
Issue -
State: open - Opened by zjing14 9 days ago
- 1 comment
#1640 - remove gfx940;gfx941 from default target lists
Pull Request -
State: closed - Opened by illsilin 9 days ago
#1639 - Prevent instantiation of undefined FP8 operators.
Pull Request -
State: closed - Opened by andriy-ca 9 days ago
#1638 - [Issue]: FMHA change of drop_seed_offset to std::variant is breaking builds
Issue -
State: open - Opened by iratebadger 9 days ago
#1637 - [Ck tile] layernorm2d fwd optimize
Pull Request -
State: open - Opened by dummycoderfe 9 days ago
- 1 comment
#1636 - Update ck_a8w8
Pull Request -
State: open - Opened by aska-0096 9 days ago
#1635 - [generate.py] Override blob list if it already exists
Pull Request -
State: closed - Opened by jmmartinez 9 days ago
#1634 - [WIP][CK_TILE] moe
Pull Request -
State: open - Opened by carlushuang 10 days ago
#1633 - Make sure cmake can handle the xnack+/xnack- targets.
Pull Request -
State: closed - Opened by illsilin 10 days ago
#1632 - A prototype of TF32 gemm
Pull Request -
State: open - Opened by zjing14 10 days ago
#1631 - Statically Cast Pointer Offset
Pull Request -
State: closed - Opened by darren-amd 10 days ago
#1630 - Temporary disable part of dynamic op conv instances
Pull Request -
State: closed - Opened by bartekxk 10 days ago
#1629 - [CK_TILE] Allow using default gemm pipeline policy
Pull Request -
State: open - Opened by poyenc 10 days ago
#1628 - [do not review] int4 scale based on jzhang's pre work
Pull Request -
State: open - Opened by mtgu0705 10 days ago
Labels: noCI
#1627 - [DO NOT REVIEW]
Pull Request -
State: open - Opened by mtgu0705 10 days ago
Labels: noCI
#1626 - Linsun/convint8 fwd instances
Pull Request -
State: closed - Opened by linsun12 13 days ago
- 4 comments
#1625 - Linsun/convint8 fwd instances
Pull Request -
State: closed - Opened by linsun12 13 days ago
#1624 - Ck tile/moe sorting
Pull Request -
State: open - Opened by dummycoderfe 13 days ago
#1623 - [CK_TILE] layernorm have more accurate residual
Pull Request -
State: closed - Opened by carlushuang 13 days ago
- 1 comment
#1622 - [CK_TILE] Add small warp gemm
Pull Request -
State: open - Opened by poyenc 14 days ago
#1621 - Reduce build time.
Pull Request -
State: closed - Opened by illsilin 14 days ago
#1620 - [layernorm] hot fix
Pull Request -
State: closed - Opened by carlushuang 14 days ago
#1619 - [CK_TILE] Add operator batched_transpose
Pull Request -
State: open - Opened by fangche123 14 days ago
- 1 comment
#1618 - Generic threshold calculation after merge fixes
Pull Request -
State: closed - Opened by aledudek 14 days ago
#1617 - [Ck_tile] smoothquant
Pull Request -
State: closed - Opened by rocking5566 14 days ago
#1616 - [Ck tile] smoothquant
Pull Request -
State: closed - Opened by rocking5566 15 days ago
#1615 - Ck tile batched gemm example
Pull Request -
State: open - Opened by aledudek 15 days ago
#1614 - Batched GEMM Multiple D based on Universal GEMM
Pull Request -
State: open - Opened by zjing14 15 days ago
#1613 - fix clang format
Pull Request -
State: closed - Opened by illsilin 15 days ago
- 1 comment
#1612 - [HOTFIX] fix ci fail
Pull Request -
State: closed - Opened by rocking5566 15 days ago
- 1 comment
#1611 - CK Tile Batched gemm example
Pull Request -
State: closed - Opened by aledudek 16 days ago
#1610 - Remove virtual destructors from unary ops
Pull Request -
State: closed - Opened by bartekxk 16 days ago
#1609 - [CK_TILE] add scatter_gather
Pull Request -
State: closed - Opened by valarLip 16 days ago
#1608 - [CK_TILE] Add fmha fwd headdim96 support
Pull Request -
State: closed - Opened by qianfengz 16 days ago
#1607 - [CK_TILE] add generic_permute
Pull Request -
State: closed - Opened by valarLip 16 days ago
#1606 - fix compilation errors for gfx12 with clang20
Pull Request -
State: closed - Opened by illsilin 17 days ago
#1605 - [Ck tile] support rmsnorm and related fusion
Pull Request -
State: closed - Opened by rocking5566 18 days ago
- 1 comment
#1604 - [CK_TILE] layernorm support fused-quant/fused-add
Pull Request -
State: closed - Opened by carlushuang 18 days ago
#1603 - [Discussion] Do we have/Where can we find swizzling rules in ck to avoid bank conflict?
Issue -
State: open - Opened by LeiWang1999 18 days ago
- 6 comments
Labels: Under Investigation
#1602 - Pipeline matrix b shuffle
Pull Request -
State: open - Opened by ThomasNing 19 days ago
- 1 comment
#1601 - Ck tile gemm fixes
Pull Request -
State: closed - Opened by jakpiase 20 days ago
#1600 - Polished Grouped GEMM APIs and new BF16 instances
Pull Request -
State: open - Opened by aosewski 20 days ago
#1599 - add rounding converter
Pull Request -
State: open - Opened by dummycoderfe 21 days ago
#1598 - Hot fix ln precision rounding
Pull Request -
State: closed - Opened by dummycoderfe 21 days ago
#1597 - hot_fix epsilon pos
Pull Request -
State: closed - Opened by dummycoderfe 21 days ago
#1596 - Update GPU verification
Pull Request -
State: closed - Opened by geyyer 23 days ago
#1595 - fix the logic of enabling XDL and WMMA instances
Pull Request -
State: closed - Opened by illsilin 23 days ago
#1594 - [POST MERGE PR] Enable grouped conv bwd wei bf16 NHWGC
Pull Request -
State: closed - Opened by bartekxk 23 days ago
#1593 - Explicit cast values to half
Pull Request -
State: closed - Opened by cjatin 23 days ago
#1592 - topk_softmax
Pull Request -
State: closed - Opened by carlushuang 24 days ago
- 1 comment
#1591 - add int8 gemm multiply multiply a8w8
Pull Request -
State: closed - Opened by valarLip 24 days ago
#1590 - [GEMM] Congruous GEMM optimization
Pull Request -
State: open - Opened by aska-0096 24 days ago
Labels: enhancement
#1589 - Enable grouped conv bwd wei bf16 NGCHW
Pull Request -
State: closed - Opened by bartekxk 24 days ago
#1588 - [CK_TILE] More fmha splitkv optimizations
Pull Request -
State: closed - Opened by poyenc 24 days ago
#1587 - Bump rocm-docs-core from 1.8.2 to 1.8.3 in /docs/sphinx
Pull Request -
State: closed - Opened by dependabot[bot] 25 days ago
Labels: documentation, dependencies, ci:docs-only
#1586 - [Development] How to create a temp tile window from a pointer?
Issue -
State: closed - Opened by LeiWang1999 26 days ago
- 8 comments
#1585 - [PT Inductor] Add parsing grouped conv fwd instances
Pull Request -
State: closed - Opened by tenpercent 27 days ago
#1584 - disable bad instance detected on MI308CPX
Pull Request -
State: closed - Opened by aska-0096 27 days ago
#1583 - Fix layernorm F16 type in ckProfiler
Pull Request -
State: closed - Opened by rocking5566 27 days ago
#1582 - Add --lsr-drop-solution=1 compiler flag.
Pull Request -
State: closed - Opened by illsilin 28 days ago
#1581 - [Issue]: Error linking ckProfiler
Issue -
State: open - Opened by RandUser123sa 28 days ago
- 2 comments
Labels: Under Investigation
#1580 - [Issue]: Cannot receive the correct result while using DeviceBatchedGemmMultiD_Xdl
Issue -
State: closed - Opened by hoangvictor 28 days ago
- 6 comments
Labels: Under Investigation
#1579 - Codegen hipRTC compilation
Pull Request -
State: open - Opened by arai713 29 days ago
#1578 - added link to documentation
Pull Request -
State: closed - Opened by spolifroni-amd 29 days ago
Labels: documentation, ci:docs-only
#1577 - [CK_TILE] Optimize fmha splitkv & splitkv combine kernels
Pull Request -
State: closed - Opened by poyenc 30 days ago
#1576 - Update default stride
Pull Request -
State: closed - Opened by geyyer 30 days ago
#1575 - Ck profiler instance support
Pull Request -
State: closed - Opened by ThomasNing about 1 month ago
- 1 comment
#1574 - Rebase the PR #1520 to ROCm repo.
Pull Request -
State: open - Opened by illsilin about 1 month ago
#1572 - [EXPERIMENTNAL][DO NOT MEREG] Add a prototype of F16/BF16xINT4 GEMM
Pull Request -
State: open - Opened by zjing14 about 1 month ago
#1570 - update layernorm
Pull Request -
State: closed - Opened by ltqin about 1 month ago
#1568 - remove the --rm docker container flags
Pull Request -
State: closed - Opened by illsilin about 1 month ago
#1565 - only build tests and examples if user sets GPU_TARGETS
Pull Request -
State: closed - Opened by illsilin about 1 month ago
#1562 - Grouped gemm fixes
Pull Request -
State: closed - Opened by bartekxk about 1 month ago
#1558 - [CK-Tile] Universal gemm memory bound pipeline
Pull Request -
State: closed - Opened by aosewski about 1 month ago
- 1 comment
#1556 - Build codegen as standalone
Pull Request -
State: closed - Opened by pfultz2 about 1 month ago
#1554 - Fixes small memory leak from missing hipEventDestroy
Pull Request -
State: closed - Opened by cgmillette about 1 month ago
#1551 - [Issue]: amd_wave_read_first_lane is ambiguous
Issue -
State: closed - Opened by RichardGe about 1 month ago
- 4 comments
Labels: Under Investigation
#1546 - Generic threshold calculation
Pull Request -
State: closed - Opened by aledudek about 1 month ago
#1543 - Introduce gemm_elementwise_gemm
Pull Request -
State: open - Opened by mirza-halilcevic about 1 month ago
#1542 - Introduce gemm_softmax_gemm to codegen
Pull Request -
State: open - Opened by mirza-halilcevic about 1 month ago
- 1 comment
#1541 - BF16 GEMM Stream-K
Pull Request -
State: open - Opened by ozturkosu about 1 month ago
- 2 comments
Labels: bug
#1520 - Enable hipRTC compilation of codegen tests
Pull Request -
State: closed - Opened by music-dino about 2 months ago
- 3 comments
#1514 - [Question] Register data layout for two consecutive GEMMs in flash attention kernel (how is TransposedC implemented)?
Issue -
State: closed - Opened by bulffi about 2 months ago
- 3 comments
Labels: question, Under Investigation
#1434 - WMMA / RDNA3+ kernels for backwards fused attention?
Issue -
State: closed - Opened by Googulator 3 months ago
- 3 comments
Labels: Under Investigation
#1431 - debug build got error: R_X86_64_REX_GOTPCRELX | R_X86_64_PC32 out of range
Issue -
State: closed - Opened by ZJLi2013 4 months ago
- 3 comments
Labels: help wanted, Under Investigation
#1426 - Add dynamic elementwise op
Pull Request -
State: closed - Opened by bartekxk 4 months ago
#1333 - Add custom type vector support
Pull Request -
State: closed - Opened by geyyer 5 months ago
- 7 comments
#1199 - int4 inverse quantization and gemm on existing templates
Issue -
State: closed - Opened by xiabo123 8 months ago
- 5 comments
Labels: question, Under Investigation
#886 - Fused Attention Kernel with gfx1030?
Issue -
State: closed - Opened by onesnep about 1 year ago
- 4 comments
Labels: Under Investigation
#779 - Sequence length 1 GEMV alternative for fused attention
Issue -
State: closed - Opened by cloudhan over 1 year ago
- 2 comments
Labels: enhancement
#775 - Compilation error for navi10 (use of undeclared identifier 'CK_BUFFER_RESOURCE_3RD_DWORD')
Issue -
State: open - Opened by TyraVex over 1 year ago
- 38 comments
#362 - Enhance PartitionedBlockwiseReduction interface to allow more diverse reduction use cases
Issue -
State: open - Opened by rosenrodt about 2 years ago
- 2 comments
Labels: Under Investigation
#266 - Pointwise kernel choose grid size based on number of CU
Issue -
State: closed - Opened by asroy over 2 years ago
- 3 comments
Labels: code quality, Under Investigation
#250 - Jenkins CI doesn't carry build cache from last stages
Issue -
State: closed - Opened by rosenrodt over 2 years ago
- 4 comments
Labels: Under Investigation
#249 - Do not check for size compatibility during MakeArgument()
Issue -
State: open - Opened by rosenrodt over 2 years ago
Labels: code quality, Under Investigation
#236 - example_conv2d_fwd_xdl_bias_relu_add produce wrong result
Issue -
State: open - Opened by asroy over 2 years ago
- 1 comment
Labels: bug, Under Investigation
#227 - github allow "merge" PR even CI is not finished
Issue -
State: closed - Opened by asroy over 2 years ago
- 1 comment
Labels: bug, Under Investigation
#177 - Kernels with LDS bank conflicts
Issue -
State: open - Opened by rosenrodt over 2 years ago
- 2 comments
Labels: Performance Issue, Under Investigation