Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / rocm/composable_kernel issues and pull requests

#1643 - Add generic instances for two stage conv bwd wei

Pull Request - State: open - Opened by bartekxk 8 days ago

#1642 - MoeSorting fuse set zero

Pull Request - State: closed - Opened by dummycoderfe 8 days ago

#1641 - Compilation failure with modified tuning parameters on CK Tile GEMM

Issue - State: open - Opened by zjing14 9 days ago - 1 comment

#1640 - remove gfx940;gfx941 from default target lists

Pull Request - State: closed - Opened by illsilin 9 days ago

#1639 - Prevent instantiation of undefined FP8 operators.

Pull Request - State: closed - Opened by andriy-ca 9 days ago

#1637 - [Ck tile] layernorm2d fwd optimize

Pull Request - State: open - Opened by dummycoderfe 9 days ago - 1 comment

#1636 - Update ck_a8w8

Pull Request - State: open - Opened by aska-0096 9 days ago

#1635 - [generate.py] Override blob list if it already exists

Pull Request - State: closed - Opened by jmmartinez 9 days ago

#1634 - [WIP][CK_TILE] moe

Pull Request - State: open - Opened by carlushuang 10 days ago

#1633 - Make sure cmake can handle the xnack+/xnack- targets.

Pull Request - State: closed - Opened by illsilin 10 days ago

#1632 - A prototype of TF32 gemm

Pull Request - State: open - Opened by zjing14 10 days ago

#1631 - Statically Cast Pointer Offset

Pull Request - State: closed - Opened by darren-amd 10 days ago

#1630 - Temporary disable part of dynamic op conv instances

Pull Request - State: closed - Opened by bartekxk 10 days ago

#1629 - [CK_TILE] Allow using default gemm pipeline policy

Pull Request - State: open - Opened by poyenc 10 days ago

#1628 - [do not review] int4 scale based on jzhang's pre work

Pull Request - State: open - Opened by mtgu0705 10 days ago
Labels: noCI

#1627 - [DO NOT REVIEW]

Pull Request - State: open - Opened by mtgu0705 10 days ago
Labels: noCI

#1626 - Linsun/convint8 fwd instances

Pull Request - State: closed - Opened by linsun12 13 days ago - 4 comments

#1625 - Linsun/convint8 fwd instances

Pull Request - State: closed - Opened by linsun12 13 days ago

#1624 - Ck tile/moe sorting

Pull Request - State: open - Opened by dummycoderfe 13 days ago

#1623 - [CK_TILE] layernorm have more accurate residual

Pull Request - State: closed - Opened by carlushuang 13 days ago - 1 comment

#1622 - [CK_TILE] Add small warp gemm

Pull Request - State: open - Opened by poyenc 14 days ago

#1621 - Reduce build time.

Pull Request - State: closed - Opened by illsilin 14 days ago

#1620 - [layernorm] hot fix

Pull Request - State: closed - Opened by carlushuang 14 days ago

#1619 - [CK_TILE] Add operator batched_transpose

Pull Request - State: open - Opened by fangche123 14 days ago - 1 comment

#1618 - Generic threshold calculation after merge fixes

Pull Request - State: closed - Opened by aledudek 14 days ago

#1617 - [Ck_tile] smoothquant

Pull Request - State: closed - Opened by rocking5566 14 days ago

#1616 - [Ck tile] smoothquant

Pull Request - State: closed - Opened by rocking5566 15 days ago

#1615 - Ck tile batched gemm example

Pull Request - State: open - Opened by aledudek 15 days ago

#1614 - Batched GEMM Multiple D based on Universal GEMM

Pull Request - State: open - Opened by zjing14 15 days ago

#1613 - fix clang format

Pull Request - State: closed - Opened by illsilin 15 days ago - 1 comment

#1612 - [HOTFIX] fix ci fail

Pull Request - State: closed - Opened by rocking5566 15 days ago - 1 comment

#1611 - CK Tile Batched gemm example

Pull Request - State: closed - Opened by aledudek 16 days ago

#1610 - Remove virtual destructors from unary ops

Pull Request - State: closed - Opened by bartekxk 16 days ago

#1609 - [CK_TILE] add scatter_gather

Pull Request - State: closed - Opened by valarLip 16 days ago

#1608 - [CK_TILE] Add fmha fwd headdim96 support

Pull Request - State: closed - Opened by qianfengz 16 days ago

#1607 - [CK_TILE] add generic_permute

Pull Request - State: closed - Opened by valarLip 16 days ago

#1606 - fix compilation errors for gfx12 with clang20

Pull Request - State: closed - Opened by illsilin 17 days ago

#1605 - [Ck tile] support rmsnorm and related fusion

Pull Request - State: closed - Opened by rocking5566 18 days ago - 1 comment

#1604 - [CK_TILE] layernorm support fused-quant/fused-add

Pull Request - State: closed - Opened by carlushuang 18 days ago

#1603 - [Discussion] Do we have/Where can we find swizzling rules in ck to avoid bank conflict?

Issue - State: open - Opened by LeiWang1999 18 days ago - 6 comments
Labels: Under Investigation

#1602 - Pipeline matrix b shuffle

Pull Request - State: open - Opened by ThomasNing 19 days ago - 1 comment

#1601 - Ck tile gemm fixes

Pull Request - State: closed - Opened by jakpiase 20 days ago

#1600 - Polished Grouped GEMM APIs and new BF16 instances

Pull Request - State: open - Opened by aosewski 20 days ago

#1599 - add rounding converter

Pull Request - State: open - Opened by dummycoderfe 21 days ago

#1598 - Hot fix ln precision rounding

Pull Request - State: closed - Opened by dummycoderfe 21 days ago

#1597 - hot_fix epsilon pos

Pull Request - State: closed - Opened by dummycoderfe 21 days ago

#1596 - Update GPU verification

Pull Request - State: closed - Opened by geyyer 23 days ago

#1595 - fix the logic of enabling XDL and WMMA instances

Pull Request - State: closed - Opened by illsilin 23 days ago

#1594 - [POST MERGE PR] Enable grouped conv bwd wei bf16 NHWGC

Pull Request - State: closed - Opened by bartekxk 23 days ago

#1593 - Explicit cast values to half

Pull Request - State: closed - Opened by cjatin 23 days ago

#1592 - topk_softmax

Pull Request - State: closed - Opened by carlushuang 24 days ago - 1 comment

#1591 - add int8 gemm multiply multiply a8w8

Pull Request - State: closed - Opened by valarLip 24 days ago

#1590 - [GEMM] Congruous GEMM optimization

Pull Request - State: open - Opened by aska-0096 24 days ago
Labels: enhancement

#1589 - Enable grouped conv bwd wei bf16 NGCHW

Pull Request - State: closed - Opened by bartekxk 24 days ago

#1588 - [CK_TILE] More fmha splitkv optimizations

Pull Request - State: closed - Opened by poyenc 24 days ago

#1587 - Bump rocm-docs-core from 1.8.2 to 1.8.3 in /docs/sphinx

Pull Request - State: closed - Opened by dependabot[bot] 25 days ago
Labels: documentation, dependencies, ci:docs-only

#1586 - [Development] How to create a temp tile window from a pointer?

Issue - State: closed - Opened by LeiWang1999 26 days ago - 8 comments

#1585 - [PT Inductor] Add parsing grouped conv fwd instances

Pull Request - State: closed - Opened by tenpercent 27 days ago

#1584 - disable bad instance detected on MI308CPX

Pull Request - State: closed - Opened by aska-0096 27 days ago

#1583 - Fix layernorm F16 type in ckProfiler

Pull Request - State: closed - Opened by rocking5566 27 days ago

#1582 - Add --lsr-drop-solution=1 compiler flag.

Pull Request - State: closed - Opened by illsilin 28 days ago

#1581 - [Issue]: Error linking ckProfiler

Issue - State: open - Opened by RandUser123sa 28 days ago - 2 comments
Labels: Under Investigation

#1580 - [Issue]: Cannot receive the correct result while using DeviceBatchedGemmMultiD_Xdl

Issue - State: closed - Opened by hoangvictor 28 days ago - 6 comments
Labels: Under Investigation

#1579 - Codegen hipRTC compilation

Pull Request - State: open - Opened by arai713 29 days ago

#1578 - added link to documentation

Pull Request - State: closed - Opened by spolifroni-amd 29 days ago
Labels: documentation, ci:docs-only

#1577 - [CK_TILE] Optimize fmha splitkv & splitkv combine kernels

Pull Request - State: closed - Opened by poyenc 30 days ago

#1576 - Update default stride

Pull Request - State: closed - Opened by geyyer 30 days ago

#1575 - Ck profiler instance support

Pull Request - State: closed - Opened by ThomasNing about 1 month ago - 1 comment

#1574 - Rebase the PR #1520 to ROCm repo.

Pull Request - State: open - Opened by illsilin about 1 month ago

#1572 - [EXPERIMENTNAL][DO NOT MEREG] Add a prototype of F16/BF16xINT4 GEMM

Pull Request - State: open - Opened by zjing14 about 1 month ago

#1570 - update layernorm

Pull Request - State: closed - Opened by ltqin about 1 month ago

#1568 - remove the --rm docker container flags

Pull Request - State: closed - Opened by illsilin about 1 month ago

#1565 - only build tests and examples if user sets GPU_TARGETS

Pull Request - State: closed - Opened by illsilin about 1 month ago

#1562 - Grouped gemm fixes

Pull Request - State: closed - Opened by bartekxk about 1 month ago

#1558 - [CK-Tile] Universal gemm memory bound pipeline

Pull Request - State: closed - Opened by aosewski about 1 month ago - 1 comment

#1556 - Build codegen as standalone

Pull Request - State: closed - Opened by pfultz2 about 1 month ago

#1554 - Fixes small memory leak from missing hipEventDestroy

Pull Request - State: closed - Opened by cgmillette about 1 month ago

#1551 - [Issue]: amd_wave_read_first_lane is ambiguous

Issue - State: closed - Opened by RichardGe about 1 month ago - 4 comments
Labels: Under Investigation

#1546 - Generic threshold calculation

Pull Request - State: closed - Opened by aledudek about 1 month ago

#1543 - Introduce gemm_elementwise_gemm

Pull Request - State: open - Opened by mirza-halilcevic about 1 month ago

#1542 - Introduce gemm_softmax_gemm to codegen

Pull Request - State: open - Opened by mirza-halilcevic about 1 month ago - 1 comment

#1541 - BF16 GEMM Stream-K

Pull Request - State: open - Opened by ozturkosu about 1 month ago - 2 comments
Labels: bug

#1520 - Enable hipRTC compilation of codegen tests

Pull Request - State: closed - Opened by music-dino about 2 months ago - 3 comments

#1514 - [Question] Register data layout for two consecutive GEMMs in flash attention kernel (how is TransposedC implemented)?

Issue - State: closed - Opened by bulffi about 2 months ago - 3 comments
Labels: question, Under Investigation

#1434 - WMMA / RDNA3+ kernels for backwards fused attention?

Issue - State: closed - Opened by Googulator 3 months ago - 3 comments
Labels: Under Investigation

#1431 - debug build got error: R_X86_64_REX_GOTPCRELX | R_X86_64_PC32 out of range

Issue - State: closed - Opened by ZJLi2013 4 months ago - 3 comments
Labels: help wanted, Under Investigation

#1426 - Add dynamic elementwise op

Pull Request - State: closed - Opened by bartekxk 4 months ago

#1333 - Add custom type vector support

Pull Request - State: closed - Opened by geyyer 5 months ago - 7 comments

#1199 - int4 inverse quantization and gemm on existing templates

Issue - State: closed - Opened by xiabo123 8 months ago - 5 comments
Labels: question, Under Investigation

#886 - Fused Attention Kernel with gfx1030?

Issue - State: closed - Opened by onesnep about 1 year ago - 4 comments
Labels: Under Investigation

#779 - Sequence length 1 GEMV alternative for fused attention

Issue - State: closed - Opened by cloudhan over 1 year ago - 2 comments
Labels: enhancement

#362 - Enhance PartitionedBlockwiseReduction interface to allow more diverse reduction use cases

Issue - State: open - Opened by rosenrodt about 2 years ago - 2 comments
Labels: Under Investigation

#266 - Pointwise kernel choose grid size based on number of CU

Issue - State: closed - Opened by asroy over 2 years ago - 3 comments
Labels: code quality, Under Investigation

#250 - Jenkins CI doesn't carry build cache from last stages

Issue - State: closed - Opened by rosenrodt over 2 years ago - 4 comments
Labels: Under Investigation

#249 - Do not check for size compatibility during MakeArgument()

Issue - State: open - Opened by rosenrodt over 2 years ago
Labels: code quality, Under Investigation

#236 - example_conv2d_fwd_xdl_bias_relu_add produce wrong result

Issue - State: open - Opened by asroy over 2 years ago - 1 comment
Labels: bug, Under Investigation

#227 - github allow "merge" PR even CI is not finished

Issue - State: closed - Opened by asroy over 2 years ago - 1 comment
Labels: bug, Under Investigation

#177 - Kernels with LDS bank conflicts

Issue - State: open - Opened by rosenrodt over 2 years ago - 2 comments
Labels: Performance Issue, Under Investigation