Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / rocm/composable_kernel issues and pull requests
#1838 - Cka8w8 uc newpipe
Pull Request -
State: closed - Opened by aska-0096 7 days ago
#1837 - data type (fp8, bf8) support for gemm example in CK_Tile
Pull Request -
State: closed - Opened by kylasa 8 days ago
- 1 comment
#1836 - Adding support for bf16, fp8 and bf8 datatypes for ck_tile/gemm example
Pull Request -
State: closed - Opened by kylasa 8 days ago
#1835 - [CK-Tile] Enable vectorized reads on all layouts & improve perf.
Pull Request -
State: open - Opened by aosewski 8 days ago
#1834 - [CK_TILE] not using structures under ck_tile/ops for ck_tile/host
Pull Request -
State: closed - Opened by carlushuang 8 days ago
#1833 - [CK_TILE] not including tensor_layout from ck_tile/ops to ck_tile/host.hpp
Issue -
State: open - Opened by carlushuang 8 days ago
#1832 - Bump rocm-docs-core from 1.13.0 to 1.14.1 in /docs/sphinx
Pull Request -
State: closed - Opened by dependabot[bot] 8 days ago
Labels: documentation, dependencies, ci:docs-only
#1831 - Add Conv NGCHW client example
Pull Request -
State: closed - Opened by bartekxk 9 days ago
#1830 - [CK_TILE] moe-smoothquant support fp8 output
Pull Request -
State: closed - Opened by carlushuang 9 days ago
#1829 - Add OCP FP8 support in CK_TILE
Pull Request -
State: open - Opened by andriy-ca 10 days ago
- 3 comments
#1828 - [CK-Tile] Refactor the is_same_v conditions
Pull Request -
State: closed - Opened by mozga-amd 10 days ago
#1827 - Refactor file structure dev
Pull Request -
State: closed - Opened by aledudek 11 days ago
- 1 comment
#1826 - [CK_TILE] Refactor ck_tile file structure
Pull Request -
State: closed - Opened by aledudek 11 days ago
- 3 comments
#1825 - Added bf16 instances grouped gemm fixed nk
Pull Request -
State: closed - Opened by deepsek 12 days ago
#1824 - Add make_kernel_pt for specific architecture compilation guards
Pull Request -
State: open - Opened by alugorey 13 days ago
- 4 comments
#1823 - Add bf16 instances for grouped gemm fixed nk
Pull Request -
State: closed - Opened by deepsek 13 days ago
- 1 comment
#1822 - Add bf16 instances grouped gemm fixed nk
Pull Request -
State: closed - Opened by deepsek 13 days ago
- 1 comment
#1821 - [CK_TILE] Add error threshold calculation for gemm examples
Pull Request -
State: closed - Opened by bartekxk 13 days ago
- 1 comment
#1820 - fix a bug for int4 scale weight only kernel
Pull Request -
State: closed - Opened by mtgu0705 13 days ago
Labels: bug, CI - Testing
#1819 - Implementing Test Filters for Smoke and Regression Tests
Pull Request -
State: closed - Opened by AviralGoelAMD 14 days ago
- 1 comment
Labels: good first issue
#1818 - Fix and optimize dynamic unary elementwise
Pull Request -
State: closed - Opened by bartekxk 14 days ago
#1817 - Change flag to CK_GFX90A_DENORM_WORKAROUND
Pull Request -
State: open - Opened by darren-amd 15 days ago
#1816 - Disable inductor codegen tests on legacy OS
Pull Request -
State: closed - Opened by illsilin 15 days ago
#1815 - Add rounding for float to bf16 conversion as default (rel-6.2)
Pull Request -
State: closed - Opened by bartekxk 15 days ago
#1814 - [CK_TILE] Implement fp8 quant tests/examples for layernorm and rmsnorm
Pull Request -
State: closed - Opened by ruanjm 15 days ago
#1813 - Prec param new
Pull Request -
State: open - Opened by aledudek 16 days ago
#1812 - Add rounding for float to bf16 conversion as default
Pull Request -
State: closed - Opened by bartekxk 17 days ago
- 2 comments
#1811 - [CK_TILE] Use the GEMM example prec argument
Pull Request -
State: open - Opened by aledudek 18 days ago
- 1 comment
#1810 - Update for fmha_fwd qs_ks_vs pipeline
Pull Request -
State: closed - Opened by qianfengz 18 days ago
#1809 - CK Tile Gemm API and heuristics changes
Pull Request -
State: open - Opened by jakpiase 20 days ago
#1808 - [CK_TILE] Fix mock token id, support g1u1/g1u0 through same inline code block
Pull Request -
State: closed - Opened by carlushuang 21 days ago
#1807 - enable int4 scale (weight only) kernel
Pull Request -
State: closed - Opened by mtgu0705 21 days ago
#1806 - Created a branch int4_pr_based_on_JingPR
Pull Request -
State: open - Opened by mtgu0705 21 days ago
#1805 - Revert "[Draft] Revive QsKsVs FMHA pipeline"
Pull Request -
State: closed - Opened by poyenc 21 days ago
#1804 - Disable building DPP kernels by default
Pull Request -
State: closed - Opened by darren-amd 22 days ago
#1803 - updated int4_pr_debug
Pull Request -
State: closed - Opened by mtgu0705 22 days ago
Labels: bug, code quality, CI - Testing
#1802 - [CK_TILE] Add Various Fusion Functions to RMSNorm
Pull Request -
State: closed - Opened by ruanjm 22 days ago
- 1 comment
Labels: feature request
#1801 - Update LICENSE to 2025 (#1797)
Pull Request -
State: open - Opened by spolifroni-amd 23 days ago
Labels: documentation, ci:docs-only
#1800 - [Draft] | GPUAI-3720 - Integrate Universal GEMM into Grouped GEMM - Pt 1
Pull Request -
State: open - Opened by rtmadduri 23 days ago
#1799 - Update in GridSize() and using GridSize() for splitkv kernel
Pull Request -
State: closed - Opened by qianfengz 23 days ago
#1798 - Bump rocm-docs-core from 1.12.1 to 1.13.0 in /docs/sphinx
Pull Request -
State: closed - Opened by dependabot[bot] 23 days ago
Labels: documentation, dependencies, ci:docs-only
#1797 - Update LICENSE to 2025
Pull Request -
State: closed - Opened by spolifroni-amd 24 days ago
#1796 - Fix parsing instances for pt inductor
Pull Request -
State: closed - Opened by tenpercent 24 days ago
#1795 - Cross GPU Reduce Operator Initial Development
Pull Request -
State: open - Opened by ThomasNing 24 days ago
#1794 - Add CK_TIME_KERNEL as toggleable CMake Variable
Pull Request -
State: closed - Opened by lucbruni-amd 24 days ago
- 1 comment
#1793 - [CK_TILE] Support moe with up gemm
Pull Request -
State: open - Opened by huaiguxu 26 days ago
- 2 comments
#1792 - terminology clean-up
Pull Request -
State: closed - Opened by illsilin 27 days ago
#1791 - [CK_TILE] Add GetName for GEMM kernels
Pull Request -
State: open - Opened by aledudek 27 days ago
#1790 - Fix universal gemm profiler for pk_i4_t
Pull Request -
State: closed - Opened by bartekxk 27 days ago
- 1 comment
#1789 - [CK_TILE] fmha fwd splitkv optimization for decode (seqlen_q=1)
Pull Request -
State: closed - Opened by poyenc 27 days ago
#1788 - Bump rocm-docs-core from 1.12.0 to 1.12.1 in /docs/sphinx
Pull Request -
State: closed - Opened by dependabot[bot] 27 days ago
Labels: documentation, dependencies, ci:docs-only
#1787 - Add afagaj to CODEOWNERS
Pull Request -
State: closed - Opened by afagaj 28 days ago
#1786 - Implement the fp16xint4 scale weight only kernel for Ali
Pull Request -
State: closed - Opened by mtgu0705 28 days ago
Labels: enhancement, CI - Testing, priority
#1785 - [CK_TILE] Sync fmha fwd splitkv minor optimizations
Pull Request -
State: open - Opened by poyenc 29 days ago
#1784 - Ck tile/layernorm: implement naive reduce, opt performance
Pull Request -
State: closed - Opened by coderfeli about 1 month ago
#1783 - Add NGCHW bf16 grouped conv fwd instances
Pull Request -
State: closed - Opened by bartekxk about 1 month ago
- 2 comments
#1782 - [Issue]: Does not honor cmake BUILD_SHARED
Issue -
State: open - Opened by trixirt about 1 month ago
- 2 comments
Labels: Under Investigation
#1781 - [Issue]: RFE cmake BUILD_EXAMPLES
Issue -
State: open - Opened by trixirt about 1 month ago
- 1 comment
Labels: Under Investigation
#1780 - [Issue]: CK_TIME_KERNEL used by default
Issue -
State: closed - Opened by trixirt about 1 month ago
- 5 comments
Labels: Under Investigation
#1779 - [CK_TILE] Adjust kBlockSize of reduce example for better perf
Pull Request -
State: closed - Opened by ClementLinCF about 1 month ago
#1778 - Remove using partitioner for all fmha kernels
Pull Request -
State: closed - Opened by qianfengz about 1 month ago
#1777 - [Issue]: Some kernel pass AB0B1 and output as std::vector<const void*>
Issue -
State: closed - Opened by Jay19751103 about 1 month ago
- 3 comments
Labels: Under Investigation
#1776 - CK Tile GEMM CICD fixed & register block method refactor
Pull Request -
State: closed - Opened by ThomasNing about 1 month ago
- 3 comments
#1775 - [CK_TILE] Fix fmha fwd splitkv codegen error
Pull Request -
State: closed - Opened by poyenc about 1 month ago
#1774 - Dev/merge u8w8
Pull Request -
State: closed - Opened by coderfeli about 1 month ago
- 1 comment
#1773 - [Issue]: `lld: error: undefined hidden symbol: unsigned short ck::atomic_add`
Issue -
State: closed - Opened by tjtanaa about 1 month ago
- 5 comments
Labels: Under Investigation
#1772 - Grouped convolution backward weight special vector size loads
Pull Request -
State: open - Opened by bartekxk about 1 month ago
#1772 - Grouped convolution backward weight special vector size loads
Pull Request -
State: closed - Opened by bartekxk about 1 month ago
#1771 - [CK_TILE] optimize moe-sorting kernel
Pull Request -
State: closed - Opened by carlushuang about 1 month ago
#1770 - Promote develop into amd-develop
Pull Request -
State: closed - Opened by illsilin about 1 month ago
#1769 - fix typo for CK_USE_OCP_FP8
Pull Request -
State: closed - Opened by illsilin about 1 month ago
#1768 - hot-fix missing flags
Pull Request -
State: closed - Opened by carlushuang about 1 month ago
#1767 - Promote latest CK develop
Pull Request -
State: closed - Opened by illsilin about 1 month ago
#1766 - fix profiler_grouped_gemm
Pull Request -
State: closed - Opened by illsilin about 1 month ago
#1765 - [Draft] Revive QsKsVs FMHA pipeline
Pull Request -
State: closed - Opened by tenpercent about 1 month ago
- 1 comment
#1764 - fix: preprocessor directives logic error if/else
Pull Request -
State: closed - Opened by deepsek about 1 month ago
#1763 - device_prop.hpp: move static map to helper function and initialize there
Pull Request -
State: open - Opened by coconutruben about 1 month ago
- 2 comments
#1763 - device_prop.hpp: move static map to helper function and initialize there
Pull Request -
State: open - Opened by coconutruben about 1 month ago
- 1 comment
#1762 - Jing's contribution: prototype of mixed precision gemm FP16/BF16xint4 GEMM
Pull Request -
State: closed - Opened by aosewski about 1 month ago
- 7 comments
Labels: enhancement, CI - Testing, external contribution
#1761 - disable _dpp instances for non-gfx10/gfx11 devices
Pull Request -
State: closed - Opened by LunNova about 1 month ago
- 6 comments
#1760 - Pass build flags to config.h
Pull Request -
State: closed - Opened by illsilin about 1 month ago
#1759 - [Issue]: Build failure for gfx908 when building without optimization flags
Issue -
State: closed - Opened by LunNova about 1 month ago
- 11 comments
Labels: Under Investigation
#1759 - [Issue]: Build failure for gfx908 when building without optimization flags
Issue -
State: open - Opened by LunNova about 1 month ago
- 7 comments
Labels: Under Investigation
#1758 - Apply Ck-tile argument parser for vectors [I/O]
Pull Request -
State: closed - Opened by mozga-amd about 1 month ago
#1757 - [Issue]: [xformers] NotImplementedError: No operator found for memory_efficient_attention_forward with inputs
Issue -
State: open - Opened by Looong01 about 1 month ago
- 4 comments
Labels: enhancement, Under Investigation, feature request
#1757 - [Issue]: [xformers] NotImplementedError: No operator found for memory_efficient_attention_forward with inputs
Issue -
State: closed - Opened by Looong01 about 1 month ago
- 5 comments
Labels: enhancement, Under Investigation, feature request
#1756 - CK-Tile Grouped GEMM refactor and post PR fixes
Pull Request -
State: open - Opened by mozga-amd about 1 month ago
#1756 - CK-Tile Grouped GEMM refactor and post PR fixes
Pull Request -
State: closed - Opened by mozga-amd about 1 month ago
#1755 - Use s_shuffling to replace p_shuffling which removes the needs of cross-warp reduction
Pull Request -
State: closed - Opened by qianfengz about 1 month ago
#1754 - updated fp16 instances to be on parity with universal gemm instances
Pull Request -
State: closed - Opened by hsadasiv about 1 month ago
#1753 - Bump rocm-docs-core from 1.11.0 to 1.12.0 in /docs/sphinx
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: documentation, dependencies, ci:docs-only
#1752 - [Ck tile] Use raw store to improve layernorm performance
Pull Request -
State: open - Opened by rocking5566 about 1 month ago