Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / triton-lang/triton issues and pull requests
#4582 - [BACKEND] Optimize code generation for load with other arg
Pull Request -
State: closed - Opened by ThomasRaoux 3 months ago
- 4 comments
#4581 - [WIP] Optimize fma dot
Pull Request -
State: open - Opened by binarman 3 months ago
- 1 comment
#4580 - Bump tj-actions/changed-files from 44 to 45
Pull Request -
State: closed - Opened by dependabot[bot] 3 months ago
Labels: dependencies
#4579 - [BE][PIPELINE] Add fix for the wgmma pipelining bug with subview dist 1
Pull Request -
State: closed - Opened by pawelszczerbuk 3 months ago
#4578 - [BACKEND] Update LLVM version to https://github.com/llvm/llvm-project/commit/7f7f4feaf07dd3bb4b22d0c25d34b6c99c753aa2
Pull Request -
State: closed - Opened by karupayun 3 months ago
#4577 - RuntimeError: Triton Error [CUDA]: an illegal memory access was encountered
Issue -
State: open - Opened by Abhijit89Kumar 3 months ago
- 1 comment
#4576 - PTX version error with CUDA 12.6
Issue -
State: closed - Opened by sclarkson 3 months ago
- 1 comment
#4575 - The number of generated TTIR files
Issue -
State: closed - Opened by LHW-CLOUD 3 months ago
- 2 comments
#4574 - `tf32` matrix multiplication is much less accurate than `fp16` matrix multiplication (in relative error)
Issue -
State: closed - Opened by lengstrom 3 months ago
- 5 comments
#4573 - IEEE is common and TF32 is specific to CUDA
Pull Request -
State: closed - Opened by parsifal-47 3 months ago
#4572 - [Readme] Fix command to get compile command
Pull Request -
State: closed - Opened by jayzhan211 3 months ago
- 2 comments
#4571 - [BACKEND] Continue the backward slice when finding free convert
Pull Request -
State: closed - Opened by Jokeren 3 months ago
- 1 comment
#4570 - .so文件的作用?
Issue -
State: closed - Opened by LHW-CLOUD 3 months ago
- 3 comments
#4569 - [AMD] Cherry-pick commits from mainline to support Flex attention on AMD gpus
Pull Request -
State: closed - Opened by jerrymannil 3 months ago
#4568 - [Proton] Move additional hatchet import into try/except check
Pull Request -
State: closed - Opened by CRobeck 3 months ago
- 1 comment
#4567 - [BE][PIPELINE] Handle the case when values from the peeled prologue may escape out of the loop
Pull Request -
State: closed - Opened by pawelszczerbuk 3 months ago
#4566 - [FRONTEND] Print full file name when overriding kernel
Pull Request -
State: closed - Opened by htyu 3 months ago
#4565 - [Proton] Add a better description when possibly importing incorrect hatchet in Proton
Pull Request -
State: closed - Opened by CRobeck 3 months ago
#4564 - [RELEASE] Cherry pick device agnostic do_bench change
Pull Request -
State: open - Opened by anishsarum 3 months ago
- 1 comment
#4563 - [nvidia backend] Replace cvt instructions with bitwise operations in s8->bf16 conversions
Pull Request -
State: closed - Opened by chsigg 3 months ago
#4562 - Fix underflow in highestPowOf2Divisor()
Pull Request -
State: closed - Opened by chsigg 3 months ago
- 1 comment
#4561 - [BACKEND] Add a knob to fall back to the legacy mma layout conversion code
Pull Request -
State: closed - Opened by Jokeren 3 months ago
#4560 - DeepSpeed came to Windows natively - pip install deepspeed - where are you Triton?
Issue -
State: open - Opened by FurkanGozukara 3 months ago
#4559 - Hoist reduction outside a loop
Pull Request -
State: open - Opened by binarman 3 months ago
- 6 comments
#4558 - [TEST] Use device fixture for test_math_extern
Pull Request -
State: closed - Opened by int3 3 months ago
#4557 - Fix underflow in Triton's highestPowOf2Divisor function when the input is INT_MIN
Pull Request -
State: closed - Opened by Moerafaat 3 months ago
#4556 - [WIP] Try LLVM integrate
Pull Request -
State: closed - Opened by ThomasRaoux 3 months ago
#4555 - [CI][AMD] Re-enable MI200 CI
Pull Request -
State: closed - Opened by jungpark-mlir 3 months ago
- 2 comments
#4554 - [NFC] Simplify getThreadId function
Pull Request -
State: closed - Opened by linuxlonelyeagle 3 months ago
#4553 - Use base64 for shorter cache directories
Pull Request -
State: closed - Opened by minjang 3 months ago
#4552 - [Tutorial] 06-fused-attention.py - add tma
Pull Request -
State: closed - Opened by yzh119 3 months ago
- 1 comment
#4551 - Numeric errors with tf32 matmul on A100 GPU
Issue -
State: closed - Opened by axelfeldmann 3 months ago
- 3 comments
#4550 - Nothing found at https://oaitriton.blob.core.windows.net/public/llvm-builds/llvm-ce80c80d-centos-x64.tar.gz
Issue -
State: open - Opened by PaliC 3 months ago
#4549 - [CI][AMD] Disable MI200 CI
Pull Request -
State: closed - Opened by jungpark-mlir 3 months ago
- 5 comments
#4548 - [BACKEND] Update gcc debian package to point to a version 14.1.0-2 which exists in gcc-defaults.
Pull Request -
State: closed - Opened by khasanovaa 3 months ago
#4547 - [BACKEND][DRAFT] Adjust the padding heuristic for convert layout
Pull Request -
State: closed - Opened by Jokeren 3 months ago
#4546 - [RFC][FRONTEND] Use the index type for scf loop
Pull Request -
State: closed - Opened by htyu 3 months ago
- 4 comments
#4545 - [Frontend] Add TRITON_PRINT_AUTOTUNING_ALL flag
Pull Request -
State: open - Opened by plotfi 3 months ago
- 3 comments
#4544 - The question of implementing the addmm_ operator based on triton
Issue -
State: closed - Opened by hyx1999 3 months ago
- 1 comment
#4543 - [Release/3.0.x] Cherry pick Flex attention support from mainline
Pull Request -
State: closed - Opened by jerrymannil 3 months ago
- 1 comment
#4542 - [TEST] Insert barriers in test_atomic_cas to sequence store and atomic
Pull Request -
State: closed - Opened by jungpark-mlir 3 months ago
- 2 comments
#4541 - [Tutorial] Fused-attention `bwd` tutorial incorrect for `causal=True`
Issue -
State: open - Opened by michaelfeil 3 months ago
#4540 - Cherry Pick #4138 into release/3.0x branch
Pull Request -
State: open - Opened by drisspg 3 months ago
- 2 comments
#4539 - Add mechanism for remapping device-specific module imports
Pull Request -
State: closed - Opened by int3 3 months ago
- 2 comments
#4538 - [Backend] Bypass conversion for suitable blocked to dotOperand layout
Pull Request -
State: closed - Opened by binarman 3 months ago
#4537 - [BACKEND] Update LLVM version to https://github.com/llvm/llvm-project/commit/1115dee248e68a155001ac3712a189299d104863
Pull Request -
State: closed - Opened by khasanovaa 3 months ago
- 8 comments
#4536 - [BACKEND] Update LLVM version to https://github.com/llvm/llvm-project/commit/1115dee248e68a155001ac3712a189299d104863
Pull Request -
State: closed - Opened by khasanovaa 3 months ago
#4535 - [FRONTEND] `interleave` does not need to check shape
Pull Request -
State: closed - Opened by Mwsxy 3 months ago
- 2 comments
#4534 - [Pipeliner] Implement dynamic loop peeling
Pull Request -
State: closed - Opened by sjw36 3 months ago
- 15 comments
#4532 - [BACKEND] Fix common mistake of missing checks for null pointer
Pull Request -
State: closed - Opened by ThomasRaoux 3 months ago
- 4 comments
#4531 - Confusion about memory of pointers
Issue -
State: closed - Opened by FelixSchoen 3 months ago
- 1 comment
#4530 - [BACKEND] Fix the `divideRight` method in Linear Layout when eliminating input and output dimensions
Pull Request -
State: closed - Opened by Jokeren 3 months ago
- 7 comments
#4529 - [PIPELINER] Handling the case in WGMMA pipelining where subview is in previous iteration
Pull Request -
State: closed - Opened by pawelszczerbuk 3 months ago
#4528 - Enable verbose asm
Pull Request -
State: closed - Opened by ravil-mobile 3 months ago
- 4 comments
#4527 - triton build failing to access https://tritonlang.blob.core.windows.net/llvm-builds/ with HTTP Error 409
Issue -
State: open - Opened by lisaong 3 months ago
- 17 comments
#4526 - [CI][AMD] Create a CI docker with non-root user
Pull Request -
State: open - Opened by yiqian1 3 months ago
#4525 - [BACKEND][NFC] Interface for LinearLayout conversion.
Pull Request -
State: closed - Opened by hwnam831 3 months ago
- 5 comments
#4524 - feat: support `sem`/`scope` in `tl.{load,store}`
Pull Request -
State: closed - Opened by 0x804d8000 3 months ago
- 2 comments
#4523 - [NFC][BACKEND] Remove dead code in Nvidia backend's ConvertLayoutOpConversion
Pull Request -
State: open - Opened by Jokeren 3 months ago
- 1 comment
#4522 - What is the use of divisibility in coalesce pass
Issue -
State: open - Opened by Shoreshen 3 months ago
#4521 - FlexAttention Segmentation Fault
Issue -
State: open - Opened by drisspg 3 months ago
- 2 comments
#4520 - [CI][AMD] Enable MI300 CI
Pull Request -
State: closed - Opened by zhanglx13 3 months ago
#4519 - [BACKEND] Fp8E5M2Nv to Fp16 conversion support added to NV backend
Pull Request -
State: closed - Opened by plotfi 3 months ago
- 5 comments
#4518 - [AMD][gfx12] Support emit indices logic WMMAv2 layout
Pull Request -
State: closed - Opened by joviliast 3 months ago
#4517 - [DOC] Improve the description of Transformation passes
Pull Request -
State: closed - Opened by mfrancepillois 3 months ago
- 1 comment
#4516 - [Backend] Improve dot support to target FMA
Pull Request -
State: open - Opened by binarman 3 months ago
- 2 comments
#4515 - [NFC] Make the decomposeTensorCoreToDotLayoutConversion to be useable for the third party backend.
Pull Request -
State: closed - Opened by chengjunlu 3 months ago
- 3 comments
#4514 - [FRONTEND] Add hooks to signal that compilation is done
Pull Request -
State: closed - Opened by ThomasRaoux 3 months ago
- 1 comment
#4513 - make error
Issue -
State: open - Opened by tangpanyu 3 months ago
#4512 - [BACKEND] Set LLVM_ABI_BREAKING_CHECKS to be able to update the llvm version
Pull Request -
State: closed - Opened by karupayun 3 months ago
- 4 comments
#4511 - Cannot find 2.0.0.dev20221202 version
Issue -
State: open - Opened by anttitapsa 3 months ago
- 10 comments
#4510 - [backend] NFC: Fix ptx `st` argument order
Pull Request -
State: closed - Opened by chsigg 3 months ago
#4509 - Should @core.extern be part of the libdevice interface?
Issue -
State: open - Opened by int3 3 months ago
- 1 comment
#4508 - Seems like these slides are missing, if possible, would you mind update the links or re-upload the files?
Issue -
State: open - Opened by BruceDai003 3 months ago
#4507 - [BACKEND] Fix linear layout's distributed to distributed layout conversion using shared memory
Pull Request -
State: closed - Opened by Jokeren 3 months ago
#4506 - [AMD][Reorder] Remove reorder pattern that violates memory access order
Pull Request -
State: closed - Opened by sjw36 3 months ago
#4504 - [AMD] Add barrier at the beginning of each atomic operations.
Pull Request -
State: closed - Opened by jungpark-mlir 3 months ago
- 15 comments
#4503 - Allow third-party backends to add submodules to `triton.language.extra`
Pull Request -
State: closed - Opened by Alfie-Edwards 3 months ago
- 9 comments
#4502 - [Bug] Assertion `idx < size()' failed.
Issue -
State: open - Opened by zhyncs 3 months ago
- 10 comments
Labels: enhancement
#4501 - [BACKEND] Update LLVM version to https://github.com/llvm/llvm-project/commit/4c5ef6690040383956461828457ac27f7f912edb
Pull Request -
State: closed - Opened by vwbaker 3 months ago
#4500 - Try out MI210 build bot
Pull Request -
State: open - Opened by antiagainst 3 months ago
#4499 - [BUILD] Include backend files to build nvidia backend
Pull Request -
State: closed - Opened by Jokeren 3 months ago
#4498 - [nvidia] Support passing TMA descriptors by-value
Pull Request -
State: closed - Opened by embg 3 months ago
#4497 - Revert "[CI][AMD] Reenable MI300 CI (#4453)"
Pull Request -
State: closed - Opened by antiagainst 3 months ago
#4496 - [AUTOTUNER] Make autotuner take `do_bench` as a parameter
Pull Request -
State: closed - Opened by int3 3 months ago
- 3 comments
#4495 - [Proton] Add warning in Proton viewer about negative byte values
Pull Request -
State: closed - Opened by CRobeck 3 months ago
- 1 comment
#4494 - Worse performance on `H100` than `RTX3090`
Issue -
State: closed - Opened by jeromeku 3 months ago
#4493 - feat: add return all for do_bench
Pull Request -
State: closed - Opened by OrenLeung 3 months ago
#4492 - [BACKEND] Support Hopper MMA to MMA convert_layout ops
Pull Request -
State: closed - Opened by Jokeren 3 months ago
- 3 comments
#4491 - [AMD][gfx12] Support WMMAv2 dot instruction generation
Pull Request -
State: closed - Opened by joviliast 3 months ago
#4490 - fix
Pull Request -
State: closed - Opened by ArtificialZeng 3 months ago
#4489 - [amd] NFC: Fix typos.
Pull Request -
State: closed - Opened by chsigg 3 months ago
#4488 - Output dtype of 09-persistent-matmul.py
Issue -
State: open - Opened by xijiu9 3 months ago
#4487 - Allow location rewriting to apply to any part of IR lowering
Pull Request -
State: closed - Opened by int3 3 months ago
- 2 comments
#4486 - [triton][tool] A CLI Tool for Tensor Layout Printing
Pull Request -
State: closed - Opened by fywkevin 3 months ago
- 1 comment
#4485 - [TESTING] Remove the `fast_flush` parameter from `do_bench`
Pull Request -
State: closed - Opened by int3 3 months ago
- 2 comments
#4484 - FP8 Casting Tensor Size Alignment Error on AMD MI210
Issue -
State: closed - Opened by kimbo-a2labs 3 months ago
- 3 comments
#4483 - [WIP][gfx11] Support tied wmma instrucrions
Pull Request -
State: open - Opened by joviliast 3 months ago
- 2 comments
#4480 - Does Triton have a roadmap plan?
Issue -
State: open - Opened by MeJerry215 3 months ago
- 1 comment
#4476 - Fused Attention FP8 correctness on L20(Ada GPU)
Issue -
State: open - Opened by suluner 3 months ago
- 1 comment