Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / NVIDIA/apex issues and pull requests

#1874 - [Orin Nano] Error when trying to use sparsity

Issue - State: open - Opened by kyflores 10 days ago
Labels: bug

#1873 - multi_tensor_l2norm_scale_kernel compilation error

Issue - State: closed - Opened by zslefour 12 days ago

#1872 - multi_tensor_l2norm_scale_kernel compilation error

Issue - State: closed - Opened by zslefour 12 days ago

#1869 - Fix the typo of FusedRMSNorm doc

Pull Request - State: open - Opened by cqulilujia 19 days ago

#1866 - bug: FastLayerNorm

Issue - State: closed - Opened by KK666-AI about 1 month ago - 2 comments
Labels: bug

#1865 - Build sdist and wheel in CI

Pull Request - State: open - Opened by calebho about 1 month ago - 1 comment

#1864 - Traceable LayerNorm

Pull Request - State: closed - Opened by yanboliang about 1 month ago

#1863 - Use Mem Pool API for NCCL Zero-Copy

Pull Request - State: closed - Opened by Aidyn-A about 1 month ago - 1 comment

#1862 - Traceable RMSNorm part 2: non affine fused rms norm

Pull Request - State: closed - Opened by yanboliang about 2 months ago - 7 comments

#1861 - Traceable RMSNorm

Pull Request - State: closed - Opened by yanboliang about 2 months ago

#1860 - [NCCL] Updated fix for premature destruction of ProcessGroupNCCL

Pull Request - State: closed - Opened by eqy about 2 months ago

#1859 - [NCCL] Prevent premature destroy of PGs following PyTorch upstream change

Pull Request - State: closed - Opened by eqy about 2 months ago

#1858 - [GroupNorm] Skip GroupNorm tests on A16, A2 etc.,

Pull Request - State: closed - Opened by eqy about 2 months ago

#1857 - AttributeError: module 'torch.compiler' has no attribute 'is_compiling'

Issue - State: open - Opened by LukeLIN-web 2 months ago - 5 comments
Labels: bug

#1855 - `Tensor.type()` -> `Tensor.scalar_type()`

Pull Request - State: closed - Opened by crcrpar 3 months ago

#1854 - [PT2] Normalisation: use manual impl when compiling

Pull Request - State: closed - Opened by alexdremov 3 months ago - 1 comment

#1852 - On resolving the installation failure caused by ModuleNotFoundError: No module named 'torch'

Issue - State: open - Opened by Eikwang 3 months ago - 26 comments
Labels: bug

#1851 - How to install apex

Issue - State: open - Opened by Zerycii 3 months ago - 12 comments

#1850 - How exactly can apex be installed

Issue - State: open - Opened by poyingshihuang 3 months ago - 1 comment

#1849 - AdamW implementation does not truly decouple learning rate and weight decay

Issue - State: open - Opened by leenachennuru 3 months ago - 2 comments
Labels: bug

#1846 - Literature associated with fused_dense

Issue - State: open - Opened by prmudgal 4 months ago - 1 comment

#1845 - fix groupnorm int32 index overflow

Pull Request - State: open - Opened by tlogn 4 months ago - 2 comments

#1844 - Main

Pull Request - State: closed - Opened by 63days 4 months ago

#1840 - No module named 'amp_C'

Issue - State: open - Opened by KanyuBao 5 months ago
Labels: bug

#1839 - loss scale

Issue - State: open - Opened by yjy-10 5 months ago

#1837 - Reformat Grad Output If It's Not Channels Last

Pull Request - State: closed - Opened by alpha0422 5 months ago
Labels: contrib

#1836 - Add Unittest For Distributed Adam With CUDA Graph

Pull Request - State: closed - Opened by alpha0422 5 months ago

#1835 - Traceable GroupNorm

Pull Request - State: closed - Opened by alpha0422 5 months ago
Labels: contrib

#1834 - install bug with pytorch2.0.1

Issue - State: open - Opened by Duanjinyi1 5 months ago - 1 comment
Labels: bug

#1832 - Enhance Distributed Fused Adam

Pull Request - State: closed - Opened by alpha0422 5 months ago
Labels: contrib

#1831 - Installation with CUDA extensions is failing

Issue - State: open - Opened by SaiedaJN 5 months ago - 2 comments
Labels: bug

#1830 - remove `run_transformer` from default lists

Pull Request - State: closed - Opened by crcrpar 5 months ago

#1829 - Fix DistributedTestBase for transformer distributed tests

Pull Request - State: closed - Opened by xwang233 5 months ago - 1 comment

#1825 - Fix illegal memory access with multi_tensor_apply size above INT_MAX

Pull Request - State: closed - Opened by gdb 5 months ago - 3 comments

#1824 - Unable to install Apex

Issue - State: open - Opened by JoongunPark 5 months ago - 1 comment
Labels: bug

#1823 - Setting up Apex and get this error: ModuleNotFoundError: No module named 'torch'

Issue - State: closed - Opened by Mayolov 6 months ago - 12 comments
Labels: bug

#1822 - Install set.up

Issue - State: open - Opened by Maritime-Moon 6 months ago
Labels: bug

#1821 - Allow Configurable Cache Directory

Pull Request - State: closed - Opened by leimao 6 months ago

#1820 - [Distributed optimizer] Do not monkey-patch class methods

Pull Request - State: closed - Opened by timmoon10 6 months ago

#1818 - NCCLAllocator: Fix build failure

Pull Request - State: closed - Opened by Aidyn-A 6 months ago - 4 comments

#1817 - Unable to install Apex on Linux(debian) with CUDA 12.1 and torch 2.2.2

Issue - State: open - Opened by SamitM1 6 months ago - 2 comments
Labels: bug

#1816 - Release GIL

Pull Request - State: closed - Opened by crcrpar 7 months ago - 1 comment

#1815 - unable to install

Issue - State: open - Opened by lxy51 7 months ago - 5 comments

#1814 - Release GIL when calling C extensions

Issue - State: closed - Opened by szmigacz 7 months ago
Labels: bug

#1813 - deprecate uses of torch.cuda.amp

Pull Request - State: closed - Opened by Fuzzkatt 7 months ago - 2 comments

#1811 - fixup concats for grouped convolution

Pull Request - State: open - Opened by techshoww 7 months ago

#1810 - Unable to install Apex

Issue - State: open - Opened by Anupam-5 7 months ago - 2 comments
Labels: bug

#1809 - Win11+Visual Studio 2022,install successfully.

Issue - State: open - Opened by aswordok 7 months ago - 3 comments
Labels: bug

#1807 - Error

Issue - State: open - Opened by silentghost1412 7 months ago

#1806 - Use torch.testing.all_close instead of get_max_diff in test_lamb.py

Pull Request - State: closed - Opened by Fuzzkatt 7 months ago - 1 comment

#1805 - "packaging" library exists but not found

Issue - State: closed - Opened by mahmoodn 7 months ago - 3 comments

#1804 - Cannot import name 'UnencryptedCookieSessionFactoryConfig'

Issue - State: closed - Opened by mahmoodn 7 months ago - 3 comments

#1801 - Avoid importing apex transformer automatically

Pull Request - State: open - Opened by nWEIdia 8 months ago

#1800 - Not Able to install apex.

Issue - State: open - Opened by Avinash-py 9 months ago
Labels: bug

#1799 - [ INSTALLATION ] - Not able to install apex on a Linux machine

Issue - State: open - Opened by MBadriNarayanan 9 months ago - 4 comments
Labels: bug

#1798 - Fix reduce_blocks_into_lanes race condition

Pull Request - State: closed - Opened by Fuzzkatt 9 months ago

#1797 - NCCL userbuffer for DP RS in DistOpt

Pull Request - State: closed - Opened by WanZzzzzz 9 months ago

#1796 - Add nccl_allocator for zero-copy user buffer

Pull Request - State: closed - Opened by Aidyn-A 9 months ago

#1795 - Avoid unnecessary param write in distributed Adam kernel

Pull Request - State: closed - Opened by timmoon10 9 months ago - 1 comment

#1794 - Enhance Distributed Fused Adam

Pull Request - State: closed - Opened by alpha0422 9 months ago - 4 comments

#1793 - apex not installing

Issue - State: open - Opened by pradeepdev-1995 10 months ago
Labels: bug

#1791 - fix building torch extension with glog

Pull Request - State: open - Opened by petronny 10 months ago - 1 comment

#1790 - Add xentropy bf16 support

Pull Request - State: open - Opened by zyeric 10 months ago

#1789 - Unclear licensing for contrib/sparsity

Issue - State: open - Opened by hyandell 10 months ago
Labels: bug

#1788 - install failure

Issue - State: open - Opened by 52Hzaaa 10 months ago
Labels: bug

#1787 - No module named 'torch._six'

Issue - State: open - Opened by xujin1184104394 10 months ago - 1 comment
Labels: bug

#1786 - 64-bit indexing Adam

Pull Request - State: open - Opened by cdm114514 10 months ago

#1784 - Add 2D Fused RoPE

Pull Request - State: closed - Opened by yaox12 10 months ago

#1783 - Move to the correct device for v1 state dict

Pull Request - State: closed - Opened by acphile 10 months ago - 2 comments
Labels: contrib

#1782 - Bump thresholds for `test_backward` in `test_fused_softmax.py`

Pull Request - State: closed - Opened by eqy 10 months ago

#1781 - On installing apex (+without sudo/docker)

Issue - State: open - Opened by stet-stet 11 months ago - 1 comment
Labels: bug

#1779 - Installation Problem: RuntimeError: Error compiling objects for extension

Issue - State: open - Opened by wwma 11 months ago - 3 comments
Labels: bug

#1778 - Cannot compile/build cuda_ext on H100

Issue - State: open - Opened by GuanhuaWang 11 months ago
Labels: bug

#1777 - [CUDNN][cudnn-frontend] Bump cuDNN to 1.0.3

Pull Request - State: open - Opened by eqy 11 months ago

#1776 - memory format option is only supported by strided tensors

Issue - State: closed - Opened by Cheny1m 12 months ago - 1 comment
Labels: bug

#1775 - Skip the p2p test on single GPU platforms

Pull Request - State: closed - Opened by nWEIdia 12 months ago