Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / google/XNNPACK issues and pull requests
#6355 - Switch to the new `rational_9_6` microkernels for `f32-vtanh`.
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6354 - Softmax kernels for AVX/AVX512 generate smaller unrolled variants
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6353 - Add partial support for building/testing/benchmarking XNNPACK on Hexagon. Additional work would need to be done to get this fully working in the Bazel build (notably, connecting to a Qualcomm SDK) but this extends the basic build rules enough to add specializations for Hexagon to XNNPACK.
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6352 - AVX512skx RSUM F16F32ACC microkernels accumulate into output
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6351 - AVX512skx RSUM F16F32ACC microkernels accumulate into output
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6350 - Internal config change
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6349 - Fix GEMM config for xnn_qd8_f16_qc8w_igemm_minmax_ukernel_4x16c4__neondotfp16arith
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6348 - Fix GEMM config for xnn_qd8_f16_qc8w_gemm_minmax_ukernel_2x8c2s4__neonfp16arith
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6347 - Fix GEMM config for xnn_qd8_f16_qc8w_igemm_minmax_ukernel_4x16c8__neoni8mm
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6346 - Don't use mmap/munmap/mprotect for XNN_PLATFORM_QURT: the functions aren't available to ordinary user code. Instead, just use qurt_alloc/qurt_free.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6345 - Fix GEMM cnofig for xnn_qd8_f16_qc8w_igemm_minmax_ukernel_4x16c4__neondotfp16arith
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6344 - Fix GEMM cnofig for xnn_qd8_f32_qc8w_gemm_minmax_ukernel_2x8c2s4__neon_mlal
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6343 - XNNPACK tests that use mmap() fail on Hexagon devices
Issue -
State: open - Opened by steven-johnson 7 months ago
#6342 - Introduce TransposeConv with dynamic range quantization Subgraph API
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6341 - Add WAsmSIMD rdsum accumulating microkernels
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6340 - Add F16F32ACC NEONFP16ARITH rdsum accumulating microkernels
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6339 - Amalgam RVV X32-Transpose microkernels
Pull Request -
State: open - Opened by phoebesv 7 months ago
- 1 comment
#6338 - Internal config change
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6337 - Add f32 Maxpool RVV implementation microkernels, tests and config changes.
Pull Request -
State: open - Opened by KaustubhIMG 7 months ago
- 1 comment
#6336 - Implement `f32-vtanh` microkernels using a 9/6 rational polynomial approximation.
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6335 - Add AVX512F rdsum accumulating microkernels
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6334 - Accumulating AVX rdsum microkernels
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6333 - Rdsum microkernels are accumlating
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6332 - Add SSE rdsum microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6331 - Introduce QD8 TranposeConv operator
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6330 - RDSum microkernels are no longer minmax and have their own tester.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6329 - Clean-up rdsum benches
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6328 - Enable F16-F32ACC-RSUM AVX512SKX microkernel for sum of FP16 values
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6327 - F16-F32ACC-RSUM AVX512SKX microkernel for sum of FP16 values
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6326 - Change Hexagon minimum supported version to v68 (from v66)
Pull Request -
State: closed - Opened by ejparkqc 7 months ago
- 2 comments
#6325 - Add f32 rsum discontig benchmarks
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6324 - Add f32 rsum discontig neon microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6323 - Add rsum discontiguous ukernels.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6322 - Add f32 Avgpool RVV implementation micro-kernels, tests and config changes.
Pull Request -
State: open - Opened by KaustubhIMG 7 months ago
- 1 comment
#6321 - Fix math.h spelling - compliment changed to complement
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6320 - F16-RMAX - enable rmax F16C microkernel for F16C instead of AVX2
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6319 - F16-RMAX - enable rmax scalar for all platforms
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6318 - F16-RMINMAX - move math_min_f16 and math_max_f16 to math.h
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6317 - F16-RMAX enable scalar microkernel for all cpus
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6316 - F16-RMAX benchmark include f16c_u32 for AVX
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6315 - Fix missing `#include`s in the `XNNPACK/src` subdirectory.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6314 - AVX512 GEMM/IGEMM enable asan
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6313 - F16-RMAX scalar optimized keep max sign complement
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6312 - Remove printf that were used during debugging reduce and resize
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6311 - GEMM unittest step thru NC by NextPrime
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6310 - Improve GEMM unittest performance
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6309 - Step thru k-block test range using prime numbers
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6308 - Pass VNNI and AMX flags to hardware-config.c
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6307 - Automated Code Change
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6306 - Load-from-misaligned-address failures on Hexagon simulator
Issue -
State: open - Opened by steven-johnson 7 months ago
- 3 comments
#6305 - test/sigmoid_nc_test fails on Hexagon simulator
Issue -
State: open - Opened by steven-johnson 7 months ago
- 1 comment
#6304 - Add support for broadcasting of scalar weights to Prelu
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6303 - X8-PACKW use unaligned_store_s32
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6302 - Add iterative `vsqrt` microkernels for `x86_64`, which computes `x*rsqrt(x)`, i.e.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6301 - Re-generate tests after template update
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6300 - Fix order of external values in slinky
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6299 - Use the FARF() macros for debug logging on Hexagon, rather than qurt_printf(); this allows multiple logging levels (like __android_log_vprint), but more importantly, it works much more reliably with the Hexagon simulator, which can drop stdout/stderr output.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6298 - Rollback of new `f32-vsqrt` microkernels.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6297 - Fix missing `#include`s in `XNNPACK/test` subdirectory.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6296 - Rsum ukernels accumulate into output.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6295 - Mean op can handle arbitrary reduction axis in the contiguous axes.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6294 - Member functions in `class` definitions need not be marked as `inline`.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6293 - Enable AVX2 F16-F32ACC GEMM for improved performance
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6292 - Add a `ReplicableRandomDevice` to create reproducible randomized tests.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6291 - Use the `ReplicableRandomDevice` instead of `std::random_device`/`std::mt19937` throughout the unit tests.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6290 - Enable x16_packw for AVX2 goi weights
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6289 - Don't attempt to call `cpuinfo_initialize()` unless `XNN_ENABLE_CPUINFO` is enabled.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6288 - Manually set a larger stack size for `WASM` tests.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6287 - AMX QD8_F32_QC8W GEMM generate all tile sizes
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6286 - Enable QD8_F16 AMX GEMM/IGEMM MRx64c4 microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6285 - cmake build failure with XNNPACK_BUILD_TESTS=ON and XNNPACK_LIBRARY_TYPE=shared
Issue -
State: open - Opened by loqs 7 months ago
#6284 - Relax assert in transpose kernels on strides when it doesn't matter because there is only one element.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6283 - Change AMX k-block from 64 to 4 for faster testing.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6282 - QD8_F16 AMX GEMM/IGEMM MRx64c4 microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6281 - AMX IGEMM M=1 specialized loop to use input pointer directly.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6280 - Add `WAsm SIMD` microkernel for `f32-rsqrt`.
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6279 - Move `xnn_weights_cache_is_finalized` to xnnpack.h to make it available to client code.
Pull Request -
State: closed - Opened by copybara-service[bot] 7 months ago
#6278 - When no weight cache is provided to XNNPack, create one to share packed weights between operations.
Pull Request -
State: open - Opened by copybara-service[bot] 7 months ago
#6277 - Support RVV x32-packw
Pull Request -
State: open - Opened by bhbruce 7 months ago
- 2 comments
#6276 - Enable QS8 AMX 16x16c4 GEMM/IGEMM microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6275 - AMX IGEMM fix params to use avx512 params
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6274 - Exported helper functions for transposition normalization.
Pull Request -
State: open - Opened by copybara-service[bot] 8 months ago
#6273 - Concat5 supports fp16
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6272 - Generate QS8 AMX MRx32c4 and MRx64c4 GEMM/IGEMM microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6271 - Fix up RVV gemm and transpose
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6270 - Clone #5893 and #5912
Pull Request -
State: closed - Opened by dsharlet 8 months ago
#6269 - Enable AVX512 and AVX2 F32_RADDSTOREEXPMINUSMAX microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6268 - How can I parallelize the execution of this benchmark? (https://github.com/google/XNNPACK/blob/master/bench/spmm-benchmark.h)
Issue -
State: open - Opened by AnonymousYWL 8 months ago
#6267 - fc_qc8w should use qcint32 bias
Pull Request -
State: closed - Opened by mcr229 8 months ago
- 4 comments
#6266 - Enable AVX512 and AVX2 F32_RADDSTOREEXPMINUSMAX microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6265 - Enable AVX512 and AVX2 F32_RADDSTOREEXPMINUSMAX microkernels
Pull Request -
State: open - Opened by copybara-service[bot] 8 months ago
#6264 - Fix ASAN error in QD8 AMX IGEMM microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6263 - Enable QD8 AMX 16x64c4 GEMM/IGEMM microkernels
Pull Request -
State: closed - Opened by copybara-service[bot] 8 months ago
#6196 - Fix Bazel build for bench subdir
Pull Request -
State: open - Opened by steven-johnson 8 months ago
- 1 comment
#6110 - Add Clang version guard for AMX support in CMake
Pull Request -
State: closed - Opened by GregoryComer 9 months ago
#6031 - Add iterative `vsqrt` microkernels for `x86_64`, which computes `x*rsqrt(x)`, i.e.
Pull Request -
State: closed - Opened by copybara-service[bot] 9 months ago
#6027 - Add RVV F32-IGEMM
Pull Request -
State: closed - Opened by bhbruce 9 months ago
- 3 comments
#5954 - Refactor `vunary` benchmarks.
Pull Request -
State: open - Opened by copybara-service[bot] 10 months ago
#5953 - Enable QS8_QC8W 7x8C8 AVX512VL VNNI microkernels for mobile
Pull Request -
State: open - Opened by copybara-service[bot] 10 months ago
#5952 - Enable QS8_QC8W 7x8C8 AVX512VL VNNI microkernels for mobile
Pull Request -
State: open - Opened by copybara-service[bot] 10 months ago