2025-03-27 nightly release (a5f8150)
Deploying to gh-pages from @ a5f8150 🚀
bf16 stacked group gemm (#3888)
Deploying to gh-pages from @ a9ea4a7 🚀
AdagradW (fbgemm backend) (#3827)
Deploying to gh-pages from @ 5a1b835 🚀
Retuned CK GMM fp8/bf16 with perf fixes (#3851)
Deploying to gh-pages from @ 851815d 🚀
Enable groupwise scales for F8I4 Grouped Gemm (#3884)
Deploying to gh-pages from @ 6a6db7c 🚀
Deploying to gh-pages from @ c8900e5 🚀
Fix IMA in TBE grad indices kernel for int32 indices (#3877)
Integrate D71065405 and D71079311 into stochastic rounding
Deploying to gh-pages from @ 0ee4923 🚀
A hotfix for FBGEMM fp8 rowwise with irregular gemm sizes (#3883)
2025-03-26 nightly release (74db0ac)
Deploying to gh-pages from @ 74db0ac 🚀
Fix empty input view in FP8 Grouped Gemm. (#3880)
Deploying to gh-pages from @ 18f273c 🚀
Use PackedAccessor64 for index_remappings in pruned_array_lookup (#3870)
Deploying to gh-pages from @ b2312f7 🚀
Improve VBE benchmark (#3867)
Deploying to gh-pages from @ a33f50a 🚀
Skip empty groups in FP8 Stacked Gemm (#3862)
Deploying to gh-pages from @ d7c6053 🚀
Add overflow_safe_int_t for addressing the int overflow problem (#3875)
Deploying to gh-pages from @ 37f5287 🚀
Deploying to gh-pages from @ d71093f 🚀
Transpose FP8 GEMM inputs for better tuning (#3866)
Clean up stochastic rounding benchmarks (#3876)