Support power of 2 scaling factors in float8 training and use e4m3 everywhere #7955

test (CPU 2.5.1, linux.4xlarge, torch==2.5.1 --index-url https://download.pytorch.org/whl/cpu, cpu)  /  linux-job

succeeded Feb 7, 2025 in 11m 2s