Skip to content

Support power of 2 scaling factors in float8 training and use e4m3 everywhere #2565

Support power of 2 scaling factors in float8 training and use e4m3 everywhere

Support power of 2 scaling factors in float8 training and use e4m3 everywhere #2565

Check PR Labels

succeeded Feb 10, 2025 in 2s