Skip to content

Support power of 2 scaling factors in float8 training #4922

Support power of 2 scaling factors in float8 training

Support power of 2 scaling factors in float8 training #4922