You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Eg. if
device: auto -> torch.cuda.is_available() -> cuda or mps.
dtype: float32 -> float32, no quantization
dtype: float16 -> float16, no quantization
dtype: bfloat16 -> float16, no quantization
dtype: auto -> (bfloat16 if possible else float16) if device is cuda else float32, no quantization
dtype: int8 -> float32, int8 quantization
dtype: fp8 -> float32, fp8 quantization
Motivation
Your contribution
The text was updated successfully, but these errors were encountered:
Feature request
Too much boilerplate template:
Resolves loading, quantization, and device
Eg. if
device: auto -> torch.cuda.is_available() -> cuda or mps.
dtype: float32 -> float32, no quantization
dtype: float16 -> float16, no quantization
dtype: bfloat16 -> float16, no quantization
dtype: auto -> (bfloat16 if possible else float16) if device is cuda else float32, no quantization
dtype: int8 -> float32, int8 quantization
dtype: fp8 -> float32, fp8 quantization
Motivation
Your contribution
The text was updated successfully, but these errors were encountered: