TF32 Quantization #4685
Unanswered
NightWinkle
asked this question in
Q&A
TF32 support is detected automatically from the device properties, as is MMA/WMMA support, and autotune then selects the best algorithm. This is not exposed at a higher level (burn) because it is backend-specific; instead we rely on cubek/cubecl for that.
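To make the idea concrete, here is a rough sketch of capability-gated autotuning in general. It is not cubecl or cubek code: `DeviceProps`, `Algo`, `candidates`, and `autotune` are all hypothetical names made up for illustration, and the real selection logic is likely more involved.

```rust
// Toy illustration only: every name below is hypothetical and none of it is
// cubecl/cubek API.

/// Capabilities a backend might read from the device (assumed fields).
#[derive(Clone, Copy, Debug)]
pub struct DeviceProps {
    pub tf32: bool, // device can run TF32 math
    pub mma: bool,  // device exposes MMA/WMMA instructions
}

/// Hypothetical candidate algorithms for a matmul or convolution.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Algo {
    PlainF32,
    Wmma,
    WmmaTf32,
}

/// Build the candidate list: an accelerated variant is offered only when the
/// device reports the matching capability.
pub fn candidates(props: DeviceProps) -> Vec<Algo> {
    let mut out = vec![Algo::PlainF32];
    if props.mma {
        out.push(Algo::Wmma);
        if props.tf32 {
            out.push(Algo::WmmaTf32);
        }
    }
    out
}

/// Run `bench` on each candidate and keep the one with the lowest measured
/// cost. Returns `None` when there are no candidates.
pub fn autotune(cands: &[Algo], mut bench: impl FnMut(Algo) -> f64) -> Option<Algo> {
    let mut best: Option<(Algo, f64)> = None;
    for &algo in cands {
        let cost = bench(algo);
        if best.map_or(true, |(_, best_cost)| cost < best_cost) {
            best = Some((algo, cost));
        }
    }
    best.map(|(algo, _)| algo)
}
```

In this toy model there is no user-facing switch: a TF32 variant can only win if the device reports TF32 support and it also benchmarks fastest for the given problem.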
Hi!
From what I've seen, there is no TF32 option in the quantization options of a module. How is TF32 math enabled (or disabled)?
Also, I thought ConvStrategy was an available setting, but I realised it's not exposed. Since the default is autotune, does that mean that convs on the CUDA platform use TF32 implicit WMMA by default?
Thanks!