You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
float_workspace_size_in_bytes is an input parameter that cannot be determined in advance through function calls or other means.
Currently, PrefillPlan may attempt to allocate variables like batch_prefill_tmp_v and batch_prefill_tmp_s without checking if they would exceed the available float workspace size.
To prevent this issue, a function should be implemented to notify users about the required float workspace size.
The text was updated successfully, but these errors were encountered:
Yes, the required float workspace buffer size can be determined given hardware information (number of SMs), head dimensions and query tile size (see appendix D.2 in the paper), I'll work on this after upgrading the scheduler to v2 (some of them have not been upstreamed yet).
Current PrefillPlan interface:
float_workspace_size_in_bytes
is an input parameter that cannot be determined in advance through function calls or other means.Currently,
PrefillPlan
may attempt to allocate variables likebatch_prefill_tmp_v
andbatch_prefill_tmp_s
without checking if they would exceed the available float workspace size.To prevent this issue, a function should be implemented to notify users about the required float workspace size.
The text was updated successfully, but these errors were encountered: