I'm using FlashInfer for a text-to-speech model, and I need the attention scores to compute the alignment between the output (audio) and the input (text). Is it possible to get the attention scores over all tokens during decoding, not just the logsumexp?
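For context, here is a minimal sketch of how per-token attention weights relate to the logsumexp for a single query and a single head. It does not call FlashInfer. The helper names (`attention_probs_from_lse`, `logsumexp`) and the toy vectors are made up for illustration. It assumes the logsumexp is a natural log taken over the scaled logits `scale * (q · k_i)`; if a kernel reports it in another base (for example base 2), it would need converting first. It also assumes the caller has already gathered the key vectors from the KV cache, with any positional encoding applied the same way the kernel applies it.

```python
import math


def logsumexp(values):
    """Numerically stable natural-log logsumexp of a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def attention_probs_from_lse(q, keys, lse, scale=None):
    """Recover softmax attention weights for one query/head.

    q:    query vector (list of floats)
    keys: key vectors for every cached token (list of lists)
    lse:  natural-log logsumexp of the scaled logits (assumed convention)
    """
    if scale is None:
        scale = 1.0 / math.sqrt(len(q))  # standard 1/sqrt(head_dim) scaling
    logits = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    # softmax_i = exp(logit_i) / sum_j exp(logit_j) = exp(logit_i - lse)
    return [math.exp(s - lse) for s in logits]


# Toy example: one query attending over three cached tokens.
q = [0.5, -1.0, 0.25, 2.0]
keys = [
    [1.0, 0.0, 0.0, 1.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.5, 0.5, 0.5, 0.5],
]
scale = 1.0 / math.sqrt(len(q))
lse = logsumexp([scale * sum(a * b for a, b in zip(q, k)) for k in keys])
probs = attention_probs_from_lse(q, keys, lse)
```

The resulting `probs` sum to 1 and give one weight per cached token, which is the quantity needed for an audio-to-text alignment. The cost is recomputing `q · k` outside the fused kernel, which is why having the kernel return the scores directly would be preferable.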
yawnzh changed the title from "[FEATURE] Can BatchDecodeWithPagedKVCacheWrapper return attention scores to all tokens, not just logsumexp?" to "[Feature] Can BatchDecodeWithPagedKVCacheWrapper return attention scores to all tokens, not just logsumexp?" on Feb 14, 2025