You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
While going through the paper again, just got curious about teacher-forcing gt outside the mask.
So my understanding is generating token as it is done in the provided code for the parts that are not masked. But what about the parts that are not masked? are they supposed to be initialized to 0? Not quite sure how the parts within mask can be generated without class info.
Thanks in advance!
The text was updated successfully, but these errors were encountered:
While going through the paper again, just got curious about teacher-forcing gt outside the mask.
So my understanding is generating token as it is done in the provided code for the parts that are not masked. But what about the parts that are not masked? are they supposed to be initialized to 0? Not quite sure how the parts within mask can be generated without class info.
Thanks in advance!
The text was updated successfully, but these errors were encountered: