You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for good work!Could you explain why adjacent layer feature output aggregation enhances video generation results? Since CogVideo's training data is not publicly available, I wonder if this benefit might come from differences in training data distribution between RepVideo and CogVideo, making direct comparisons difficult. Are there any experimental results for RepVideo without the aggregation operation?
The text was updated successfully, but these errors were encountered:
Thanks for good work!Could you explain why adjacent layer feature output aggregation enhances video generation results? Since CogVideo's training data is not publicly available, I wonder if this benefit might come from differences in training data distribution between RepVideo and CogVideo, making direct comparisons difficult. Are there any experimental results for RepVideo without the aggregation operation?
The text was updated successfully, but these errors were encountered: