Misaligned AOTI input; potential perf gains by fixing? #1424
Labels
actionable
Items in the backlog waiting for an appropriate impl/fix
bug
Something isn't working
Compile / AOTI
Issues related to AOT Inductor and torch compile
triaged
This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
🐛 Describe the bug
Picked up in #1367 and worked around via pytorch/pytorch#143236: it appears the input to the torchchat AOTI runner is not 16-byte aligned.
While the pytorch/pytorch PR eases this constraint, the misalignment itself may be costing performance, since misaligned inputs commonly do. Aligning the input could therefore yield perf gains.
Hat tip to @malfet for suggesting this line of investigation.
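A minimal sketch of the check implied above, for anyone picking this up: an address is 16-byte aligned when it is a multiple of 16. The helper names (`is_aligned`, `misalignment`) are hypothetical and not part of torchchat or PyTorch, and the demo uses a plain `ctypes` buffer rather than the actual runner input, since the issue does not say where in the runner the input is created.

```python
import ctypes

ALIGNMENT = 16  # bytes; the alignment the AOTI input reportedly lacks


def misalignment(address: int, alignment: int = ALIGNMENT) -> int:
    """Return how many bytes `address` sits past the previous aligned boundary."""
    return address % alignment


def is_aligned(address: int, alignment: int = ALIGNMENT) -> bool:
    """Return True if `address` is a multiple of `alignment`."""
    return misalignment(address, alignment) == 0


# Stand-in for an input buffer (assumption: not the real torchchat input).
# Over-allocate so an aligned address is guaranteed to exist inside the buffer.
raw = ctypes.create_string_buffer(64 + ALIGNMENT)
base = ctypes.addressof(raw)

# Round the base address up to the next 16-byte boundary.
aligned_addr = base + (-base % ALIGNMENT)

# An address 4 bytes further in, e.g. a view starting one float32 into the buffer.
unaligned_addr = aligned_addr + 4

print(is_aligned(aligned_addr), is_aligned(unaligned_addr))
```

For a real tensor, the address to test would presumably be `tensor.data_ptr()`, e.g. `is_aligned(tensor.data_ptr())`, checked just before the tensor is handed to the AOTI runner.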
Versions
bb72b09