Move tokenizer information into pte to reduce ExecuTorch runner args #1484
Labels
actionable
Items in the backlog waiting for an appropriate impl/fix
enhancement
New feature or request
ExecuTorch
Issues related to ExecuTorch installation, export, or build. Mobile uses separate tags
good first issue
Good for newcomers
triaged
This issue has been looked at by a team member, triaged, and prioritized into an appropriate module
🚀 The feature, motivation and pitch
After an ExecuTorch model is exported to a pte file, tokenizer information must be passed to the runner as an argument (-l <#>). This can be avoided by writing the information into the pte file itself, since the tokenizer is known at export time (sentencepiece => 2, tiktoken => 3). Tokenizer information can be stored during export as a constant_method. For example: https://github.com/pytorch/torchchat?tab=readme-ov-file#deploy-and-run-on-android
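As a rough sketch of the idea (the helper name and the exact constant_methods usage below are assumptions for illustration, not a confirmed design from this issue), the export-time mapping could look like:

```python
# Tokenizer name -> integer ID, per the issue: sentencepiece => 2, tiktoken => 3.
TOKENIZER_IDS = {"sentencepiece": 2, "tiktoken": 3}

def tokenizer_type_id(name: str) -> int:
    """Return the numeric tokenizer ID to embed in the .pte file."""
    try:
        return TOKENIZER_IDS[name]
    except KeyError:
        raise ValueError(f"unknown tokenizer: {name!r}")

# At export time, the ID could be baked into the program as a constant method
# (hypothetical usage of executorch.exir.to_edge's constant_methods parameter):
#
#   edge = to_edge(
#       exported_program,
#       constant_methods={"tokenizer_type": tokenizer_type_id(tokenizer_name)},
#   )
#
# The runner would then read "tokenizer_type" back from the pte instead of
# requiring the -l <#> argument.
```

This keeps the runner interface simpler: the source of truth for the tokenizer lives in the artifact itself rather than in a flag the user must remember.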
Task:
For a similar optimization made for AOTI, see #1159.
See #1439 for more context and conversation.
Alternatives
Continue to pass tokenizer arguments to the runner
Additional context
No response
RFC (Optional)
No response