Skip to content

Issues: pytorch/torchtune

v0.6.0 tracker
#2232 opened Jan 6, 2025 by joecummings
Open
Testing tracker
#1890 opened Oct 23, 2024 by felipemello1
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Gemma3
#2484 opened Mar 12, 2025 by krammnic
recursive_reshard
#2483 opened Mar 12, 2025 by caiqi
Add add_end_token to the Qwen Models bug Something isn't working community help wanted We would love the community's help completing this issue good first issue Good for newcomers
#2481 opened Mar 11, 2025 by pbontrager
Add add_end_token to Phi tokenizers bug Something isn't working community help wanted We would love the community's help completing this issue good first issue Good for newcomers
#2480 opened Mar 11, 2025 by pbontrager
Add add_end_token to Mistral tokenizer bug Something isn't working community help wanted We would love the community's help completing this issue good first issue Good for newcomers
#2479 opened Mar 11, 2025 by pbontrager
Add add_end_token to the Gemma Tokenizer bug Something isn't working community help wanted We would love the community's help completing this issue good first issue Good for newcomers
#2478 opened Mar 11, 2025 by pbontrager
MPS memory leak bug Something isn't working triage review This issue should be discussed in weekly review
#2473 opened Mar 10, 2025 by SalmanMohammadi
Chunked GRPO loss enhancement New feature or request
#2469 opened Mar 9, 2025 by SalmanMohammadi
Chunked DPO loss enhancement New feature or request
#2468 opened Mar 9, 2025 by SalmanMohammadi
Will support multi-turn conversations? enhancement New feature or request triage review This issue should be discussed in weekly review
#2463 opened Mar 6, 2025 by dz1iang
No chat template in evaluation community help wanted We would love the community's help completing this issue enhancement New feature or request
#2459 opened Mar 5, 2025 by xueyan-lii
Bring in DSV3 from torchtitan enhancement New feature or request
#2457 opened Mar 4, 2025 by EugenHotaj
torchtune dry-run feature request community help wanted We would love the community's help completing this issue enhancement New feature or request triage review This issue should be discussed in weekly review
#2453 opened Mar 3, 2025 by agunapal
Value Recomputation Shape Mismatch For Tensor Parallelism bug Something isn't working distributed Anything related to distributed env (multi-GPU, multi-node)
#2451 opened Mar 2, 2025 by TAplutos
How to run a multi-node, multi-GPU training in a Ray cluster? discussion Start a discussion distributed Anything related to distributed env (multi-GPU, multi-node)
#2450 opened Mar 2, 2025 by Hambaobao
Qwen2.5-VL support planned?
#2448 opened Mar 1, 2025 by paras-genmo
Add StatefulDataloader to remainder of recipes community help wanted We would love the community's help completing this issue enhancement New feature or request high-priority
#2439 opened Feb 26, 2025 by joecummings
8 tasks
Add support for Llama-Guard discussion Start a discussion triaged This issue has been assigned an owner and appropriate label
#2434 opened Feb 25, 2025 by agunapal
ProTip! Exclude everything labeled bug with -label:bug.