Issues: pytorch/torchtune

v0.6.0 tracker
#2232 opened Jan 6, 2025 by joecummings
Open
Testing tracker
#1890 opened Oct 23, 2024 by felipemello1
Open

Issues list

[RFC] Proposal for tune cat Command (labels: discussion, rfc)
#2281 opened Jan 19, 2025 by Ankur-singh updated Jan 19, 2025
Roadmap for other parallelisms
#2280 opened Jan 19, 2025 by rahul-sarvam updated Jan 19, 2025
_checkpoint_client not installing
#2279 opened Jan 19, 2025 by maxwellreynolds updated Jan 19, 2025
DPO after / on top of LoRA tuning (labels: discussion, triaged)
#2272 opened Jan 16, 2025 by albertbn updated Jan 19, 2025
Very slow convergence with bf16
#2254 opened Jan 11, 2025 by EugenHotaj updated Jan 18, 2025
Add multiprocess dataset packing (labels: enhancement, triaged)
#2180 opened Dec 19, 2024 by bratao updated Jan 18, 2025
Finetune meta-llama/Llama-Guard-3-1B (labels: triaged)
#2237 opened Jan 8, 2025 by jingzhaoou updated Jan 18, 2025
[RFC] Additional chat loss masking strategies (labels: community help wanted, discussion, enhancement, good first issue, rfc)
#2261 opened Jan 13, 2025 by RdoubleA updated Jan 18, 2025
Llama3.2 vision does not run with distributed state dict
#2277 opened Jan 17, 2025 by acisseJZhong updated Jan 17, 2025
Don't use _get_clones (labels: best practice, community help wanted)
#2270 opened Jan 16, 2025 by ebsmothers updated Jan 17, 2025
Finetuning Llama 3.1 8B Base Model on ChatML Format Dataset – Loss Reaches NaN After 2000 Steps (labels: triaged)
#2246 opened Jan 10, 2025 by abdul-456 updated Jan 17, 2025
The current instantiation does not trigger the initialization of submodules (labels: discussion, triaged)
#2273 opened Jan 16, 2025 by dz1iang updated Jan 17, 2025
Qlora uses more memory than regular lora (labels: triaged)
#2255 opened Jan 11, 2025 by AndrewMead10 updated Jan 16, 2025
output resolved config with the checkpoint (labels: better engineering, enhancement)
#1968 opened Nov 7, 2024 by felipemello1 updated Jan 16, 2025
Lora and Dora finetuning produces identical results (labels: bug, high-priority)
#2250 opened Jan 10, 2025 by AndrewMead10 updated Jan 16, 2025
About the CLS token for the llama3_2_vision_encoder (labels: discussion, triaged)
#2268 opened Jan 15, 2025 by dfloreaa updated Jan 15, 2025
adding support for LR schedule for full distributed finetune (labels: best practice, better engineering, triaged)
#2263 opened Jan 13, 2025 by tginart updated Jan 15, 2025
Expose FSDP2 MixedPrecisionPolicy params (labels: enhancement, triaged)
#2267 opened Jan 14, 2025 by EugenHotaj updated Jan 14, 2025
Overriding kv cache entries in torchtune models (labels: discussion, triaged)
#2241 opened Jan 9, 2025 by telgamal-1 updated Jan 14, 2025
GPU Middle Class? (labels: discussion, distributed, triaged)
#2161 opened Dec 16, 2024 by EugenHotaj updated Jan 14, 2025
[feature request] support input/output to fsspec path (labels: enhancement, triaged)
#2217 opened Dec 31, 2024 by leoleoasd updated Jan 14, 2025
Request: adding py.typed for type checkers (labels: better engineering, triaged)
#2258 opened Jan 13, 2025 by jamesbraza updated Jan 14, 2025
raise error when running registered config with incompatible recipe (labels: better engineering)
#1550 opened Sep 12, 2024 by felipemello1 updated Jan 14, 2025
hotw to estimate gpu memory needed for knowledge distillation? (labels: discussion, triaged)
#2213 opened Dec 30, 2024 by chuangzhidan updated Jan 14, 2025