-
Notifications
You must be signed in to change notification settings - Fork 493
Issues: pytorch/torchtune
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[RFC] Proposal for Start a discussion
rfc
Request for comments
tune cat
Command
discussion
#2281
opened Jan 19, 2025 by
Ankur-singh
updated Jan 19, 2025
DPO after / on top of LoRA tuning
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2272
opened Jan 16, 2025 by
albertbn
updated Jan 19, 2025
Add multiprocess dataset packing
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2180
opened Dec 19, 2024 by
bratao
updated Jan 18, 2025
Finetune meta-llama/Llama-Guard-3-1B
triaged
This issue has been assigned an owner and appropriate label
#2237
opened Jan 8, 2025 by
jingzhaoou
updated Jan 18, 2025
[RFC] Additional chat loss masking strategies
community help wanted
We would love the community's help completing this issue
discussion
Start a discussion
enhancement
New feature or request
good first issue
Good for newcomers
rfc
Request for comments
#2261
opened Jan 13, 2025 by
RdoubleA
updated Jan 18, 2025
Llama3.2 vision does not run with distributed state dict
#2277
opened Jan 17, 2025 by
acisseJZhong
updated Jan 17, 2025
Don't use Things we should be doing but aren't
community help wanted
We would love the community's help completing this issue
_get_clones
best practice
#2270
opened Jan 16, 2025 by
ebsmothers
updated Jan 17, 2025
Finetuning Llama 3.1 8B Base Model on ChatML Format Dataset – Loss Reaches NaN After 2000 Steps
triaged
This issue has been assigned an owner and appropriate label
#2246
opened Jan 10, 2025 by
abdul-456
updated Jan 17, 2025
The current instantiation does not trigger the initialization of submodules
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2273
opened Jan 16, 2025 by
dz1iang
updated Jan 17, 2025
Qlora uses more memory than regular lora
triaged
This issue has been assigned an owner and appropriate label
#2255
opened Jan 11, 2025 by
AndrewMead10
updated Jan 16, 2025
output resolved config with the checkpoint
better engineering
Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
enhancement
New feature or request
#1968
opened Nov 7, 2024 by
felipemello1
updated Jan 16, 2025
Lora and Dora finetuning produces identical results
bug
Something isn't working
high-priority
#2250
opened Jan 10, 2025 by
AndrewMead10
updated Jan 16, 2025
About the CLS token for the llama3_2_vision_encoder
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2268
opened Jan 15, 2025 by
dfloreaa
updated Jan 15, 2025
adding support for LR schedule for full distributed finetune
best practice
Things we should be doing but aren't
better engineering
Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
triaged
This issue has been assigned an owner and appropriate label
#2263
opened Jan 13, 2025 by
tginart
updated Jan 15, 2025
Expose FSDP2 MixedPrecisionPolicy params
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2267
opened Jan 14, 2025 by
EugenHotaj
updated Jan 14, 2025
[Question] what to do when model doesn't have
tokenizer.model
?
high-priority
#2212
opened Dec 29, 2024 by
steveepreston
updated Jan 14, 2025
Overriding kv cache entries in torchtune models
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2241
opened Jan 9, 2025 by
telgamal-1
updated Jan 14, 2025
GPU Middle Class?
discussion
Start a discussion
distributed
Anything related to distributed env (multi-GPU, multi-node)
triaged
This issue has been assigned an owner and appropriate label
#2161
opened Dec 16, 2024 by
EugenHotaj
updated Jan 14, 2025
[feature request] support input/output to fsspec path
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2217
opened Dec 31, 2024 by
leoleoasd
updated Jan 14, 2025
Request: adding Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
triaged
This issue has been assigned an owner and appropriate label
py.typed
for type checkers
better engineering
#2258
opened Jan 13, 2025 by
jamesbraza
updated Jan 14, 2025
raise error when running registered config with incompatible recipe
better engineering
Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
#1550
opened Sep 12, 2024 by
felipemello1
updated Jan 14, 2025
hotw to estimate gpu memory needed for knowledge distillation?
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2213
opened Dec 30, 2024 by
chuangzhidan
updated Jan 14, 2025
Previous Next
ProTip!
Follow long discussions with comments:>50.