Skip to content

Issues: pytorch/torchtune

v0.5.0 tracker
#2008 opened Nov 14, 2024 by joecummings
Open
Testing tracker
#1890 opened Oct 23, 2024 by felipemello1
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Llama3.1 models do not allow configuring max_seq_len bug Something isn't working triaged This issue has been assigned an owner and appropriate label
#2202 opened Dec 23, 2024 by akashc1
How to use float8 for training?
#2201 opened Dec 23, 2024 by vgoklani
Adaptive batching
#2191 opened Dec 20, 2024 by krammnic
Model request. Phi4 enhancement New feature or request triaged This issue has been assigned an owner and appropriate label
#2190 opened Dec 20, 2024 by krammnic
Add multiprocess dataset packing enhancement New feature or request triaged This issue has been assigned an owner and appropriate label
#2180 opened Dec 19, 2024 by bratao
load_in_8bit for model quantization
#2178 opened Dec 18, 2024 by SUMEETRM
Add sample packing for DPO, PPO enhancement New feature or request rlhf Anything related to reinforcement learning w/ human feedback
#2177 opened Dec 18, 2024 by SalmanMohammadi
Federated fine-tuning recipes
#2170 opened Dec 18, 2024 by krammnic
GPU Middle Class? discussion Start a discussion distributed Anything related to distributed env (multi-GPU, multi-node) triaged This issue has been assigned an owner and appropriate label
#2161 opened Dec 16, 2024 by EugenHotaj
Move update_recipe_state to its own util best practice Things we should be doing but aren't triaged This issue has been assigned an owner and appropriate label
#2158 opened Dec 13, 2024 by joecummings
Invalid kwarg fused passed to bitsandbytes AdamW8bit better engineering Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
#2152 opened Dec 12, 2024 by mlazos
70B Fine-tuning GPUs Utilization discussion Start a discussion distributed Anything related to distributed env (multi-GPU, multi-node)
#2142 opened Dec 10, 2024 by fabiogeraci
Are there any plans to support context parallel? enhancement New feature or request
#2141 opened Dec 10, 2024 by dz1iang
[small bug + generalization] saving config.yaml to output_dir better engineering Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs bug Something isn't working
#2137 opened Dec 9, 2024 by felipemello1
Query on Gradient accumulation discussion Start a discussion
#2134 opened Dec 9, 2024 by Vattikondadheeraj
[RFC] Unify activation checkpointing APIs rfc Request for comments
#2114 opened Dec 5, 2024 by ebsmothers
ProTip! Add no:assignee to see everything that’s not assigned.