Skip to content

Actions: hiyouga/LLaMA-Factory

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
3,579 workflow runs
3,579 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Potential Support for Mistral Small 24B Models?
label_issue #2136: Issue #6782 opened by HuangZhen02
January 31, 2025 03:42 11s
January 31, 2025 03:42 11s
January 30, 2025 20:15 9s
[model] add qwen2.5 vl models (#6779)
tests #1889: Commit 999c7c8 pushed by hiyouga
January 30, 2025 19:00 8m 54s main
January 30, 2025 19:00 8m 54s
[model] add qwen2.5 vl models
tests #1888: Pull request #6779 opened by hiyouga
January 30, 2025 18:49 9m 7s hiyouga/add_qwen2_5_vl
January 30, 2025 18:49 9m 7s
[breaking] support transformers 4.48 (#6628)
tests #1887: Commit 15357cd pushed by hiyouga
January 30, 2025 17:36 7m 59s main
January 30, 2025 17:36 7m 59s
[version] support transformers 4.48 & Byebye python 3.8
tests #1886: Pull request #6628 synchronize by hiyouga
January 30, 2025 17:25 14m 13s hiyouga/upd_hf_4_48
January 30, 2025 17:25 14m 13s
[version] support transformers 4.48 & Byebye python 3.8
tests #1885: Pull request #6628 synchronize by hiyouga
January 30, 2025 16:21 8m 40s hiyouga/upd_hf_4_48
January 30, 2025 16:21 8m 40s
[webui] improve webui & reasoning mode (#6778)
tests #1884: Commit 45e68b9 pushed by hiyouga
January 30, 2025 16:09 8m 41s main
January 30, 2025 16:09 8m 41s
[webui] improve webui & reasoning mode
tests #1883: Pull request #6778 opened by hiyouga
January 30, 2025 16:02 8m 51s hiyouga/improve_r1
January 30, 2025 16:02 8m 51s
Fp8 quantization
label_issue #2133: Issue #6777 opened by HARISHSENTHIL
January 29, 2025 09:28 11s
January 29, 2025 09:28 11s
Template deepseekr1 does not exist
label_issue #2132: Issue #6776 opened by Fangkang515
January 29, 2025 07:35 9s
January 29, 2025 07:35 9s
[model] add deepseek-R1 & show think process (#6767)
tests #1882: Commit 28417f8 pushed by hiyouga
January 29, 2025 04:16 8m 29s main
January 29, 2025 04:16 8m 29s
有计划支持Deepseek的janus pro微调么
label_issue #2131: Issue #6775 opened by mkygogo
January 28, 2025 15:58 10s
January 28, 2025 15:58 10s
Add DeepSeek-R1 and its distilled model (Qwen&Llama)
tests #1881: Pull request #6767 synchronize by Qwtdgh
January 28, 2025 14:45 7m 40s Qwtdgh:main
January 28, 2025 14:45 7m 40s
Add DeepSeek-R1 and its distilled model (Qwen&Llama)
tests #1880: Pull request #6767 synchronize by Qwtdgh
January 28, 2025 13:55 Action required Qwtdgh:main
January 28, 2025 13:55 Action required
Add DeepSeek-R1 and its distilled model (Qwen&Llama)
tests #1879: Pull request #6767 synchronize by Qwtdgh
January 28, 2025 13:46 Action required Qwtdgh:main
January 28, 2025 13:46 Action required
Q_APOLLO?
label_issue #2130: Issue #6774 opened by inflatebot
January 28, 2025 12:47 10s
January 28, 2025 12:47 10s
Qwen2.5-VL support
label_issue #2128: Issue #6772 opened by tristanwqy
January 28, 2025 03:49 10s
January 28, 2025 03:49 10s
MiniCPM-o-2_6视频处理存在问题
label_issue #2127: Issue #6770 opened by jinzhuoran
January 27, 2025 14:54 11s
January 27, 2025 14:54 11s
Multiple Dataset Training Help
label_issue #2126: Issue #6769 opened by JiwenJ
January 27, 2025 13:18 10s
January 27, 2025 13:18 10s
Qwen2-VL多图推理
label_issue #2125: Issue #6768 opened by XiruiTeng
January 27, 2025 12:50 12s
January 27, 2025 12:50 12s
Add DeepSeek-R1 and its distilled model (Qwen&Llama)
tests #1878: Pull request #6767 opened by Qwtdgh
January 27, 2025 03:48 Action required Qwtdgh:main
January 27, 2025 03:48 Action required
training_args.parallel_mode param questions
label_issue #2124: Issue #6766 opened by boyu-zhu
January 27, 2025 02:47 12s
January 27, 2025 02:47 12s