Workflow runs · hiyouga/LLaMA-Factory

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows

3,579 workflow runs

Potential Support for Mistral Small 24B Models? label_issue #2136: Issue #6782 opened by HuangZhen02

January 31, 2025 03:42

11s

January 31, 2025 03:42

11s

ValueError: The checkpoint you are trying to load has model type llava_mistral but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date. label_issue #2135: Issue #6781 opened by dainini

January 30, 2025 21:23

Is there specific reason not to set vision=True for MiniCPM-o-2_6-Chat and MiniCPM-V-2_6-Chat label_issue #2134: Issue #6780 opened by andrewliao11

January 30, 2025 20:15

[model] add qwen2.5 vl models (#6779) tests #1889: Commit 999c7c8 pushed by hiyouga

January 30, 2025 19:00

8m 54s main

main

January 30, 2025 19:00

8m 54s

[model] add qwen2.5 vl models tests #1888: Pull request #6779 opened by hiyouga

January 30, 2025 18:49

9m 7s hiyouga/add_qwen2_5_vl

hiyouga/add_qwen2_5_vl

January 30, 2025 18:49

9m 7s

[breaking] support transformers 4.48 (#6628) tests #1887: Commit 15357cd pushed by hiyouga

January 30, 2025 17:36

7m 59s main

main

January 30, 2025 17:36

7m 59s

[version] support transformers 4.48 & Byebye python 3.8 tests #1886: Pull request #6628 synchronize by hiyouga

January 30, 2025 17:25

14m 13s hiyouga/upd_hf_4_48

hiyouga/upd_hf_4_48

January 30, 2025 17:25

14m 13s

[version] support transformers 4.48 & Byebye python 3.8 tests #1885: Pull request #6628 synchronize by hiyouga

January 30, 2025 16:21

8m 40s hiyouga/upd_hf_4_48

hiyouga/upd_hf_4_48

January 30, 2025 16:21

8m 40s

[webui] improve webui & reasoning mode (#6778) tests #1884: Commit 45e68b9 pushed by hiyouga

January 30, 2025 16:09

8m 41s main

main

January 30, 2025 16:09

8m 41s

[webui] improve webui & reasoning mode tests #1883: Pull request #6778 opened by hiyouga

January 30, 2025 16:02

8m 51s hiyouga/improve_r1

hiyouga/improve_r1

January 30, 2025 16:02

8m 51s

Fp8 quantization label_issue #2133: Issue #6777 opened by HARISHSENTHIL

January 29, 2025 09:28

11s

January 29, 2025 09:28

11s

Template deepseekr1 does not exist label_issue #2132: Issue #6776 opened by Fangkang515

January 29, 2025 07:35

[model] add deepseek-R1 & show think process (#6767) tests #1882: Commit 28417f8 pushed by hiyouga

January 29, 2025 04:16

8m 29s main

main

January 29, 2025 04:16

8m 29s

有计划支持Deepseek的janus pro微调么 label_issue #2131: Issue #6775 opened by mkygogo

January 28, 2025 15:58

10s

January 28, 2025 15:58

10s

Add DeepSeek-R1 and its distilled model (Qwen&Llama) tests #1881: Pull request #6767 synchronize by Qwtdgh

January 28, 2025 14:45

7m 40s Qwtdgh:main

Qwtdgh:main

January 28, 2025 14:45

7m 40s

Add DeepSeek-R1 and its distilled model (Qwen&Llama) tests #1880: Pull request #6767 synchronize by Qwtdgh

January 28, 2025 13:55

Action required Qwtdgh:main

Qwtdgh:main

January 28, 2025 13:55

Action required

Add DeepSeek-R1 and its distilled model (Qwen&Llama) tests #1879: Pull request #6767 synchronize by Qwtdgh

January 28, 2025 13:46

Action required Qwtdgh:main

Qwtdgh:main

January 28, 2025 13:46

Action required

Q_APOLLO? label_issue #2130: Issue #6774 opened by inflatebot

January 28, 2025 12:47

10s

January 28, 2025 12:47

10s

When the part of Running training started, running speed is very low. Could anyone solve this problem？ thanks label_issue #2129: Issue #6773 opened by lxcxjxhx1

January 28, 2025 06:01

12s

January 28, 2025 06:01

12s

Qwen2.5-VL support label_issue #2128: Issue #6772 opened by tristanwqy

January 28, 2025 03:49

10s

January 28, 2025 03:49

10s

MiniCPM-o-2_6视频处理存在问题 label_issue #2127: Issue #6770 opened by jinzhuoran

January 27, 2025 14:54

11s

January 27, 2025 14:54

11s

Multiple Dataset Training Help label_issue #2126: Issue #6769 opened by JiwenJ

January 27, 2025 13:18

10s

January 27, 2025 13:18

10s

Qwen2-VL多图推理 label_issue #2125: Issue #6768 opened by XiruiTeng

January 27, 2025 12:50

12s

January 27, 2025 12:50

12s

Add DeepSeek-R1 and its distilled model (Qwen&Llama) tests #1878: Pull request #6767 opened by Qwtdgh

January 27, 2025 03:48

Action required Qwtdgh:main

Qwtdgh:main

January 27, 2025 03:48

Action required

training_args.parallel_mode param questions label_issue #2124: Issue #6766 opened by boyu-zhu

January 27, 2025 02:47

12s

January 27, 2025 02:47

12s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management

All workflows

Actions

Loading...
Loading

All workflows

Filter by Event

Sorry, something went wrong.

Sorry, something went wrong.

No matching events.

Filter by Status

Sorry, something went wrong.

Sorry, something went wrong.

No matching statuses.

Filter by Branch

Sorry, something went wrong.

Sorry, something went wrong.

No matching branches.

Filter by Actor

Sorry, something went wrong.

Sorry, something went wrong.

No matching users.

Actions: hiyouga/LLaMA-Factory

Actions

All workflows All workflows Actions Loading... Loading Sorry, something went wrong.

All workflows

All workflows

Actions

Loading...
Loading