
add fine-tuning code with lora support #44

Open · wants to merge 1 commit into main

Conversation

@Muhtasham Muhtasham commented Aug 25, 2024

Closes #35, #41, #58
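
For context, a minimal sketch of what a LoRA fine-tuning setup like the one this PR adds typically looks like with the Hugging Face peft library; the base model name, target modules, and hyperparameters below are illustrative assumptions, not values taken from this PR's code:

```python
# Hedged sketch of a typical LoRA setup; model name and hyperparameters
# are placeholders, not this PR's actual values.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "deepseek-ai/deepseek-coder-1.3b-base"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

# Wrap the base model with low-rank adapters; only these small matrices
# are trained, while the original weights stay frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of all weights
```

The wrapped model can then be passed to a standard transformers Trainer; at save time only the adapter weights need to be written out.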

@Muhtasham Muhtasham closed this Aug 25, 2024
@Muhtasham Muhtasham reopened this Aug 25, 2024
@Muhtasham (Author)

@guoday

@kael53 commented Jan 29, 2025

I need this to fine-tune

@BehsadRiemer commented Jan 30, 2025

I remember you telling me about Deepseek in May, props to you @Muhtasham 🏃‍♂️

@adamreed90

@Muhtasham Shouldn't the prompt built in build_instruction_prompt match the example in the README.md:

```
<|begin▁of▁sentence|>User: {user_message_1}

Assistant: {assistant_message_1}<|end▁of▁sentence|>User: {user_message_2}

Assistant:
```

Apologies if I misunderstand.

@Muhtasham (Author)

@adamreed90 Yes, this PR is specifically for instruction fine-tuning, so the prompt format in build_instruction_prompt is intentionally different from the chat-based format.

As I pointed out in the new README.md, for training data preparation please follow the Sample Dataset Format.

If you're bringing a dataset in a different format (such as chat-based), it would require modification.
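
For illustration, a minimal sketch of an instruction-style prompt builder and a matching dataset record, assuming an Alpaca-style ### Instruction / ### Response layout; the actual system text, markers, and field names in the PR's build_instruction_prompt and Sample Dataset Format may differ:

```python
# Hypothetical sketch of an instruction-style prompt builder; the actual
# system text and markers in the PR's build_instruction_prompt may differ.
def build_instruction_prompt(instruction: str) -> str:
    return (
        "You are an AI programming assistant. Answer the question below.\n"
        "### Instruction:\n"
        f"{instruction.strip()}\n"
        "### Response:\n"
    )

# A matching dataset record would then be a JSON line such as:
# {"instruction": "Write a Python function that reverses a string.",
#  "output": "def reverse(s):\n    return s[::-1]"}
# During training, build_instruction_prompt(instruction) is concatenated
# with output, and the prompt tokens are masked out of the loss; this is
# why the format differs from the chat template quoted above.
```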


Successfully merging this pull request may close these issues.

How to build a fine-tuning dataset for code completion?