Add functions for input-masked loss calculation and batching #825
base: main
Conversation
llms/mlx_lm/tuner/trainer.py
Outdated
def iterate_input_masked_batches(
    input_text, output_text, tokenizer, max_seq_length=2048
):
    batch_size = len(input_text)
Why set the batch size to the length of the dataset?
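For context, iterate_batches-style loops typically take the batch size as a fixed hyperparameter and walk the dataset in chunks of that size. A minimal sketch of that pattern (names here are hypothetical, not the PR's API):

# Conventional fixed-size batching: batch_size is a fixed hyperparameter
# stepped over the dataset in chunks, not set to the dataset length.
def iterate_fixed_batches(examples, batch_size=4):
    for i in range(0, len(examples), batch_size):
        yield examples[i : i + batch_size]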
llms/mlx_lm/tuner/trainer.py
Outdated
    input_lengths = mx.array(input_lengths)
    lengths = mx.array(adjusted_lengths)

    return batch[:, :-1], input_lengths, lengths
This only returns one example? Is that intentional? I assumed this is a drop-in replacement for iterate_batches, but it's not clear that's the case from how it's written.
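For comparison, a drop-in replacement for iterate_batches would be a generator that yields one padded batch at a time, together with the length vectors the masked loss needs. A rough sketch under those assumptions (function and argument names are illustrative, not the PR's exact code):

import mlx.core as mx
import numpy as np

def iterate_masked_batches(prompt_ids, completion_ids, batch_size=4, max_seq_length=2048):
    # prompt_ids / completion_ids: lists of token-id lists, one pair per example.
    for i in range(0, len(prompt_ids), batch_size):
        prompts = prompt_ids[i : i + batch_size]
        completions = completion_ids[i : i + batch_size]
        full = [p + c for p, c in zip(prompts, completions)]
        lengths = [min(len(seq), max_seq_length) for seq in full]
        max_len = max(lengths)
        # Right-pad every sequence with zeros to the longest length in the batch.
        batch = np.zeros((len(full), max_len), dtype=np.int32)
        for j, seq in enumerate(full):
            batch[j, : lengths[j]] = seq[: lengths[j]]
        input_lengths = [min(len(p), max_seq_length) for p in prompts]
        # Yield one batch per step, like iterate_batches does.
        yield mx.array(batch), mx.array(input_lengths), mx.array(lengths)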
… an updated attempt to better sync with iterate_batches logic
Adds support for completion-only finetuning: functions that iterate over batches while computing input masks (with padding), plus a loss function that applies those masks.
-- Updated 5 months later to keep pace with mlx(_lm) changes, etc.
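The core idea of an input-masked loss is to compute cross-entropy over the whole shifted sequence but zero out positions belonging to the prompt, so only completion tokens contribute. A minimal sketch in MLX, assuming the batch layout above (names are illustrative, not the PR's exact implementation):

import mlx.core as mx
import mlx.nn as nn

def input_masked_loss(model, batch, input_lengths, lengths):
    # Next-token setup: predict batch[:, 1:] from batch[:, :-1].
    logits = model(batch[:, :-1])
    targets = batch[:, 1:]

    # Step index of every target position, broadcast against the batch.
    steps = mx.arange(targets.shape[1])[None, :]

    # The target at step t is original token t + 1, so completion targets are
    # the steps with input_length - 1 <= t < length - 1; prompt tokens and
    # padding contribute nothing to the loss.
    mask = mx.logical_and(
        steps >= input_lengths[:, None] - 1,
        steps < lengths[:, None] - 1,
    )

    ce = nn.losses.cross_entropy(logits, targets) * mask
    ntoks = mask.sum()
    return ce.sum() / ntoks, ntoks

Returning the (loss, token-count) pair keeps the shape of mlx_lm's default loss, which is what lets a batching generator like the one above slot in where iterate_batches is used today.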