Add use_pretrained attribute for AutoTransformers #3498
Conversation
Will add a test

Okay this is probably wrong, going to close and discuss this first.
```diff
@@ -3092,6 +3097,10 @@ def module_name():
         description=ENCODER_METADATA["AutoTransformer"]["type"].long_description,
     )

+    # Always set this to True since we always want to use the pretrained weights
+    # We don't currently support training from scratch for AutoTransformers
+    use_pretrained: bool = True
```
Let's make this a property so the user could never modify it.
```python
@property
def use_pretrained(self) -> bool:
    return True
```
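To illustrate the suggestion, here is a minimal sketch (the class name is hypothetical, not Ludwig's actual schema class) showing why exposing `use_pretrained` as a getter-only property prevents users from modifying it:

```python
class AutoTransformerConfig:
    """Hypothetical minimal config class illustrating a read-only attribute."""

    @property
    def use_pretrained(self) -> bool:
        # Always True: training AutoTransformers from scratch is unsupported.
        return True


config = AutoTransformerConfig()
print(config.use_pretrained)  # True

try:
    config.use_pretrained = False  # property has no setter
except AttributeError:
    print("use_pretrained is read-only")
```

Because the property defines no setter, any assignment raises `AttributeError`, so the invariant cannot be broken from user code.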
```diff
@@ -292,3 +289,42 @@ def test_tfidf_encoder(vocab_size: int):
     inputs = torch.randint(2, (batch_size, sequence_length)).to(DEVICE)
     outputs = text_encoder(inputs)
     assert outputs[ENCODER_OUTPUT].shape[1:] == text_encoder.output_shape
+
+
+def test_hf_auto_transformer_use_pretrained(tmpdir, csv_filename):
```
nit: The `True` case would be tested implicitly elsewhere, correct? If not, maybe we could parametrize the test with both cases.
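A sketch of what the parametrized version could look like (the `encoder_config` helper is hypothetical, standing in for the real `text_feature(...)` harness used in the test suite):

```python
import pytest


def encoder_config(use_pretrained: bool) -> dict:
    # Hypothetical helper: the real test would build a text feature and
    # run training/encoding end to end.
    return {"type": "auto_transformer", "use_pretrained": use_pretrained}


@pytest.mark.parametrize("use_pretrained", [True, False])
def test_hf_auto_transformer_use_pretrained(use_pretrained: bool):
    config = encoder_config(use_pretrained)
    assert config["use_pretrained"] is use_pretrained
```

With `pytest.mark.parametrize`, pytest runs the test once per value, so both the pretrained and from-scratch paths get exercised by a single test function.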
```python
text_feature(
    encoder={
        "type": "auto_transformer",
        "use_pretrained": False,
```
Ideally this should be an error if we were more strict with our config validation rules. We should instead just leave this out of the config.
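A stricter validation rule could reject the key outright. A minimal sketch of that idea (not Ludwig's actual validation code; the function and allowed-key set are hypothetical):

```python
def validate_encoder_config(config: dict, allowed_keys: set) -> None:
    # Hypothetical strict check: fail on keys the schema does not declare,
    # instead of silently accepting and ignoring them.
    extra = set(config) - allowed_keys
    if extra:
        raise ValueError(f"Unknown encoder keys: {sorted(extra)}")


ALLOWED = {"type", "pretrained_model_name_or_path"}

validate_encoder_config({"type": "auto_transformer"}, ALLOWED)  # passes

try:
    validate_encoder_config(
        {"type": "auto_transformer", "use_pretrained": False}, ALLOWED
    )
except ValueError as err:
    print(err)  # Unknown encoder keys: ['use_pretrained']
```

This mirrors `additionalProperties: false` in JSON-schema-style validation: unknown fields become hard errors rather than silent no-ops.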
Fixes the following error when trying to train a custom transformer model from HF using a config that looks like this: