`IndexError: tuple index out of range` in `input_ids` during pre-training #199

ebrarkiziloglu · 2025-02-13T09:13:09Z

Following the instructions in the README, I am trying to pre-train from scratch. I launched training with the Composer framework using the `yamls/main/flex-bert-base.yaml` config and a local copy of the c4 dataset at `./my-copy-c4`.
(Note that I verified the dataloader works, following the instructions.)

However, I encountered the following error during training:

IndexError: tuple index out of range

  File "/.../ModernBERT/src/bert_layers/embeddings.py", line 153, in forward
    position_ids = self.position_ids[:, 0 : input_ids.shape[1]]
                                            ~~~~~~~~~~~~~~~^^^
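
For readers hitting the same message: `shape[1]` can only raise this error when the tensor has fewer than two dimensions. The sketch below reproduces the message with plain tuples (`torch.Size` is a tuple subclass, so indexing behaves the same way). The 1-D "unpadded" shape is an assumption about what reached the embedding layer, not something confirmed in this thread, and `seq_len_from_shape` is a hypothetical helper used only for illustration.

```python
def seq_len_from_shape(shape):
    """Mimic the `input_ids.shape[1]` lookup from the traceback."""
    return shape[1]


# (batch, seq_len): the 2-D layout the failing line expects.
padded_shape = (4, 128)

# (total_tokens,): a hypothetical 1-D layout, e.g. a flattened batch.
unpadded_shape = (512,)

print(seq_len_from_shape(padded_shape))  # 128

try:
    seq_len_from_shape(unpadded_shape)
except IndexError as exc:
    print(f"IndexError: {exc}")  # IndexError: tuple index out of range
```

Printing `input_ids.shape` just before the failing line would show whether this is what happens in a given run.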

Steps to Reproduce

1. Prepare the c4 dataset.
2. Set up the conda environment per the instructions.
3. Run training with `composer main.py yamls/main/flex-bert-base.yaml`.
4. The error occurs during training in `bert_layers/embeddings.py`.


NohTow · 2025-02-24T16:04:33Z

Sorry for the delay, I'll try to have a look at this.

NohTow · 2025-02-24T16:55:11Z

Hello again,

I just tested and there is indeed an issue in flex-bert-base.yaml.
Those configurations are outdated anyway, and you should be able to run your tests using these configurations instead; make sure to change the dataset path (I also set streaming to False, as well as sequence_packing).
We should merge this branch and update the README (and remove the useless/outdated configurations) ASAP to avoid issues like this. Sorry about that.
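
As a rough illustration of the adjustments mentioned above, here is a minimal sketch that flips a named key everywhere in a nested config dict. Only the key names `streaming` and `sequence_packing` come from this thread; the nesting, the `train_loader`/`eval_loader`/`dataset`/`local` names, the paths, and the `override_key` helper are all hypothetical, and setting both flags to False is one reading of the comment. Check the actual YAML for the real layout.

```python
import copy


def override_key(config, key, value):
    """Return a deep copy of `config` with every occurrence of `key` set to `value`."""
    updated = copy.deepcopy(config)
    stack = [updated]
    while stack:
        node = stack.pop()
        if isinstance(node, dict):
            if key in node:
                node[key] = value
            stack.extend(node.values())
        elif isinstance(node, list):
            stack.extend(node)
    return updated


# Hypothetical layout, for illustration only.
example_config = {
    "train_loader": {
        "dataset": {"local": "/old/path/to/c4", "streaming": True},
        "sequence_packing": True,
    },
    "eval_loader": {
        "dataset": {"local": "/old/path/to/c4", "streaming": True},
        "sequence_packing": True,
    },
}

patched = override_key(example_config, "streaming", False)
patched = override_key(patched, "sequence_packing", False)
patched = override_key(patched, "local", "./my-copy-c4")
```

Editing the YAML by hand achieves the same thing; the helper is only convenient when the same key appears under several loaders.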

(From what I understand, copying the model_config of ModernBERT into the old config worked; I think that's because we don't use positional encoding anymore. I won't debug much further, as we are deprecating those configurations.)

cc @warner-benjamin

onurgu · 2025-02-24T21:33:02Z

Hi, thank you for your comment. We saw the configurations and started using them, but we hit a wall again, which we resolved by fixing part of the code.

Could you also look at PR #205? That PR was also necessary for us to move forward.

ebrarkiziloglu mentioned this issue Feb 13, 2025: How to run pre-training? #155 (Open)
