-
Notifications
You must be signed in to change notification settings - Fork 91
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to run pre-training? #155
Comments
Was wondering about this, too! |
same question |
same question. |
Hello, Note that the data path should point to data folder in the MDS format, you have and example with the C4 dataset here. Again, sorry for the delay and hopefully we'll have better documentation soon. |
@NohTow Hi there! Any update on a proper step-by-step guide for pretraining? |
Hello, Until we update the readmes and merge the configs, the above comment is the closest thing to a step-by-step guide. Edit: actually, I forgot but #183 that adds a bit of documentation to the main readme has been merged, so besides merging the configs, is there anything you are missing? |
Thank you for the great work! I have a question regarding pre-training. Could you please clarify which YAML configuration file should be used to achieve a similar pre-training setup as ModernBert, but for a different language? I noticed that in the yamls folder, there doesn’t seem to be a specific file for this purpose. The only related script I found is generate_eval_config.py, which, if I understand correctly, generates a YAML configuration using ModernBert’s training params. Is my understanding correct, or am I missing something?
The text was updated successfully, but these errors were encountered: