How to use generate_instructions.py script? #43

Open
beybars1 opened this issue Jun 17, 2024 · 1 comment

Comments

@beybars1

Hey Ilya, I really appreciate your and your team's open-source contributions.

I have a question about the generate_instructions.py file. How can I run it, and which arguments should I pass in? Is there an example of how to generate an instruction dataset with this script?

@IlyaGusev
Owner

Hey, thanks :)

The list of parameters is here

Run it from rulm/self_instruct:

python3 -m src.data_processing.generate_instructions \
  --output-path output.jsonl \
  --seed-tasks-path data/ru_alpaca_seed_tasks.jsonl \
  --settings-path external_prompts/ru_gen_settings.json \
  --template-path external_prompts/ru_instruct.txt
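Since the paths in that command are relative, a quick pre-flight check can catch a wrong working directory before the script itself starts. The following is only a sketch: `missing_inputs` and `build_command` are hypothetical helpers written for this illustration (they are not part of the repository), and the flags and paths are copied verbatim from the command above.

```python
import sys
from pathlib import Path

# Flags and values copied from the command above; paths are relative to
# rulm/self_instruct. Nothing here is taken from the script's own source.
ARGS = {
    "--output-path": "output.jsonl",
    "--seed-tasks-path": "data/ru_alpaca_seed_tasks.jsonl",
    "--settings-path": "external_prompts/ru_gen_settings.json",
    "--template-path": "external_prompts/ru_instruct.txt",
}
# Flags whose values must already exist as files before the run.
INPUT_FLAGS = ("--seed-tasks-path", "--settings-path", "--template-path")


def missing_inputs(root="."):
    """Return the input paths from ARGS that are not files under root."""
    return [ARGS[f] for f in INPUT_FLAGS if not (Path(root) / ARGS[f]).is_file()]


def build_command(python=sys.executable):
    """Build the argv list for the module invocation shown above."""
    cmd = [python, "-m", "src.data_processing.generate_instructions"]
    for flag, value in ARGS.items():
        cmd += [flag, value]
    return cmd


missing = missing_inputs()
if missing:
    print(f"Missing input files (wrong working directory?): {missing}")
else:
    print("Inputs found. Command to run:")
    print(" ".join(build_command()))
```

If the inputs are found, the printed command can be pasted into a shell, or the list from `build_command()` can be passed to `subprocess.run`.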

The script is ancient, though, so many things can go wrong. The most obvious one is that you will need an old version of the OpenAI wrapper.
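To see which version of the wrapper is installed before running anything, a small check like the one below may help. This is a sketch under an assumption: the thread does not say which version the script expects, so the code treats "old" as meaning a pre-1.0 release of the `openai` package, which is a guess. `major_version` and `describe_openai_install` are hypothetical helper names.

```python
from importlib.metadata import PackageNotFoundError, version


def major_version(version_string):
    """Return the leading integer of a version string such as '0.28.1'."""
    return int(version_string.split(".")[0])


def describe_openai_install():
    """Describe the installed openai package, if there is one."""
    try:
        installed = version("openai")
    except PackageNotFoundError:
        return "openai is not installed"
    if major_version(installed) >= 1:
        # Assumption: "old version" in the reply means the pre-1.0 interface.
        return f"openai {installed} is installed; the script may expect a pre-1.0 release"
    return f"openai {installed} is installed (pre-1.0 interface)"


print(describe_openai_install())
```

If a downgrade turns out to be needed, doing it inside a separate virtual environment keeps it from affecting other projects.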
