-
Notifications
You must be signed in to change notification settings - Fork 61
Evaluating Policies
Neo-X edited this page Feb 13, 2018
·
1 revision
We can evaluate policies to examine the progress of the learning algorithms.
- to evaluate them automatically, use scripts/poli_eval/poli_val.py
- you will have to point it to the directory containing the intermediate files using poli_files_dir
- and you'll have to provide it with an arg file that specifies the argumenets
- the default is args/opt_int_poli_eval.txt
- but you should probably use a separate one for the hopper
- you'll also specify the output file in the arg file
- the output of the script will be a text file where each line contains the distance traveled for each individual expisode for a given policy
- the default uses 100 episodes, so there should be 100 numbers per line
- to generate the plots
- use scripts/poli_eval/plot_eval.m
- you just have to add the path to the eval output
- and it should generate the training curve automatically