
Config file #153

Conversation

internetcoffeephone
Contributor

Implemented the config file as mentioned in #151.

4 questions:

  • All run scripts work except train_baseline_actions_dqn; I presume you're working on a private/unpushed ray branch?
  • Currently wondering: should we get rid of tf.app.flags and retrieve values directly from the dictionary created by config_parser?
  • tf.app.flags adds the benefit of command-line arguments; are you currently using those?
  • Some parameters have not been implemented or are commented out, specifically the redis/memory/debug parameters. Either the debug flag is no longer needed, or would you prefer debug-specific values for these parameters?
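On the second and third questions, one way to keep command-line overrides while dropping tf.app.flags is to layer argparse on top of the config dictionary. A minimal sketch, using the stdlib configparser as a stand-in for the project's config_parser module (`load_params` and the `experiment` section name are hypothetical):

```python
import argparse
import configparser

def load_params(config_path, cli_args=None):
    """Read parameters from a config file, then apply optional
    command-line overrides (every config key becomes a --key flag)."""
    config = configparser.ConfigParser()
    config.read(config_path)
    params = dict(config["experiment"])  # e.g. {"num_steps": "3e8", ...}

    parser = argparse.ArgumentParser()
    for key in params:
        parser.add_argument(f"--{key}", default=None)
    overrides = parser.parse_args(cli_args)

    # Only keys explicitly passed on the command line replace config values.
    for key, value in vars(overrides).items():
        if value is not None:
            params[key] = value
    return params
```

This keeps the config file as the single source of defaults while preserving the one thing tf.app.flags provided: ad-hoc overrides for individual runs.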

I haven't reproduced the results from the paper yet. Currently it takes me 6 days to run the 3e8 steps required per experiment; I'm in the process of requesting more powerful hardware.

internetcoffeephone and others added 11 commits May 8, 2019 22:04
Added localconfig requirement.
Many user-specific and experiment-specific parameters have been moved to the config file.
Renamed train scripts to have consistent names.
Now follows the convention: train_[experiment]_[algorithm].py
Experiment results are written to folders with the following naming convention:
[experiment]_[algorithm]_[environment], where environment is either cleanup or harvest.
Removed train_baseline as it is redundant with train_baseline_a3c.
Removed exp_name check, as this is now always handled by config_parser.
Simplified access to hyperparameters.
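The result-folder convention from the commits above ([experiment]_[algorithm]_[environment]) could be centralized in a small helper so every train script names its output the same way. A sketch (the helper name is hypothetical):

```python
def results_dir(experiment, algorithm, environment):
    """Build the output folder name [experiment]_[algorithm]_[environment],
    where environment is either cleanup or harvest."""
    if environment not in ("cleanup", "harvest"):
        raise ValueError(f"Unknown environment: {environment}")
    return f"{experiment}_{algorithm}_{environment}"
```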
@internetcoffeephone
Contributor Author

Additionally, I imagine some train_* files can be merged when parametrized; there's a lot of duplicated code in there. Separating the experiment categories (baseline, visible actions, influence, moa) from their algorithms (A3C, A2C, DQN) so they can vary independently would be ideal, although I'm not sure whether it's easy to do.
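One way to sketch that separation is a pair of lookup tables, so experiments and algorithms vary independently and each (experiment, algorithm) pair needs no dedicated script. All names here (setup_baseline, setup_moa, build_trainer) are hypothetical:

```python
# Hypothetical per-experiment setup functions: each returns the
# experiment-specific config, independent of the training algorithm.
def setup_baseline(params):
    return {"model": "baseline", **params}

def setup_moa(params):
    return {"model": "moa", **params}

EXPERIMENTS = {"baseline": setup_baseline, "moa": setup_moa}
ALGORITHMS = {"a3c": "A3C", "a2c": "A2C", "dqn": "DQN"}

def build_trainer(experiment, algorithm, params):
    """Combine any experiment with any algorithm; unsupported
    combinations fail loudly instead of silently misconfiguring."""
    if experiment not in EXPERIMENTS or algorithm not in ALGORITHMS:
        raise ValueError(f"Unknown combination: {experiment}/{algorithm}")
    config = EXPERIMENTS[experiment](params)
    config["run"] = ALGORITHMS[algorithm]
    return config
```

A single train.py could then read the experiment and algorithm from the config file and dispatch through `build_trainer`, replacing the per-combination scripts.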

… file.

Curriculum is not used yet, as it does not cleanly map to a single parameter.
Made some numbers more clear (100e6 -> 1e8)
Renamed train_influence_a3c to train_influence_moa.
Renamed train_moa_a3c to train_moa_baseline.
@internetcoffeephone
Contributor Author

These changes are all very outdated, and I'm getting rid of config_parser in my fork. Thus, closing.
