Skip to content
This repository has been archived by the owner on Aug 2, 2022. It is now read-only.
/ Thesis Public archive

My thesis work on exploring the performance impact of the exploration strategy

License

Notifications You must be signed in to change notification settings

TNieuwdorp/Thesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exploration vs Exploitation

This is the piece of code I used for my thesis, so I could get to know TensorFlow and OpenAI gym. In this thesis I explore the tradeoff between exploration of new actions and exploitation of known working strategies. I compare several different strategies such as argmax (only exploitation), e-greedy, decaying e-greedy and softmax. The agent is a multilayer perceptron.

Requirements for the code to run:

  • TensorFlow
  • openAI gym

About

My thesis work on exploring the performance impact of the exploration strategy

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages