tum-adlr-10

TODO

Introduction to our topic/Introduction to the problem we are trying to solve (Yufei)
- hard to explore, use as few as possible samples to learn physical dynamic
- Mention the difference to traditional reinforcement learning
Presentation of active learning and random sampling shooting via flow chart (Ben)
- Active Learning: Algorithm 1
- RS: Algorithm 4
Experiments we have conducted so far including plots (Yufei)
- Environment we use (mass-spring-damper system)
- Model performances BNN
  - Learning curves including train and test error
- Active Learning evaluation
  - RS implementation
  - explain the plot
  - explain the results
- Plot for exploration efficiency
  - visualization methods
    - Save weights for le every activearning iteration to plot bayesian prediction variance (Yufei)
    - Create training plots for every ative learning iteration (Ben)
What are the next milestones? Are there any changes to the research hypothesis or problem statement from the pro-posal? (Ben)
- Compare to other approaches from paper (soft-actor critic)
- (Increase the model complexity)
- instead of scaling complexity, we might want to delve deeper into BNN and study if it is reasonable to use MC-Dropout for this task and we'll try other active learning method

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
environments		environments
experiments		experiments
metrics		metrics
models		models
sampling_methods		sampling_methods
utils		utils
.gitignore		.gitignore
README.md		README.md
active_learning.py		active_learning.py
requirements.txt		requirements.txt
train.py		train.py