Sunday, July 29, 2012

Feedback 29-07-2012

Progress
  • Implemented visualization for light-dark domain in RL-Viz:
 










 



  • Finished time-based performance experiment for light-dark domain (see below)
  • Thesis: 
    • Included the experiment mentioned before
    • Some small language / content corrections
Experimental Set-up
  • Environment: 10x10 discrete state space light-dark domain
  • Max. steps: 20
  • Discount factor: 0.95
  • Number of episodes: 2,000
Results
time vs roll-outs

time vs mean

No comments:

Post a Comment