Friday, April 27, 2012

Results: Horizon 100 Tiger Problem

Experimental Set-up
  • Mean value is taken over 1,000 episodes
  • Error bars represent standard error
Results
  • With random baseline
  • First 5,000 samples
 











 
  • Without random baseline
  • First 5,000 samples


No comments:

Post a Comment