Thursday, April 26, 2012

Results: Horizon 10 Tiger Problem

Experimental Set-up
  • Mean value is taken over 10,000 episodes
  • Error bars represent standard error
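The two statistics above can be sketched as follows. This is a minimal illustration, not the code used to produce the plots: the function name `mean_and_standard_error` is hypothetical, and it assumes the standard error is the sample standard deviation (n - 1 denominator) divided by the square root of the number of episodes, which the post does not state explicitly.

```python
import math
from statistics import mean, stdev


def mean_and_standard_error(episode_returns):
    """Return (mean, standard error) of a list of per-episode returns.

    Assumption: standard error = sample standard deviation / sqrt(n),
    where the sample standard deviation uses the n - 1 denominator.
    """
    n = len(episode_returns)
    if n < 2:
        raise ValueError("need at least two episodes to estimate the standard error")
    avg = mean(episode_returns)
    sem = stdev(episode_returns) / math.sqrt(n)
    return avg, sem


if __name__ == "__main__":
    # Made-up returns for illustration only; not data from the experiment.
    example_returns = [1.0, 2.0, 3.0, 4.0]
    avg, sem = mean_and_standard_error(example_returns)
    print(f"mean = {avg:.3f}, standard error = {sem:.3f}")
```

With 10,000 episodes the standard error shrinks by a factor of 100 relative to the per-episode standard deviation, which is why the error bars on a mean over that many episodes can be very narrow.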
Results
  • With random baseline
First 10,000 samples

First 1,000 samples

  • Without random baseline
First 10,000 samples

First 1,000 samples
