can you provide a regret-like plot (although we don't know if the magic guesser reward is possible, take that as an optimum for now)?
Also, a comparison to your non TT version would be great, i.e., if you can compare TT vs OT. I like both variants of keep 1st, but perfect recall appears to be dominating it (better in all cases), so it does not have the problem we expected (dip if splitting just before the end)?
If you perform a split, do you hand down the action tree of that split to the newly generated node?
Looking forward to the comparison and regret-like plots. I think you are on to something here, considering that the performance after one playout is already 'close' to optimal. Best regards, Michael
'If you perform a split, do you hand down the action tree of that split to the newly generated node?'
That is exactly what the 'insertion' strategy does.
'I like both variants of keep 1st, but perfect recall appears to be dominating it (better in all cases), so it does not have the problem we expected (dip if splitting just before the end)?'
I think that with 'perfect recall', this problem does not occur - only with 'deletion' and 'insertion'.
Dear Andreas,
ReplyDeletecan you provide a regret-like plot (although we don't know if the magic guesser reward is possible, take that as an optimum for now)?
Also, a comparison to your non TT version would be great, i.e., if you can compare TT vs OT. I like both variants of keep 1st, but perfect recall appears to be dominating it (better in all cases), so it does not have the problem we expected (dip if splitting just before the end)?
If you perform a split, do you hand down the action tree of that split to the newly generated node?
Looking forward to the comparison and regret-like plots. I think you are on to something here, considering that the performance after one playout is already 'close' to optimal. Best regards, Michael
'If you perform a split, do you hand down the action tree of that split to the newly generated node?'
ReplyDeleteThat is exactly what the 'insertion' strategy does.
'I like both variants of keep 1st, but perfect recall appears to be dominating it (better in all cases), so it does not have the problem we expected (dip if splitting just before the end)?'
I think that with 'perfect recall', this problem does not occur - only with 'deletion' and 'insertion'.