Week 2 - Reinforcement learning basics

When evaluating different learners we normally evaluate them by the policy they produce. However, different methods of learning can create policies in different ways - therefore we may need to also consider:

Computation complexity: The time it takes for that learner to come up with that policy.
Sample complexity: The amount of interactions with its environment it needs to come up with that policy.

We don’t normally think about space complexity as with other subjects - as that is not normally a limiting factor.

Alex's Notes

Explorer

Week 2 - Reinforcement learning basics

Week 2 - Reinforcement learning basics

Graph View

Backlinks