Week 2 - Reinforcement learning basics

When comparing learners, we normally judge them by the policy they produce. However, different learning methods arrive at a policy in different ways, so we may also need to consider:

  • Computational complexity: The time the learner takes to come up with that policy.
  • Sample complexity: The number of interactions with its environment the learner needs to come up with that policy.
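
As a rough illustration of how these two quantities could be measured, here is a minimal sketch. Everything in it is an assumption made for the example and does not come from the course material: the toy `ChainEnv` environment, the tabular Q-learning learner, its hyperparameters, and the `evaluate_learner` helper. Computational complexity is approximated by wall-clock time, and sample complexity by counting calls to the environment's `step` method.

```python
import random
import time


class ChainEnv:
    """Hypothetical 5-state chain: start in state 0, reward 1 on reaching the last state."""

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.state = 0
        self.interactions = 0  # counts every step: our proxy for sample complexity

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # action 0 moves left, action 1 moves right; the walls are reflecting
        self.interactions += 1
        if action == 1:
            self.state = min(self.state + 1, self.n_states - 1)
        else:
            self.state = max(self.state - 1, 0)
        done = self.state == self.n_states - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def q_learning(env, episodes=200, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration; returns a greedy policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(env.n_states)]
    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            # explore with probability epsilon, and break ties randomly
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1
            s2, r, done = env.step(a)
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
    return [0 if q[s][0] > q[s][1] else 1 for s in range(env.n_states)]


def evaluate_learner(learner, env):
    """Return the learned policy, the time taken, and the interactions used."""
    start = time.perf_counter()
    policy = learner(env)
    elapsed = time.perf_counter() - start
    return policy, elapsed, env.interactions


if __name__ == "__main__":
    policy, elapsed, samples = evaluate_learner(q_learning, ChainEnv())
    print("policy:", policy)
    print("time (s):", elapsed)
    print("environment interactions:", samples)
```

Two learners that end up with the same policy can then be compared on the other two numbers: one may need far fewer interactions but more computation per interaction, or the reverse.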

Unlike in other subjects, we don’t normally consider space complexity, as memory is rarely the limiting factor.