Discounted rewards
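Discounted rewards are a way to evaluate a Markov decision process with infinitely many steps. Let
$r_i$ for $i = 0, 1, 2, \ldots$ be a sequence of rewards for this decision process. Let $0 \le \gamma < 1$ be some constant. Then we set the value of these choices as $\sum_{i=0}^{\infty} \gamma^i r_i$. If all
$|r_i|$ are bounded by some constant $M$, then we have $\left|\sum_{i=0}^{\infty} \gamma^i r_i\right| \le \sum_{i=0}^{\infty} \gamma^i M = \frac{M}{1-\gamma}$, and this is finite.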
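
As a rough numerical illustration, the following Python sketch evaluates the discounted sum over a finite prefix of a reward sequence and compares it with the geometric-series bound. The function names `discounted_value` and `value_bound`, the parameter name `gamma` for the discount constant, and the sample rewards are all assumptions made for this example, not part of the original text.

```python
def discounted_value(rewards, gamma):
    """Sum of gamma**i * rewards[i] over a finite list of rewards.

    This truncates the infinite sum; for bounded rewards and gamma < 1
    the neglected tail shrinks geometrically with the list length.
    """
    if not 0 <= gamma < 1:
        raise ValueError("gamma must satisfy 0 <= gamma < 1")
    value = 0.0
    # Work backwards: value_i = r_i + gamma * value_{i+1}
    for r in reversed(rewards):
        value = r + gamma * value
    return value


def value_bound(max_abs_reward, gamma):
    """Geometric-series bound M / (1 - gamma) on the discounted sum."""
    return max_abs_reward / (1.0 - gamma)


# Hypothetical example: a constant reward of 1 at every step.
rewards = [1.0] * 200
gamma = 0.9
v = discounted_value(rewards, gamma)
bound = value_bound(1.0, gamma)
print(v, bound)  # v stays below the bound and approaches it as the list grows
```

With a constant reward equal to the bound on the rewards, the truncated sum approaches the bound from below as more steps are included, which is the case where the inequality is tight.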