Alex's Notes

❯

❯

Optimistic exploration

Optimistic exploration

Apr 06, 20241 min read

machine-learning

Optimistic exploration

Optimistic exploration is a way of choosing actions in Q-learning. Here we set the initial values of our to all be very high and we always pick the action that maximises . Then in uncertainty it will explore actions it does not know about.

Graph View

Backlinks

Week 12 - Reinforcement learning

Created with Quartz v4.5.1 © 2025

GitHub
Discord Community