•
Deriving Sutton and Barton's UCB Bandit Algoritmhs
4 min read · December 28, 2023
2023 · math statistics reinforcement-learning