I learned very early the difference between knowing the name of something and knowing something.

Richard Feynman

Introduction to Markov Decision Processes

I provide a brief introduction to MDPs.

Attention in the age of ADHD

I briefly explain what attention and self-attention are.

Upper Confidence Bound

I derive Hoeffding's bound and analyze the UCB1 algorithm.

Epsilon-Greedy Strategy

I explain the exploration-exploitation trade-off and the epsilon-greedy strategy.