Book/Paper Notes
Structured reading notes organized by book and topic series.
-
A complete walkthrough from the multi-armed bandit problem to modern deep RL and large-language-model alignment techniques (PPO, GRPO, DPO).
View Series -
In-depth notes on individual machine learning papers and techniques, from modern optimizers to causal inference.
View Series -
Strategic interactions explored from Nash Equilibrium and mixed strategies through extensive-form games, Bayesian games, and cooperative bargaining.
View Series