Posts

Paper Notes: Offline Reinforcement Learning
Landscape of Open Problems in Offline RL
Dec 16, 2020
Paper Notes: End-to-End Object Detection with Transformers
Notes from DETR paper
Jul 20, 2020
Paper Notes: Attention Is All You Need
Review of Attention architecture
Jul 20, 2020
Multi Armed Bandits
Overview of Multi Armed Bandit techniques
Jul 5, 2020
Paper Notes: Generalized Advantage Estimation
Notes from GAE paper
Jul 1, 2020
Paper Notes: Proximal Policy Optimization
Notes on PPO
Jun 15, 2020
Paper Notes: Asynchronous Advanatage Actor Critic (A3C)
Notes on A3C
Apr 25, 2020
Paper Notes: Soft Actor Critic
Notes on SAC
Jan 1, 2020