Tags | Shivam Shakti

Contents

attention
bandits
computer vision
rl

attention

Paper Notes: Offline Reinforcement Learning • Dec 16, 2020

Paper Notes: End-to-End Object Detection with Transformers • Jul 20, 2020

Paper Notes: Attention Is All You Need • Jul 20, 2020

bandits

Multi Armed Bandits • Jul 5, 2020

computer vision

Paper Notes: End-to-End Object Detection with Transformers • Jul 20, 2020

rl

Paper Notes: Generalized Advantage Estimation • Jul 1, 2020

Paper Notes: Proximal Policy Optimization • Jun 15, 2020

Paper Notes: Asynchronous Advanatage Actor Critic (A3C) • Apr 25, 2020

Paper Notes: Soft Actor Critic • Jan 1, 2020