NPTEL Video Course : NOC:Reinforcement Learning


Lecture 8 - Bandit Optimalities


            


DIGIMAT Learning Management Platform