====== Stage M1R 2017 ====== ===== Pointeurs ===== === RL === * cours M1: {{m1r2017:cm1m22016-17.pdf|MDP et planif}}, {{m1r2017:cm4a2016rl.pdf|RL}} * cours David Silver : [[http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html]] * livre de Sutton mis à jour: [[https://webdocs.cs.ualberta.ca/~sutton/book/bookdraft2016sep.pdf]] === App Constructiviste === * Thèse S. Mazac: [[https://tel.archives-ouvertes.fr/tel-01310583/file/TH2015MazacSebastien.pdf]] === RL et Inspirations Constructivistes === * Intrinsically Motivated RL [Singh2005] [[https://web.eecs.umich.edu/~baveja/Papers/FinalNIPSIMRL.pdf]] ===== Mémentos ===== === App Constructiviste === * [[compte-rendu-etat-art-these | Etat de l'art (Thèse S. Mazac)]] ===RL et Inspirations Constructivistes=== * [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]] === Value function approximation === * [[memento-Value-function-approximation | Quelques infos]] === Temporal Difference - Growing Neural Gas === * [[memento-td-gng | TD-GNG]] ===== Comptes-rendu de réunion ===== * [[ reu02-03-17 | 02/03/17]]