m1r2017

Differences

This shows you the differences between two versions of the page.

m1r2017 [2025/11/11 21:28]
177.189.112.64 old revision restored (2025/08/10 05:13)
m1r2017 [2025/11/13 02:35] (current)
216.73.216.15 old revision restored (2025/11/12 00:58)
Line 9: Line 9:
   * David Silver's course: [[http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html]]   * David Silver's course: [[http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html]]
   * Sutton's book, updated draft: [[https://webdocs.cs.ualberta.ca/~sutton/book/bookdraft2016sep.pdf]]   * Sutton's book, updated draft: [[https://webdocs.cs.ualberta.ca/~sutton/book/bookdraft2016sep.pdf]]
- 
-  * Multi-Agent RL:
-      * first, read chapter 4 of [[https://tel.archives-ouvertes.fr/file/index/docid/362529/filename/these_matignon.pdf]]
-      * then read [[http://liris.cnrs.fr/laetitia.matignon/index/matignon2012KER.pdf]]
- 
-  * Work by De Hauwere: Learning multi-agent state space representations
-      * [[http://www.aamas-conference.org/Proceedings/aamas2010/pdf/01%20Full%20Papers/15_02_FP_0421.pdf]] 
-      * [[https://ai.vub.ac.be/ALA2012/downloads/paper5.pdf]] 
-    
  
 === Constructivist Learning === === Constructivist Learning ===
   * S. Mazac's thesis: [[https://tel.archives-ouvertes.fr/tel-01310583/file/TH2015MazacSebastien.pdf]]   * S. Mazac's thesis: [[https://tel.archives-ouvertes.fr/tel-01310583/file/TH2015MazacSebastien.pdf]]
- 
-=== RL and Constructivist Inspirations ===
-   * Intrinsically Motivated RL [Singh2005] [[https://web.eecs.umich.edu/~baveja/Papers/FinalNIPSIMRL.pdf]] 
- 
- 
- 
-===== Memos =====
- 
-==== Constructivist Learning ====
-   * [[compte-rendu-etat-art-these | State of the art (S. Mazac's thesis)]]
- 
-==== RL ==== 
-   * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations]] 
- 
-=== Constructivist Inspirations ===
- 
-   * [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]] 
- 
-==== Value function approximation ==== 
- 
-   * [[memento-Value-function-approximation | Some notes]]
- 
-==== Temporal Difference - Growing Neural Gas ==== 
- 
-   * [[memento-td-gng | TD-GNG]] 
- 
-===== Reflections =====
-   * [[reflexion-gng-qc | CQ-Learning and TD-GNG]]
- 
-===== Meeting minutes =====
- 
-   * [[reu02-03-17 | 02/03/17]]
-   * 14/03/17 
m1r2017.1762892918.txt.gz · Last modified: 2025/11/11 21:28 by 177.189.112.64