Differences

This shows you the differences between two versions of the page.

--- m1r2017 [2025/11/07 14:18]
160.225.176.125 old revision restored (2025/10/16 02:06)
+++ m1r2017 [2025/11/13 02:35] (current)
216.73.216.15 old revision restored (2025/11/12 00:58)
@@ Line 9: / Line 9: @@
   * David Silver's course: [[http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html]]
   * Updated Sutton book: [[https://webdocs.cs.ualberta.ca/~sutton/book/bookdraft2016sep.pdf]]
-  * Multi-Agent RL:
-      * first, read chapter 4 of [[https://tel.archives-ouvertes.fr/file/index/docid/362529/filename/these_matignon.pdf]]
-      * then read [[http://liris.cnrs.fr/laetitia.matignon/index/matignon2012KER.pdf]]
-  * Work by De Hauwere: Learning multi-agent state space representations
-      * [[http://www.aamas-conference.org/Proceedings/aamas2010/pdf/01%20Full%20Papers/15_02_FP_0421.pdf]]
-      * [[https://ai.vub.ac.be/ALA2012/downloads/paper5.pdf]]
-=== Building representations in RL ===
-Generalization-based approaches (//Apply knowledge to unseen but similar states//)
-  * Tile Coding and adaptive variants {{http://www.cs.utexas.edu/~ai-lab/pubs/whitesontr07.ps|adaptative_tile_coding [Whiteson,2007] }} and {{wiki:consap:biblio:lin_wright_-2010-_evolutionary_tile_coding.pdf|evolutionary_tile_coding [Lin,2010] }}
-  * {{wiki:consap:biblio:bauman--gngq.pdf | GNG Q [Baumann,2012]}} + {{wiki:consap:biblio:bauman-2013-gngqpwt.pdf | slides }}
-  * Vector Quantization  {{wiki:consap:biblio:lee-2004-tdavq.pdf | TD AVQ [Lee2004]}}
-  * Combination of Growing Neural Gas (GNG) and Q-Learning for adaptive discretization of the state space: {{ wiki:consap:biblio:vieira2013tdgng.pdf|TD-GNG [Vieira, Adeodato, Goncalves,2013]}}
-  * {{wiki:consap:biblio:kuipers_provost-2006-developing_navigation_behavior_through_self-organizing_distinctive_state_abstraction.pdf |Self-Organizing Distinctive-State Abstraction (SODA) [Kuipers,2006] }}
 === Constructivist Learning ===
   * S. Mazac's thesis: [[https://tel.archives-ouvertes.fr/tel-01310583/file/TH2015MazacSebastien.pdf]]
-=== RL and Constructivist Inspirations ===
-   * Intrinsically Motivated RL [Singh2005] [[https://web.eecs.umich.edu/~baveja/Papers/FinalNIPSIMRL.pdf]]
-===== Memos =====
-To read:
-   * https://ai.vub.ac.be/ALA2012/downloads/paper4.pdf
-   * http://ir.library.oregonstate.edu/xmlui/bitstream/handle/1957/39192/HolmesParkerChristopherG2013.pdf;sequence=1
-==== Constructivist Learning ====
-   * [[compte-rendu-etat-art-these | State of the art (S. Mazac's thesis)]]
-==== RL ====
-=== Multi-agent ===
-   * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]]
-   * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (L. Matignon's thesis)]]
-   * [[memento-Independent-reinforcement-learners-cooperative-Markov-games:-a-survey-regarding-coordination-problems | Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems (to be finished)]]
-   * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Interaction Multi-Agent Systems]]
-=== Constructivist Inspirations ===
-   * [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]]
-==== Value function approximation ====
-   * [[memento-Value-function-approximation | Some notes]]
-==== Temporal Difference - Growing Neural Gas ====
-   * [[memento-td-gng | TD-GNG]]
-===== Reflections =====
-   * [[reflexion-gng-qc | CQ-Learning et TD-GNG]]
-===== Meeting minutes =====
-Folder containing the slides presented at the meetings:
-[[https://drive.google.com/drive/folders/0B7dh6En0bP-KakRNYllvOVN3N2c?usp=sharing | slides]]
-   * [[ reu02-03-17 |02/03/17]]
-   * [[ reu14-03-17 |14/03/17]]
