This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
m1r2017 [2025/02/24 22:40] 20.171.207.154 old revision restored (2025/02/24 17:29) |
m1r2017 [2025/03/07 18:03] (current) 47.128.53.25 old revision restored (2025/01/03 18:13) |
||
---|---|---|---|
Line 9: | Line 9: | ||
* cours David Silver : [[http:// | * cours David Silver : [[http:// | ||
* livre de Sutton mis à jour: [[https:// | * livre de Sutton mis à jour: [[https:// | ||
- | |||
- | * Multi-Agent RL : | ||
- | * en premier, lire le chapitre 4 de [[https:// | ||
- | * puis lire [[http:// | ||
- | |||
- | * Travaux de De Hauwere: Learning multi-agent state space representations | ||
- | * [[http:// | ||
- | * [[https:// | ||
- | |||
- | === Construction de représentations en RL === | ||
- | |||
- | * Tile Coding et versions adaptatives {{http:// | ||
- | * Combinaison de growing neural gaz GNG et Q-Learning pour discrétisation adaptative de l' | ||
- | * {{http:// | ||
=== App Constructiviste === | === App Constructiviste === | ||
Line 34: | Line 20: | ||
===== Mémentos | ===== Mémentos | ||
- | A lire : | + | === App Constructiviste === |
- | * https:// | + | |
- | * http:// | + | |
- | ==== App Constructiviste | + | |
* [[compte-rendu-etat-art-these | Etat de l'art (Thèse S. Mazac)]] | * [[compte-rendu-etat-art-these | Etat de l'art (Thèse S. Mazac)]] | ||
- | ==== RL ==== | + | ===RL et Inspirations Constructivistes=== |
- | === Multi-agents === | + | |
- | * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]] | + | |
- | * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (Thèse L. Matignon)]] | + | |
- | * [[memento-Independent-reinforcement-learners-cooperative-Markov-games: | + | |
- | * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]] | + | |
- | === Inspirations Constructivistes === | ||
* [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]] | * [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]] | ||
- | ==== Value function approximation ==== | + | === Value function approximation === |
* [[memento-Value-function-approximation | Quelques infos]] | * [[memento-Value-function-approximation | Quelques infos]] | ||
- | ==== Temporal Difference - Growing Neural Gas ==== | + | === Temporal Difference - Growing Neural Gas === |
- | * [[memento-td-gng | TD-GNG]] | + | |
- | ===== Réflexions | + | * [[memento-td-gng | TD-GNG]] |
- | * [[reflexion-gng-qc | CQ-Learning et TD-GNG]] | + | |
- | ===== Comptes-rendu de réunion | ||
- | Dossier contenant les slides présentés lors des réunions : | ||
- | [[https:// | ||
- | * [[ reu02-03-17 |02/03/17]] | ||
- | * [[ reu14-03-17 |14/03/17]] | ||
- | * [[ reu24-03-17 |24/03/17]] |