This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
m1r2017 [2025/09/15 05:10] 66.249.68.35 old revision restored (2025/08/23 10:02) |
m1r2017 [2025/09/23 23:51] (current) 216.73.216.184 old revision restored (2025/09/14 18:59) |
||
---|---|---|---|
Line 18: | Line 18: | ||
* [[https:// | * [[https:// | ||
+ | === Construction de représentations en RL === | ||
+ | |||
+ | * Tile Coding et versions adaptatives {{http:// | ||
+ | * Combinaison de growing neural gaz GNG et Q-Learning pour discrétisation adaptative de l' | ||
+ | * {{http:// | ||
=== App Constructiviste === | === App Constructiviste === | ||
Line 29: | Line 34: | ||
===== Mémentos | ===== Mémentos | ||
+ | A lire : | ||
+ | * https:// | ||
+ | * http:// | ||
==== App Constructiviste ==== | ==== App Constructiviste ==== | ||
* [[compte-rendu-etat-art-these | Etat de l'art (Thèse S. Mazac)]] | * [[compte-rendu-etat-art-these | Etat de l'art (Thèse S. Mazac)]] | ||
Line 36: | Line 44: | ||
* [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]] | * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]] | ||
* [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (Thèse L. Matignon)]] | * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (Thèse L. Matignon)]] | ||
- | * [[memento-Independent-reinforcement-learners-cooperative-Markov-games: | + | * [[memento-Independent-reinforcement-learners-cooperative-Markov-games: |
* [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]] | * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]] | ||
Line 52: | Line 60: | ||
===== Comptes-rendu de réunion | ===== Comptes-rendu de réunion | ||
+ | |||
+ | Dossier contenant les slides présentés lors des réunions : | ||
+ | [[https:// | ||
* [[ reu02-03-17 |02/03/17]] | * [[ reu02-03-17 |02/03/17]] | ||
- | * 14/03/17 | + | |
+ | * [[ reu24-03-17 |24/03/17]] |