This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
m1r2017 [2025/12/22 05:28] 47.128.115.10 old revision restored (2025/07/15 16:45) |
m1r2017 [2026/01/04 03:27] (current) 85.8.117.185 old revision restored (2025/07/28 05:06) |
||
|---|---|---|---|
| Line 17: | Line 17: | ||
| * [[http:// | * [[http:// | ||
| * [[https:// | * [[https:// | ||
| - | + | ||
| + | === NN === | ||
| + | |||
| + | * GNG: original article by Fritzke [[https:// | ||
| + | * Fritzke's demo [[http:// | ||
| + | |||
| + | === Building representations in RL === | ||
| + | |||
| + | * Tile Coding and adaptive variants {{http:// | ||
| + | * Combining growing neural gas (GNG) and Q-Learning for adaptive discretization of the | ||
| + | * {{http:// | ||
| === Constructivist Learning === | === Constructivist Learning === | ||
| Line 29: | Line 39: | ||
| ===== Mementos | ===== Mementos | ||
| + | To read: | ||
| + | * https:// | ||
| + | * http:// | ||
| ==== Constructivist Learning ==== | ==== Constructivist Learning ==== | ||
| * [[compte-rendu-etat-art-these | State of the art (S. Mazac's thesis)]] | * [[compte-rendu-etat-art-these | State of the art (S. Mazac's thesis)]] | ||
| Line 36: | Line 49: | ||
| * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]] | * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]] | ||
| * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (L. Matignon's thesis)]] | * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (L. Matignon's thesis)]] | ||
| - | * [[memento-Independent-reinforcement-learners-cooperative-Markov-games: | + | * [[memento-Independent-reinforcement-learners-cooperative-Markov-games: |
| * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]] | * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]] | ||
| Line 48: | Line 61: | ||
| * [[memento-td-gng | TD-GNG]] | * [[memento-td-gng | TD-GNG]] | ||
| + | ===== Implementations | ||
| + | |||
| + | * [[realisation_SOM | SOM]] | ||
| + | * [[realisation_GNG | GNG]] | ||
| + | |||
| + | === Grid environment === | ||
| + | |||
| + | * [[realisation_env_grille_qlearning | Grid Environment (+QLearning)]] | ||
| + | * [[realisation_env_grille_qlearning_sma| Grid Environment MAS ILs (QLearning)]] | ||
| + | * [[realisation_env_grille_qlearning_sma_jsl | Grid Environment MAS JSLs (QLearning)]] | ||
| + | |||
| + | === Gym environment === | ||
| + | |||
| + | * [[realisation_env_mountainar_gym_qlearning | MountainCar Gym (QLearning)]] | ||
| + | * [[realisation_env_pendulum_gym_qlearning | Pendulum Gym (QLearning)]] | ||
| ===== Reflections | ===== Reflections | ||
| * [[reflexion-gng-qc | CQ-Learning and TD-GNG]] | * [[reflexion-gng-qc | CQ-Learning and TD-GNG]] | ||
| Line 53: | Line 81: | ||
| ===== Meeting minutes | ===== Meeting minutes | ||
| - | Folder containing | + | Folder containing the slides presented at the meetings: | ||
| - | [[https:// | + | [[https:// |
| * [[ reu02-03-17 |02/03/17]] | * [[ reu02-03-17 |02/03/17]] | ||
| - | * 14/03/17 | + | |
| + | * [[ reu24-03-17 |24/03/17]] | ||