m1r2017

Differences

This shows you the differences between two versions of the page.

m1r2017 [2025/07/19 13:39]
20.171.207.28 old revision restored (2025/07/18 07:16)
m1r2017 [2025/07/20 02:50] (current)
20.171.207.28 old revision restored (2025/07/20 00:33)
Line 17: Line 17:
       * [[http://www.aamas-conference.org/Proceedings/aamas2010/pdf/01%20Full%20Papers/15_02_FP_0421.pdf]]       * [[http://www.aamas-conference.org/Proceedings/aamas2010/pdf/01%20Full%20Papers/15_02_FP_0421.pdf]]
       * [[https://ai.vub.ac.be/ALA2012/downloads/paper5.pdf]]       * [[https://ai.vub.ac.be/ALA2012/downloads/paper5.pdf]]
-  +   
-=== NN === +
- +
-  * GNG, original article by Fritzke [[https://www.cs.swarthmore.edu/~meeden/DevelopmentalRobotics/fritzke95.pdf]] +
-  * Fritzke's demo [[http://www.demogng.de]] +
-   +
-=== Building representations in RL === +
- +
-  * Tile Coding and adaptive versions {{http://www.cs.utexas.edu/~ai-lab/pubs/whitesontr07.ps|adaptative_tile_coding [Whiteson,2007] }} and {{http://www.aaai.org/ocs/index.php/WS/AAAIW10/paper/viewFile/2068/2422|evolutionary_tile_coding [Lin,2010] }} +
-  * Combination of Growing Neural Gas (GNG) and Q-Learning for adaptive discretization of the state space: http://liris.cnrs.fr/sasem/lib/exe/fetch.php?media=m1r2017:vieira2013tdgngoriginal.pdf +
-  * {{http://nn.cs.utexas.edu/downloads/papers/provost.connsci06.pdf|Self-Organizing Distinctive-State Abstraction (SODA) [Kuipers,2006] }}+
  
 === Constructivist Learning === === Constructivist Learning ===
Line 39: Line 29:
 ===== Mementos  ===== ===== Mementos  =====
  
-To read: 
-   * https://ai.vub.ac.be/ALA2012/downloads/paper4.pdf 
-   * http://ir.library.oregonstate.edu/xmlui/bitstream/handle/1957/39192/HolmesParkerChristopherG2013.pdf;sequence=1 
 ==== Constructivist Learning ==== ==== Constructivist Learning ====
    * [[compte-rendu-etat-art-these | State of the art (S. Mazac's thesis)]]    * [[compte-rendu-etat-art-these | State of the art (S. Mazac's thesis)]]
Line 49: Line 36:
    * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]]    * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]]
    * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (L. Matignon's thesis)]]    * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (L. Matignon's thesis)]]
-   * [[memento-Independent-reinforcement-learners-cooperative-Markov-games:-a-survey-regarding-coordination-problems | Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems]]+   * [[memento-Independent-reinforcement-learners-cooperative-Markov-games:-a-survey-regarding-coordination-problems | Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems (to be finished)]]
    * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]]    * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]]
  
Line 61: Line 48:
    * [[memento-td-gng | TD-GNG]]    * [[memento-td-gng | TD-GNG]]
  
-===== Implementations  ===== 
- 
-   * [[realisation_SOM | SOM]] 
-   * [[realisation_GNG | GNG]] 
-   * [[realisation_env_grille_qlearning | Grid environment (+QLearning)]] 
-   * [[realisation_env_mountainar_gym_qlearning | MountainCar Gym (Qlearning)]] 
-   * [[realisation_env_pendulum_gym_qlearning | Pendulum Gym (Qlearning)]] 
 ===== Reflections  ===== ===== Reflections  =====
    * [[reflexion-gng-qc | CQ-Learning and TD-GNG]]    * [[reflexion-gng-qc | CQ-Learning and TD-GNG]]
Line 73: Line 53:
 ===== Meeting minutes  ===== ===== Meeting minutes  =====
  
-Folder containing the slides presented at the meetings: +Folder containing all the slides presented at the meetings: 
-[[https://drive.google.com/drive/folders/0B7dh6En0bP-KakRNYllvOVN3N2c?usp=sharing | slides]]+[[https://drive.google.com/drive/folders/0B7dh6En0bP-KakRNYllvOVN3N2c | slides]]
  
    * [[ reu02-03-17 |02/03/17]]    * [[ reu02-03-17 |02/03/17]]
-   [[ reu14-03-17 |14/03/17]] +   * 14/03/17
-   * [[ reu24-03-17 |24/03/17]]+
m1r2017.1752925162.txt.gz · Last modified: 2025/07/19 13:39 by 20.171.207.28