Hotfix release available: 2025-05-14b "Librarian". upgrade now! [56.2] (what's this?)
Hotfix release available: 2025-05-14a "Librarian". upgrade now! [56.1] (what's this?)
New release available: 2025-05-14 "Librarian". upgrade now! [56] (what's this?)
Hotfix release available: 2024-02-06b "Kaos". upgrade now! [55.2] (what's this?)
Hotfix release available: 2024-02-06a "Kaos". upgrade now! [55.1] (what's this?)
New release available: 2024-02-06 "Kaos". upgrade now! [55] (what's this?)
Hotfix release available: 2023-04-04b "Jack Jackrum". upgrade now! [54.2] (what's this?)
Hotfix release available: 2023-04-04a "Jack Jackrum". upgrade now! [54.1] (what's this?)
New release available: 2023-04-04 "Jack Jackrum". upgrade now! [54] (what's this?)
Hotfix release available: 2022-07-31b "Igor". upgrade now! [53.1] (what's this?)
Hotfix release available: 2022-07-31a "Igor". upgrade now! [53] (what's this?)
New release available: 2022-07-31 "Igor". upgrade now! [52.2] (what's this?)
New release candidate 2 available: rc2022-06-26 "Igor". upgrade now! [52.1] (what's this?)
New release candidate available: 2022-06-26 "Igor". upgrade now! [52] (what's this?)
Hotfix release available: 2020-07-29a "Hogfather". upgrade now! [51.4] (what's this?)
memento-intrinsically-motivated-rl

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
memento-intrinsically-motivated-rl [2025/09/12 17:03]
66.249.68.35 old revision restored (2025/08/27 20:26)
memento-intrinsically-motivated-rl [2025/09/14 08:48] (current)
66.249.68.35 old revision restored (2025/07/21 19:03)
Line 16: Line 16:
    * Les modèles d'options : description probabiliste des effets de exécution de l'option. Cela donne la probabilité que l'option se termine sur un autre état que celui qui est prévu.    * Les modèles d'options : description probabiliste des effets de exécution de l'option. Cela donne la probabilité que l'option se termine sur un autre état que celui qui est prévu.
    * La méthode d'apprentissage intra-option : permet l'actualisation des politiques de plusieurs options pendant que l'agent interagi avec l'environnement.    * La méthode d'apprentissage intra-option : permet l'actualisation des politiques de plusieurs options pendant que l'agent interagi avec l'environnement.
 +
 +// Pourquoi utiliser le QLearning et le MDP au lieu de l'un ou l'autre ?
  
  
memento-intrinsically-motivated-rl.1757689439.txt.gz · Last modified: 2025/09/12 17:03 by 66.249.68.35