Site Tools


Hotfix release available: 2024-02-06b "Kaos". upgrade now! [55.2] (what's this?)
Hotfix release available: 2024-02-06a "Kaos". upgrade now! [55.1] (what's this?)
New release available: 2024-02-06 "Kaos". upgrade now! [55] (what's this?)
Hotfix release available: 2023-04-04b "Jack Jackrum". upgrade now! [54.2] (what's this?)
Hotfix release available: 2023-04-04a "Jack Jackrum". upgrade now! [54.1] (what's this?)
New release available: 2023-04-04 "Jack Jackrum". upgrade now! [54] (what's this?)
Hotfix release available: 2022-07-31b "Igor". upgrade now! [53.1] (what's this?)
Hotfix release available: 2022-07-31a "Igor". upgrade now! [53] (what's this?)
New release available: 2022-07-31 "Igor". upgrade now! [52.2] (what's this?)
New release candidate 2 available: rc2022-06-26 "Igor". upgrade now! [52.1] (what's this?)
New release candidate available: 2022-06-26 "Igor". upgrade now! [52] (what's this?)
Hotfix release available: 2020-07-29a "Hogfather". upgrade now! [51.4] (what's this?)
realisation_env_pendulum_gym_qlearning

Pendulum Gym (Qlearning)

Lien vers le github : https:github.com/openai/gym/wiki/Pendulum-v0 Le pendule est sur 3 dimensions continues. L'action à renvoyer est aussi continue. La Qtable serait d'environ 21 * 21 * 161 = 70 000 états en arrondissant les 3 dimensions à 1 décimal. Sans compter les 21 actions possibles en discrétisant de la même manière. ( environ 1 470 000 états-actions). Je ne suis pas sur de pouvoir faire converger ca un jour :-| J'essaie pour le moment de faire marcher le Mountain Car qui me parait plus abordable.

realisation_env_pendulum_gym_qlearning.txt · Last modified: 2024/04/27 20:26 by 47.128.114.235