This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
memento-value-function-approximation [2025/12/27 23:17] 66.249.70.198 old revision restored (2025/08/25 12:39) |
memento-value-function-approximation [2025/12/29 05:28] (current) 106.49.60.24 old revision restored (2025/11/19 22:52) |
||
|---|---|---|---|
| Line 52: | Line 52: | ||
| * Optimise le MSE (mean squarred error) entre les cibles du QNetwork et du QLearning | * Optimise le MSE (mean squarred error) entre les cibles du QNetwork et du QLearning | ||
| * Utilise une variante de la descente de gradient stochastique | * Utilise une variante de la descente de gradient stochastique | ||
| - | + | | |