Filtra per anno accademico:
|
|
|
|
|
|
|
|
- n. 7.1 -
Bandit Problems (A.A. 2020/2021)
-
Bandit Problems
Problemi di scelta multipla
|
TD-Learning
- n. 8.1 -
TD-Learnig (A.A. 2020/2021)
-
TD-Learning
Dopamina
Q-Learning
Reward Prediction Error
- n. 8.2 -
TD-Learning e Q-Learning (A.A. 2021/2022)
-
MDP
TD-Learning
Q-Learning
Metodi Attore-Critico
- n. 8.3 -
Actor-Critic (A.A. 2021/2022)
-
Actor-Critic
Reinforcement Learning
TD-Learning
|
|
|
|
|
|
|
|
|