Φιλτράρισμα ανά ακαδημαϊκό έτος:
|
|
|
|
|
|
|
|
- -
Bandit Problems (Ακαδ.Έτος 2020/2021)
-
Bandit Problems
Problemi di scelta multipla
|
TD-Learning
- -
TD-Learnig (Ακαδ.Έτος 2020/2021)
-
TD-Learning
Dopamina
Q-Learning
Reward Prediction Error
- -
TD-Learning e Q-Learning (Ακαδ.Έτος 2021/2022)
-
MDP
TD-Learning
Q-Learning
Metodi Attore-Critico
- -
Actor-Critic (Ακαδ.Έτος 2021/2022)
-
Actor-Critic
Reinforcement Learning
TD-Learning
|
|
|
|
|
|
|
|
|