8 questions.
Q1: What does RL learn from? → Rewards and penalties from interaction with the environment (not labeled data).
Q2: What is the exploration vs exploitation tradeoff? → Try unknown actions (which might turn out better) vs stick with the best known action (safe).
Q3: Which ML approach did AlphaGo use? → Reinforcement Learning (+ supervis
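
The exploration vs exploitation tradeoff from Q2 is commonly illustrated with an epsilon-greedy rule: with a small probability the agent tries a random action, otherwise it picks the action with the best current estimate. The following is a minimal sketch, not part of the quiz itself. The multi-armed bandit setup, the Gaussian reward model, the function names `epsilon_greedy` and `run_bandit`, and the default `epsilon=0.1` are all illustrative assumptions.

```python
import random


def epsilon_greedy(q_values, epsilon, rng):
    """Pick an action index: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        # Explore: choose any action uniformly at random.
        return rng.randrange(len(q_values))
    # Exploit: choose the action with the highest estimated value.
    return max(range(len(q_values)), key=lambda a: q_values[a])


def run_bandit(true_means, steps=2000, epsilon=0.1, seed=0):
    """Simulate a bandit whose rewards are assumed Gaussian around true_means."""
    rng = random.Random(seed)
    q_values = [0.0] * len(true_means)  # running reward estimate per action
    counts = [0] * len(true_means)      # how often each action was tried
    for _ in range(steps):
        action = epsilon_greedy(q_values, epsilon, rng)
        reward = rng.gauss(true_means[action], 1.0)  # noisy reward signal
        counts[action] += 1
        # Incremental mean update: learn from rewards, not from labels.
        q_values[action] += (reward - q_values[action]) / counts[action]
    return q_values, counts


if __name__ == "__main__":
    estimates, pulls = run_bandit([0.2, 1.0, 0.5])
    print("estimated values:", estimates)
    print("times each action was tried:", pulls)
```

Setting `epsilon` to 0 makes the agent purely exploit its current estimates, while setting it to 1 makes it purely explore. Values in between trade some short-term reward for the chance of finding a better action.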