Computer Science - Artificial Intelligence - Reinforcement learning - RL Concepts Quiz
8 questions. Q1: RL learns from? → Rewards and penalties from interaction with environment (not labeled data). Q2: Exploration vs Exploitation tradeoff? → Try unknown actions (might find better) vs stick with known best (safe). Q3: AlphaGo used which ML approach? → Reinforcement Learning (+ supervis