← Blog
Reinforcement Learning
ML/dictionary/reinforcement-learning
Definition
ML where an agent takes actions, gets rewards, and learns a policy that maximises long-run reward. Behind AlphaGo and a key part of how modern LLMs are tuned to be helpful (RLHF).
Posts that use this term
- Inside AI: machine learning and deep learning
Open the AI umbrella. Machine learning is the part that learns from data. Deep learning is ML done with neural networks — and that's where today's models live.