What is Reinforcement Learning? | TapUp Digital Glossary

Reinforcement learning is a learning method in which an AI takes actions within an environment and aims to maximize the rewards it receives as a result.
The AI starts with no knowledge of what the correct actions are.

Instead of being told the right answers directly, the AI receives feedback — in the form of scores or points — that signals whether each action was good or bad.
Through repeated trial and error, it gradually learns which sequence of actions leads to higher rewards.

Unlike supervised learning, reinforcement learning does not rely on a pre-existing set of correct answers. This means the AI can sometimes discover unexpected strategies that even humans had not thought of.
It can also learn sophisticated decision-making — for example, deliberately choosing a short-term disadvantageous action in order to maximize the total reward achieved in the end.

This approach is used in fields that require complex judgment, such as training robots to walk, playing board games like Go, and developing autonomous driving technology.
A key strength is that even in situations where preparing all the correct answers in advance would be difficult, the AI can still discover better approaches on its own.

Reinforcement Learning

In Simple Terms

Behind the Name

Take a Closer Look!