Back to list
Lv.3

Transformer

Transformer

An AI model architecture that uses attention to efficiently calculate the connections and relationships between data.

In Simple Terms

A Transformer is the fundamental AI architecture for making sense of individual words as connected sentences and converting text into other languages. It's used in translation services that convert text between languages and in chat AI that generates natural responses. It works by determining which words in a sentence matter most, adjusting the weight of information to process text efficiently.

Behind the Name

The word "Transformer" combines "transform" (to change something into a different form) with the suffix "-er," which turns a verb into a noun meaning something that performs that action. So it literally means "something that transforms" — something that converts data from one form into another.

Take a Closer Look!

Transformer is a deep learning model developed to help AI understand and generate human language.
Unlike approaches that read text sequentially word by word, its standout feature is the ability to process an entire input in parallel all at once.
This makes it well-suited for handling large amounts of data efficiently, allowing it to mathematically compute connections between words even across long sentences.

More specifically, it uses a mechanism called attention to calculate which parts of a sentence deserve the most focus.
For example, in a sentence like "He wrote with a pen," the model determines that "wrote" is strongly connected to both "he" and "pen."
By converting the strength of these connections into numerical values, it becomes capable of generating natural, human-like text.

This technology is applied not only to translation and text generation, but also across a wide range of fields including image and audio analysis.
In short, it's built around the ability to efficiently identify patterns in information and convert them into a different form — making it one of the key building blocks behind large language models and many other generative AI systems.