Logo

Training/Pre-training

Learning Methods
Letter: T

The process by which an AI model learns by analyzing massive amounts of data.

Detailed Definition

Training is the process by which an AI model learns by analyzing massive amounts of data, such as large portions of the internet, books, and other media. For LLMs, the core training method is "next-token prediction," where the model learns to predict the next word in a sequence. As it trains, the model adjusts millions of internal settings called "weights." This process helps the model improve its understanding of facts, grammar, reasoning, and language. Training state-of-the-art models can take months and cost hundreds of millions of dollars.