What exactly is a 'Token'?

A token is the fundamental unit of data processed by an LLM. For English, one token is roughly 4 characters, or about 0.75 words. The word 'Apple' is one token, but a complex word like 'Unbelievable' might be broken into two or three tokens.

Why does the AI sometimes 'Hallucinate' or confidently lie?

Because it doesn't actually 'know' facts. It is a probability engine predicting the next most likely token based on its training data. If the highest probability next word creates a false sentence, the AI will confidently say it anyway.

What does 'RLHF' stand for?

Reinforcement Learning from Human Feedback. It is the crucial second stage of training where humans rank the AI's answers to teach it to be helpful, polite, and safe, rather than just spitting out random internet text.

HTML MASTER CLASS /// LEARN TAGS /// BUILD STRUCTURE /// SEMANTIC WEB /// HTML MASTER CLASS /// LEARN TAGS ///

⚡ Total XP: 0|💻 artificialintelligence XP: 0

Intro to LLMs in AI & Artificial Intelligence

Master the fundamental concepts of the Generative AI revolution. Learn about the transformer-based 'Large' architecture, understand the difference between pre-training and alignment, and explore the mechanics of tokens and context windows.

LOADING ENGINE...

Skill Matrix

UNLOCK NODES BY LEARNING NEW TAGS.

LLM Hub

Generative intelligence.

Quick Quiz //

At its most fundamental mathematical level, what is an LLM actually doing when it answers a question?

Large Language Models have redefined our relationship with technology. They are the first AI systems capable of general-purpose reasoning across almost any human domain.

1Large Language Models

Welcome to the age of Generative AI. Large Language Models (LLMs) like GPT-4 are the culmination of decades of deep learning research, trained on nearly all human-written knowledge.

These models are massive neural networks based on the Transformer architecture. They are called 'Large' because they have billions (or even trillions) of parameters—the internal mathematical 'knobs' that define their intelligence. This extreme scale is what allows them to demonstrate emergent reasoning capabilities.

editor.html

// Parameter Scale:
// GPT-2: 1.5 Billion
// GPT-3: 175 Billion
// GPT-4: > 1 Trillion
print("Loading model parameters...")

localhost:3000

2The Probability Engine

Despite their apparent intelligence, LLMs are fundamentally just incredibly advanced probability engines. Their only real job is to look at a sequence of text and predict the most likely next 'Token'.

When you ask an LLM a question, it doesn't 'think' like a human. It calculates the statistical probability for every single possible next token in its vocabulary, picks the best one, adds it to the sequence, and repeats the process. A token is typically a word or a sub-word unit.

editor.html

prompt = 'The best coding language is '
# The model calculates probability for every token
# [Python: 0.82, JS: 0.12, C++: 0.04]
next_token = model.generate(prompt)

localhost:3000

3Training: Reading the Internet

Creating an LLM requires two massive stages. The first is 'Pre-training'.

During pre-training, the model is fed essentially the entire internet—Wikipedia, Reddit, GitHub, books, and articles. It learns grammar, facts, coding syntax, and human logic. However, a purely pre-trained model isn't very useful; it just aggressively autocompletes text and can easily spout toxic or unhinged content.

editor.html

# Stage 1: Pre-training
# Objective: Read the entire internet.
# Result: A highly capable but chaotic text generator.

localhost:3000

4Alignment: RLHF

The second stage is what turns the chaotic text generator into a helpful assistant. This is called 'Alignment', often achieved through RLHF (Reinforcement Learning from Human Feedback).

Humans rate the model's responses, teaching it to favor helpful, honest, and harmless answers. This is why ChatGPT refuses to tell you how to pick a lock, and why it writes in a polite, conversational tone. Alignment is what makes the raw intelligence actually usable.

editor.html

# Stage 2: Alignment (RLHF)
# Objective: Learn to follow instructions and be safe.
# Result: A polite, helpful AI assistant.

localhost:3000

5The Context Window

Tokens are the fundamental currency of LLMs. Every model is constrained by a 'Context Window'—the maximum number of tokens it can hold in its short-term memory at any given time.

If you paste a 100-page document into a model with a small context window, it will literally 'forget' the first 50 pages because they get pushed out of its memory buffer. Managing this context window is the most critical skill for AI developers, especially when building advanced systems like RAG.

editor.html

context_window = 128000 # Tokens
# Approx 96,000 words.
# Too many tokens = The model 'forgets' the start.

localhost:3000

?Frequently Asked Questions

Pascual Vila

Frontend Instructor // Code Syllabus

Lesson Glossary

[01]LLM

Large Language Model: A massive neural network trained to understand and generate human-like text.

Code Preview

GPT-4 / Claude

[02]Token

The fundamental unit of text processed by an LLM (roughly 4 characters).

Code Preview

The Currency

[03]Context Window

The maximum number of tokens a model can process in one go.

Code Preview

Short-term Memory

[04]RLHF

Reinforcement Learning from Human Feedback: The process of aligning AI behavior with human preferences.

Code Preview

Alignment

[05]Hallucination

When an AI generates factually incorrect but plausible-sounding information.

Code Preview

Confident Error

Continue Learning

Foundations

K-Means Clustering

Read lesson→

Foundations

Linear Regression (Simple & Multiple)

Logistic Regression

Saving and Loading Models (Pickle, Joblib)

Read lesson→

Foundations

Using OpenAI / Anthropic APIs

Read lesson→

Foundations

Data Cleaning and Handling Missing Values

Read lesson→

Skill Matrix

LLM Hub

Interactive Challenges

1Large Language Models

2The Probability Engine

3Training: Reading the Internet

4Alignment: RLHF

5The Context Window

?Frequently Asked Questions

Lesson Glossary

[01]LLM

[02]Token

[03]Context Window

[04]RLHF

[05]Hallucination

Continue Learning

Article Contents