What is an AI agent?

The honest definition, why it matters, and how an agent differs from a chatbot you have already used.

20 minutesNo code requiredBrowser only

By the end of this lesson, you will:

Have an honest, working definition of "AI agent" you can use in conversation.
Be able to look at a system and decide whether it is an agent or not.
Understand why the agent idea has become important now, not five years ago.

The definition that actually works

An AI agent is a language model that takes actions in a loop to reach a goal it has been given.

That sentence has four things in it, and each one matters.

A language model. The brain is a large language model — Claude, GPT, Gemini, Llama, or any of the others. Without that brain, the system is not an agent. It is a script. The model is what decides what to do at each step.

Takes actions. An agent does not just answer your question. It does things in the world: searches the web, reads a document, sends an email, runs a calculation, books a meeting. Each of these is an action, and each action changes the state of something outside the agent.

In a loop. The agent does not act once and stop. It acts, observes what happened, decides what to do next, acts again, and so on. The loop continues until the goal is reached or until the agent gives up.

To reach a goal. Someone — usually you — has given the agent something to achieve. "Research three competitors and summarise their pricing pages" is a goal. "What is the capital of France?" is not — it is a question with one answer.

What an agent is not

It is useful to be specific about what does not count. A chatbot you have a conversation with is not an agent, even if it is powered by Claude or GPT. The chatbot answers each message in turn. It does not take actions in the world between messages, and it does not pursue a goal across messages. Each message is its own little task.

A script that uses an LLM to summarise a document is also not an agent. The LLM is being used as a clever function — text in, text out — but it is not deciding what to do next. The script decides; the LLM just helps.

A traditional automation tool — Zapier, IFTTT, or a cron job — is not an agent either. It follows rules a person wrote in advance. It does not decide.

Why this matters now and not five years ago

The agent idea is old. People have been writing about software agents since the 1990s. What changed is that language models became reliable enough to be the brain.

A few years ago, if you asked a model to plan a multi-step task, you got something incoherent. Now, the better models produce plans that often work on the first try. They can be given a tool — say, a search function — and they use it sensibly. They can recover when something goes wrong. They are not perfect, and we will spend a whole lesson on the ways they fail. But they are good enough that agents are a real engineering category, not a thought experiment.

This shift is the reason "agent" is everywhere in 2026. The brain finally works. The plumbing — function-calling, structured outputs, tool use, memory — has caught up. The remaining work is figuring out how to deploy these things responsibly.

A worked example: research vs answer

Imagine you ask Claude two questions, one after the other.

You What is the capital of France? Claude The capital of France is Paris.

That was not an agent. Claude answered from what it already knew, in one turn, with one response. There was no loop, no action, no goal that needed pursuing.

Now imagine the second question.

You I am visiting Paris next Tuesday. Find me three vegetarian restaurants in the 11th arrondissement that are open for lunch and within a 10 minute walk of Place de la République, and write a short note about each one.

If Claude tries to answer that directly from training data, the answer will probably be wrong. Restaurants close, change owners, and move. Some of them never existed. Claude does not actually know whether each restaurant is open on the Tuesday in question.

To answer that question well, the system needs to: search the web for current information about restaurants in that arrondissement, check the opening hours, check distances from a specific point, filter to vegetarian options, and write up the findings. That is several actions, with a loop, in pursuit of a goal. That is an agent.

The first task is something a model does on its own. The second is something an agent does, with the model at the centre.

The mental model to take with you

For the rest of this course, every time you see the word "agent", picture this: a language model sitting inside a loop, with access to one or more tools, working towards a goal you have given it. When you build your own agent in the lessons that follow, you will be building exactly this — and nothing more complicated.

Aside · Why people disagree about the word

You will hear engineers and researchers argue about what counts as a "real" agent. Some require autonomy (the agent has to start tasks on its own). Some require persistence (it remembers across sessions). Some require a body or environment. These are useful distinctions but they are not necessary for the working definition. For this course, an agent is an LLM that takes actions in a loop towards a goal. If someone gives you a stricter definition, they are not wrong — they are just being more precise about a sub-category.

Exercise 1.1 · 10 minutes

Is it an agent?

Below are six systems. For each one, decide whether it is an agent by our definition. There is a model answer at the bottom of this section, but try it yourself first.

A web form that takes your details and emails a confirmation back to you.
A customer support chatbot powered by GPT-4 that answers your questions about an airline's baggage policy.
A coding assistant that, when you describe a feature, opens your code editor, edits the right files, runs the tests, and if the tests fail, edits the files again.
A spam filter that uses a small machine-learning model to decide which emails are spam.
A research tool you give a topic to, and it searches the web, reads ten articles, summarises them, and writes you a report.
A car's lane-keeping assist system that uses a vision model to keep you in lane.

Tools required: none. Just thinking.

Model answers

Web form — not an agent. No LLM, no loop, no goal pursuit. Just rules.
Customer support chatbot — not an agent in our sense. It is an LLM, but it answers one question at a time without taking actions in the world. Some chatbots do call tools (like looking up your booking), and at that point they become agents — but a pure Q&A chatbot is not.
Coding assistant — yes, agent. LLM brain, multiple actions (edits, runs tests), a loop (re-edit if tests fail), in pursuit of a goal (feature works).
Spam filter — not an agent. It is a classifier. No LLM, no loop, no goal beyond "is this spam".
Research tool — yes, agent. Classic case: LLM brain, multiple actions, a loop, goal-directed.
Lane-keeping — not an agent in our sense. It has a vision model and a loop, but no LLM, and the goal is reactive (stay in lane) not pursued through a plan.

If you disagreed with any of these, that is a good sign. The edges are genuinely fuzzy. Most working engineers would call (3) and (5) agents without hesitation, and would argue about (2).

Self-check

In your own words, what are the four things our definition requires for something to be an agent?
Why is a chatbot generally not an agent, even when it is powered by a strong LLM?
What changed in the last few years that made the agent idea practical?

Looking ahead

In Lesson 2 we will look closely at the loop itself — the think-act-observe cycle every agent runs. You will walk through a real agent run, step by step, and see exactly where the model decides what to do, what it does, and what it learns.

← Course overview Lesson 2 — The agent loop →