Welcome to the first lesson in your journey to mastering the Model Context Protocol (MCP) and its use with the OpenAI Agents SDK in TypeScript. MCP is an open standard designed to enable large language models (LLMs) to securely and efficiently connect with external applications, tools, and data sources through a unified protocol. By providing a standardized way for AI models to access and interact with real-time information, MCP makes it possible to build more capable, context-aware agents that can perform complex tasks and integrate seamlessly with a wide range of systems.
Before we dive into MCP and tool integration, it's important to first build a solid foundation in how agents work using the OpenAI Agents SDK. In this lesson, you will learn how to create a simple agent and understand the basics of agent execution. This foundational knowledge will prepare you for more advanced topics in future lessons, including developing and integrating your own MCP server. Get ready to take your first step toward building intelligent, tool-empowered agents!
The OpenAI Agents SDK is a TypeScript/Node.js package that makes it easy to build, manage, and run AI agents powered by large language models. With this SDK, you can create agents that reason, use tools, interact with other agents, and perform complex tasks in a structured way. Its simple and flexible design lets you quickly prototype and deploy agentic applications. For more details, you can check out the OpenAI Agents SDK Documentation.
If you're working in your own local environment, you'll need to install the package and set your OpenAI API key before you can start building agents. To install the SDK, you can use either npm or yarn:
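A sketch of the install commands, assuming the SDK's current package name `@openai/agents`:

```shell
# Install with npm
npm install @openai/agents

# Or with yarn
yarn add @openai/agents
```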
After installing, set your OpenAI API key as an environment variable. The command you use depends on your operating system:
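For example (replace the placeholder with your actual key):

```shell
# macOS / Linux (bash, zsh)
export OPENAI_API_KEY="your-api-key-here"

# Windows Command Prompt
set OPENAI_API_KEY=your-api-key-here

# Windows PowerShell
$env:OPENAI_API_KEY="your-api-key-here"
```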
While using the CodeSignal coding environment in this course, you don't need to worry about installation or API keys — everything is already set up for you. You can jump right in and start running the code examples as you learn.
A key feature of the OpenAI Agents SDK is the agent loop. This loop is what sets an agent apart from a simple chat model.
A simple chat model, like GPT-4 in a basic chat interface, takes your input and generates a single response — there's no memory of previous steps, no ability to call tools, and no way to interact with other systems. It's a one-shot exchange: you ask a question, and the model answers.
An agent, on the other hand, can perform much more complex tasks thanks to the agent loop. Here's how the agent loop works and why it's different:
- Input Reception: The agent receives an input (such as a user query).
- Reasoning and Planning: The agent uses the language model (LLM) to decide what to do next. This could be generating a direct answer, calling an external tool, or handing off the task to another agent.
- Action Execution:
- If the agent needs to use a tool (like a calculator, web search, or database), the loop executes the tool call, collects the result, and feeds it back into the agent.
- If the agent needs to delegate, the loop can pass control to another agent.
- Iterative Processing: The agent loop repeats this process — reasoning, acting, and updating — until a final answer is produced or a maximum number of steps is reached.
- Final Output: Once the agent determines it has enough information, it produces a final output and the loop ends.
This iterative, multi-step process allows agents to break down complex problems, use external resources, and coordinate with other agents — all automatically managed by the SDK. The agent loop is what enables agents to go beyond simple Q&A and handle real-world tasks that require reasoning, tool use, and multi-step workflows.
Now that you understand the agent loop and how agents differ from simple chat models, let's see how to put this into practice by creating a basic agent. Here is an example of how to create a simple agent to provide creative, healthy recipes:
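A minimal sketch, assuming the package name `@openai/agents`; the exact instruction text shown here is illustrative:

```typescript
import { Agent } from '@openai/agents';

// Define the agent with a name, behavioral instructions, and a model
const agent = new Agent({
  name: 'Recipe Chef',
  instructions:
    'You are a creative chef. Suggest healthy recipes with clear, ' +
    'numbered steps and wholesome ingredients.',
  model: 'gpt-4.1',
});
```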
In the OpenAI Agents SDK, you define an agent by creating a new `Agent` instance with an object literal containing the following properties:
- `name`: A label to identify your agent (e.g., "Recipe Chef").
- `instructions`: Guidance that shapes the agent's behavior and responses (for example, asking the agent to provide clear steps and healthy ingredients).
- `model` (optional): Specifies which language model the agent will use to generate answers. For example, `"gpt-4.1"` is a cost-effective model optimized for agentic workflows. It offers strong performance, a large context window, and great instruction following. If you do not specify a model, the SDK will use a default model for the agent.
By clearly defining these parameters, you ensure your agent behaves as intended and leverages the right language model for your specific use case. This flexibility allows you to tailor agents for a wide range of applications and performance needs.
After creating an agent, you need to execute it to perform a task. The OpenAI Agents SDK provides several ways to do this, each with different levels of control and flexibility. Understanding the difference between the `run` helper function, the static `Runner.run` method, and creating your own `Runner` instance will help you choose the best approach for your application.
The simplest way to execute an agent is with the `run` helper function. This function is a convenient wrapper that handles the entire agent loop for you:
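A sketch of this approach, continuing with the Recipe Chef agent from this lesson:

```typescript
import { Agent, run } from '@openai/agents';

const agent = new Agent({
  name: 'Recipe Chef',
  instructions: 'Provide creative, healthy recipes with clear steps.',
});

// `run` drives the full agent loop and resolves with a result object
const result = await run(agent, 'Give me a quick recipe for a healthy smoothie');
console.log(result.finalOutput);
```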
- When to use: For quick prototyping, scripts, or when you don't need to customize the execution environment.
- How it works: Internally, `run` uses a default runner instance to execute the agent.
The SDK also exposes a `Runner` class with a static `run` method. This method works identically to the `run` helper function and is useful if you prefer a class-based API:
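A sketch assuming the static method shares the `run` helper's signature, as described above:

```typescript
import { Agent, Runner } from '@openai/agents';

const agent = new Agent({
  name: 'Recipe Chef',
  instructions: 'Provide creative, healthy recipes with clear steps.',
});

// Same behavior as the `run` helper, via the class-based API
const result = await Runner.run(agent, 'Give me a quick recipe for a healthy smoothie');
console.log(result.finalOutput);
```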
- When to use: If you want to be explicit about using the class-based API, or if you plan to later switch to a custom runner instance.
For more advanced scenarios, you can create your own `Runner` instance. This is especially useful if you want to:
- Reuse the same runner configuration across multiple agent runs
- Set global options or hooks for all runs
- Manage resources efficiently in long-running applications
- When to use: In production applications, when you need to manage multiple agents, customize execution, or optimize resource usage.
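A sketch of the instance-based approach; the configuration passed to the constructor (a shared default model) is illustrative:

```typescript
import { Agent, Runner } from '@openai/agents';

const agent = new Agent({
  name: 'Recipe Chef',
  instructions: 'Provide creative, healthy recipes with clear steps.',
});

// One runner, reusable across many agent runs; run-level defaults
// (such as a shared model) can be configured here
const runner = new Runner({ model: 'gpt-4.1' });

const result = await runner.run(agent, 'Give me a quick recipe for a healthy smoothie');
console.log(result.finalOutput);
```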
All three approaches execute the same agent loop under the hood, so you can choose the one that best fits your needs.
When you run an agent using any of the three approaches, you receive a result object containing the agent's response and execution details. The most important property is `finalOutput`, which contains the agent's complete answer:
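For instance, continuing with the Recipe Chef agent:

```typescript
import { Agent, run } from '@openai/agents';

const agent = new Agent({
  name: 'Recipe Chef',
  instructions: 'Provide creative, healthy recipes with clear steps.',
});

const result = await run(agent, 'Give me a quick recipe for a healthy smoothie');

// `finalOutput` holds the agent's complete answer
console.log(result.finalOutput);
```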
When you run the agent with the input `'Give me a quick recipe for a healthy smoothie'`, you might receive an output like this:
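The exact wording varies between runs; a purely illustrative example:

```text
Quick Healthy Berry Smoothie

1. Add 1 cup of frozen mixed berries, 1 banana, and 1/2 cup of Greek yogurt to a blender.
2. Pour in 1 cup of milk or a plant-based alternative.
3. Blend until smooth, about 30-60 seconds.
4. Taste, adjust sweetness with a little honey if desired, and enjoy!
```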
If you're working in a Node.js environment that doesn't support top-level `await`, you can wrap your code in an async function:
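For example:

```typescript
import { Agent, run } from '@openai/agents';

const agent = new Agent({
  name: 'Recipe Chef',
  instructions: 'Provide creative, healthy recipes with clear steps.',
});

// Wrap the await in an async function for environments
// without top-level await support
async function main() {
  const result = await run(agent, 'Give me a quick recipe for a healthy smoothie');
  console.log(result.finalOutput);
}

main().catch(console.error);
```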
This asynchronous approach is recommended for all applications, as it prevents your program from blocking while the agent processes requests and allows for efficient handling of multiple concurrent operations.
Streaming execution enables real-time processing and delivery of the agent's outputs. Instead of waiting for the complete response, you can process partial results as they become available. This is useful for applications that require immediate feedback or progress updates, such as live chat interfaces or interactive tools.
Here is how to use streaming execution:
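A sketch of streaming execution, again using the Recipe Chef agent:

```typescript
import { Agent, run } from '@openai/agents';

const agent = new Agent({
  name: 'Recipe Chef',
  instructions: 'Provide creative, healthy recipes with clear steps.',
});

// Passing { stream: true } returns a streaming result object
const result = await run(agent, 'Give me a quick recipe for a healthy smoothie', {
  stream: true,
});

// Pipe the text stream to stdout so the answer appears as it is generated
result
  .toTextStream({ compatibleWithNodeStreams: true })
  .pipe(process.stdout);

// Wait for streaming to finish; the full answer is then available
await result.completed;
console.log('\nFinal output:', result.finalOutput);
```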
In this example, the `run` function is called with a third parameter, `{ stream: true }`, to enable streaming mode. This returns a streaming result object that provides several useful methods and properties:
- `toTextStream({ compatibleWithNodeStreams: true })` creates a Node.js-compatible readable stream that you can pipe to other streams, such as `process.stdout` for console output
- `completed` is a `Promise` that resolves when the streaming is finished
- `finalOutput` contains the complete response after streaming completes
When you run this code, you'll see the agent's response appear in real time as it's generated, character by character. This provides immediate feedback to users and creates a more interactive experience. Even when using streaming mode, you can still access the complete final output after streaming completes, giving you the flexibility to both monitor the process in real time and easily extract the final result when needed.
In this lesson, you built a solid foundation for working with the OpenAI Agents SDK in TypeScript. You learned how agents differ from simple chat models by exploring the agent loop — a process that allows agents to reason, plan, and act in multiple steps. You saw how to create a basic agent using object literal syntax, and then discovered three different ways to execute your agent: using the `run` helper function for simplicity, the static `Runner.run` method for class-based APIs, and creating your own `Runner` instance for advanced control and customization. You also learned how to handle streaming execution for real-time responses.
With these fundamentals in place, you're ready to move on to more advanced topics, where you'll get hands-on practice applying what you've learned and start extending your agents' capabilities. Dive into the practice exercises ahead — they're designed to help you solidify your understanding and gain confidence with the OpenAI Agents SDK.
