Build a Tool-Using ReAct Agent
An LLM that can call tools stops being a debate partner and starts being useful. Build a ReAct agent with LangChain or LangGraph that reasons, calls 2-3 real tools, stays inside a tool-call budget so it cannot loop forever, and solves multi-step questions. Submit a single script or notebook for instant, rubric-based feedback.
4 hrs
Est. time
4
Outcomes
5
Rubric criteria
65%
Pass score
What you'll learn
Skills you'll have real reps in after shipping this.
See how it works
The ReAct loop
Reason, act, observe, repeat. The agent thinks about what it needs, calls a tool, reads the result, and loops until it can answer.
Why agents need a budget
Without a cap on tool calls, a confused agent loops forever and burns money. A step budget is the safety belt.
The scenario
Your product needs an assistant that can actually do things: look a value up, run a calculation, search a small knowledge source, and combine the results to answer a question no single tool could. A plain chat completion cannot; it can only talk. You need an agent that reasons about which tool to call, calls it, reads the result, and decides what to do next.
You have been asked to build a ReAct agent: a reason-and-act loop over a small set of real tools, with a hard budget on tool calls so a confused agent cannot spin forever and burn money.
Your role
You are an AI Engineer building a tool-using agent. Your goal is a single module with a ReAct loop, two or three real tools, a tool-call budget, and a demonstration of the agent solving multi-step questions while showing its reasoning and tool calls.
Start the task to unlock the full brief
You'll get the step-by-step requirements, setup commands, the 5-criterion grading rubric, tips, and the ability to submit your solution for instant AI grading.
Free to start · submit when you're ready
Learning resources
What you'll build in this agent task
This is a build-and-submit task. You build a tool-using ReAct agent with LangChain or LangGraph: a reason-and-act loop over two or three real tools, with a tool-call budget so it cannot spin forever, solving questions no single call could answer. The deliverable is one Python file that shows the agent's reasoning and tool calls.
An LLM on its own can only talk. The moment you give it tools and a loop, it can look things up, compute, and combine results into an answer. The engineering is in the loop and its limits: clear tool descriptions so the model calls the right one, visible intermediate steps so you can debug it, graceful handling of tool errors, and a hard budget so a confused agent does not run away.
Grading is rubric-based and explainable. Your submission is scored against weighted criteria (framework, real tools, the ReAct loop, the tool-call budget, and the multi-step demonstration) and returns per-criterion feedback with evidence quoted from your code. The pass threshold is 65 percent and you can resubmit.