Agentic AI in Python: Zero to Production · 05Intermediate

Build an Agentic AI App in Python: MCP Client (Part 5)

Build an MCP client in Python so your agent calls real tools — connect a LangGraph agent to an authenticated FastMCP server with langchain-mcp-adapters.

SK

Sukhveer Kaur

Published June 17, 2026 · Updated July 6, 2026

7 min read

Open in ChatGPT Open in Claude

On this page +

Why Your Agent Needs an MCP Client Step 1 — Connect the Agent to the MCP Server Step 2 — Let the Agent Choose and Call Tools Step 3 — Errors, Timeouts, and Tool-Call Guardrails Common Mistakes I Hit Wiring This Up Testing It End to End What's Next — and the Series Wrap

🧰 New here? Set up your environment first · ~5 min

Install Python 3.11+ — confirm with python3 --version.
Create and activate a virtual environment: python3 -m venv .venv then source .venv/bin/activate (Windows: .venv\Scripts\activate). venv, pip & uv primer →
Install the packages this tutorial lists: pip install -U pip <packages>.
Put your LLM API key in a .env file and never commit it. API key + .env primer →

Full walkthrough → Environment Setup primer

🟡 Intermediate⏱️ 25 minStack: Python 3.11+, LangGraph, langchain-mcp-adapters

Series: Agentic AI in Python — Zero to Production

This is Part 5 — building an MCP client so your agent can use real tools. The story so far:

Part 1: A local LangGraph agent with tools and a SQLite checkpointer

Part 3: A supervisor + workers multi-agent team

Part 4: Long-term memory across threads

New here? You’ll need Part 1’s agent.py and the MCP server from the build pillar — this post wires the two together.

By the end of Part 4 your agent could remember. By the end of this one it can act — calling real tools from the MCP server we built earlier, with authentication, straight from the app you’ve been growing since Part 1. That’s the gap an MCP client closes: it’s the piece on the agent’s side that discovers a server’s tools and lets the model call them.

Here’s the test that matters. Ask your current agent “how much disk space is left on the ops box?” and it will cheerfully make up a number, because it has no way to look. We’re going to give it that way. We’ll connect it to the authenticated FastMCP server from the build pillar, let the model pick tools on its own, and harden the connection so a slow or rude server can’t take your agent down. Three short steps. First, why your agent needs a client at all.

✅ Before you start

A working agent.py from Part 1 and the MCP server from the build pillar — this post wires them together
You know what MCP is — new to it? Read What is an MCP server? first
Comfortable with async/await in Python — new to it? Read the async and await primer first

🎯 Key takeaways

An MCP client is the agent-side piece that discovers a server’s tools and lets the model call them. The server already exists — the client is the only new thing you add.
Match the transport to how the server runs — use streamable_http for a deployed HTTP server, not stdio — and pass the Bearer token in headers on every request.
Tools from get_tools() are ordinary LangChain tools, so create_react_agent (or a Pydantic AI toolset) wires them in with almost no change to your agent.
Guard every remote tool call with a timeout and a fallback, and treat tool output as data, not instructions, to limit prompt-injection risk.

Why Your Agent Needs an MCP Client#

An agent without tools is a very expensive autocomplete. It can reason about your disk usage, your database, your GitHub issues — but it can’t touch any of them. The whole point of the Model Context Protocol (MCP — a standard way for agents to talk to tools) is to split that problem in two: a server exposes tools, and a client lets your agent consume them.

The diagram shows the split. Your agent (everything from Parts 1–4) talks to a thin MCP client, the client opens an authenticated connection to the server, and the server’s tools reach your real data. The client is the only new thing you’re adding — the server already exists, and the agent barely changes. You wrote the hard half (the server) back in the build pillar; this is the easy half that makes it pay off.

I’ll be honest about why this part waited until Part 5. Tools are tempting to add first, but an agent that can act before it can remember repeats itself and forgets what it already tried. Memory came first on purpose. Now the order pays off.

Step 1 — Connect the Agent to the MCP Server#

Before any code, here’s the short prerequisite list — everything assumes the server is already running from the build pillar.

The Part 1 agent.py (a compiled LangGraph graph)
The FastMCP Ops Server from the build pillar, reachable at http://localhost:8000/mcp
pip install -U langchain-mcp-adapters (I used 0.1.x, current in June 2026)
A valid Bearer token for the server (the OAuth 2.1 token your server validates)

The flowchart is the whole plan. The connection itself is a single object. MultiServerMCPClient takes a dict of named servers — each with a transport, a URL, and (for our authenticated server) the headers to send on every request:

python

# mcp_client.py — connect to the Ops Server
import os
from langchain_mcp_adapters.client import MultiServerMCPClient
 
client = MultiServerMCPClient(
    {
        "ops": {
            "transport": "streamable_http",
            "url": "http://localhost:8000/mcp",
            "headers": {
                "Authorization": f"Bearer {os.environ['OPS_MCP_TOKEN']}",
            },
        }
    }
)
 
tools = await client.get_tools()   # discovers every tool the server exposes
print([t.name for t in tools])     # ['disk_usage', ...]

Two things to notice. The transport must be streamable_http, not stdio — your server runs over HTTP (that was the whole point of making it production-ready), so a stdio client will never reach it. And get_tools() is where discovery happens: the client asks the server what it offers and hands you back LangChain-compatible tool objects, schemas and all. You never hand-write a tool definition.

The token lives in an environment variable, never in the file. The header is attached to every request the client makes, which is exactly what the server’s JWTVerifier expects.

Step 2 — Let the Agent Choose and Call Tools#

Discovery gives you tools; now the model has to actually use them. The good news is that the tools from get_tools() are ordinary LangChain tools, so they drop straight into the agent you already have. For a fresh LangGraph agent, that’s one line:

python

# agent_with_tools.py — LangGraph
from langgraph.prebuilt import create_react_agent
from langchain.chat_models import init_chat_model
 
model = init_chat_model("anthropic:claude-sonnet-4-5")
tools = await client.get_tools()
 
agent = create_react_agent(model, tools)
 
result = await agent.ainvoke(
    {"messages": [{"role": "user",
                   "content": "How much disk is left on the ops box?"}]}
)
print(result["messages"][-1].content)

create_react_agent wires up the loop: the model sees the tool schemas, decides disk_usage is the right call, the client executes it against your server, and the result comes back into the conversation. You don’t route anything by hand — the model chooses.

If you’re using Pydantic AI (the typed framework from this companion tutorial), the same server plugs in as a toolset:

python

# agent_with_tools.py — Pydantic AI
from pydantic_ai import Agent
from pydantic_ai.mcp import MCPServerStreamableHTTP
 
ops = MCPServerStreamableHTTP(
    "http://localhost:8000/mcp",
    headers={"Authorization": f"Bearer {os.environ['OPS_MCP_TOKEN']}"},
)
agent = Agent("anthropic:claude-sonnet-4-5", toolsets=[ops])
 
async with agent:                       # opens/closes the MCP connection
    result = await agent.run("How much disk is left on the ops box?")
print(result.output)

I prefer Pydantic AI’s async with agent: here because it ties the connection’s lifecycle to the run — open on entry, closed on exit, no leaked sockets. (Heads-up for the future: Pydantic is migrating these classes to a unified MCPToolset built on the FastMCP client, so check the docs if you’re on a newer version.) Either way, the agent that could only think about your ops box can now read it.

Step 3 — Errors, Timeouts, and Tool-Call Guardrails#

This is the section the happy-path tutorials skip, and it’s the one that decides whether your agent survives contact with production. A remote tool call is a network call: it can be slow, it can fail, and the tool can return something nasty. Wrap every tool call with a timeout and a fallback, or one stuck server will hang your whole agent.

python

# guarded tool execution
import asyncio
 
async def call_with_timeout(tool, args, seconds=10):
    try:
        return await asyncio.wait_for(tool.ainvoke(args), timeout=seconds)
    except asyncio.TimeoutError:
        return "Tool timed out — tell the user the ops box is unreachable."
    except Exception as e:                       # auth, transport, server errors
        return f"Tool failed: {e}. Do not retry automatically."

The returned string matters more than it looks. When a tool fails, you don’t crash — you hand the model a plain-language message it can relay to the user, so the agent degrades gracefully instead of dying. In my testing, a healthy disk_usage call round-trips in roughly 200–400ms; I set the timeout at 10 seconds so a genuinely stuck server trips it long before a user gives up.

The second guardrail is about trust. Anything a tool returns becomes part of the model’s context, so a compromised or buggy server can try a prompt injection (text crafted to hijack the model’s instructions). Two cheap defences go a long way:

Least privilege on the token — the Bearer token should grant only the scopes the agent truly needs, so a hijacked tool call can’t do much.
Treat tool output as data, not instructions — keep tool results in clearly labelled tool messages, and never paste them into the system prompt.

💡 Tip

Wrap every MCP tool call in a timeout and error guard. A hung or failing tool should degrade the agent gracefully, not freeze the whole run waiting on a response that never comes.

Common Mistakes I Hit Wiring This Up#

Three mistakes ate most of my debugging time on this part, and they’re the same three I see in every “my agent can’t call the tool” thread.

Common mistake: using stdio transport against an HTTP server. The client connects to nothing and get_tools() returns an empty list with no error. Match the transport to how the server actually runs — streamable_http for the deployed Ops Server.

The other two are quieter. A missing or expired token shows up as a 401 buried in the client logs, not as a friendly Python error — if get_tools() is empty, check auth before you touch anything else. And blocking calls inside an async agent: MultiServerMCPClient is async, so calling get_tools() without await (or from a sync function) silently returns a coroutine that never runs. If your tools list is a <coroutine object>, that’s the bug.

Testing It End to End#

The test that proves it works is the intro question, now with a real answer. Start the Ops Server, export OPS_MCP_TOKEN, then run the agent and ask “How much disk is left on the ops box?” A working setup does three visible things: get_tools() prints ['disk_usage', ...], the agent’s trace shows a disk_usage tool call, and the final answer contains a real number from your machine — not an invented one.

Then break it on purpose to trust your guardrails: stop the server and ask again. You should get “the ops box is unreachable,” not a stack trace. If you get the graceful message, your timeout and fallback are doing their job and you’re production-ready.

What’s Next — and the Series Wrap#

Your agent now has the full set: tools (this part), memory (Part 4), a multi-agent structure (Part 3), and a deployable FastAPI service (Part 2). It can think, remember, and act on real systems with authentication. That’s a genuinely useful agent, not a demo.

The honest next frontier is knowing whether it’s any good. Right now you find out it broke when a user tells you. Part 6 closes that gap with observability and evals — tracing every tool call and run, then scoring the agent against a fixed test set so you catch regressions before your users do. I’d build that before adding more tools: capability without measurement is how agents quietly rot.

So, a question to shape Part 6 — and drop your answer in the comments: what would you most want to see about your agent in production: every tool call it makes, the cost per run, or a pass/fail score against your own test cases? Tell me which, and I’ll lead with it.

The full series — Agentic AI in Python: Zero to Production:

Part 1 — Tools, StateGraph & Memory
Part 2 — FastAPI, Docker & Deploy
Part 3 — Multi-Agent Systems
Part 4 — AI Agent Memory
Part 5 — MCP Client & Real Tools — you’re here
Part 6 — Observability & Evals

🧭 Where to go from here

Don’t have a server yet? Build a production MCP server in Python first.
Next in this series: Part 6 — observability and evals.
New to MCP entirely? Start with What is an MCP server?.

Frequently asked questions

What's the difference between an MCP server and an MCP client? +

A server exposes tools; a client lets your agent discover and call them. This post builds the client that consumes the authenticated server from the build pillar.

Why does get_tools() return an empty list? +

Usually a transport mismatch (stdio against an HTTP server) or a missing/expired token surfacing as a hidden 401. Check the transport and auth before anything else.

Why is my tools variable a coroutine object? +

MultiServerMCPClient is async — you called get_tools() without await. Await it from an async function.

How do I stop a slow MCP server hanging my agent? +

Wrap each tool call in asyncio.wait_for with a timeout and return a plain-language fallback message, so the agent degrades gracefully instead of hanging.

References

#MCPClient #AgenticAI #LangGraph #PydanticAI #PythonTutorial #AIForDevelopers

Share

Written by

Sukhveer KaurSoftware Developer & AI Engineer

Sukhveer is a software developer specialising in AI systems and backend engineering. She has hands-on experience designing agentic AI applications, working with large language model pipelines, autonomous agent frameworks, and cloud-native services in Java and Python. At InfoWok, she bridges the gap between cutting-edge AI research and practical implementation — helping developers understand and apply emerging technologies through clear, experience-backed writing.

Linkedin ↗

Related guides

Intermediate · 1 minAgentic AI in Python: Zero to Production — The Full SeriesSukhveer Kaur · Jun 20, 2026 Comparison · 6 minPydantic AI vs LangChain: Which Framework Should You Use? (2026)Sukhveer Kaur · Jul 6, 2026 Intermediate · 6 minLangGraph vs CrewAI vs AutoGen: Which to Use in 2026?Sukhveer Kaur · Jun 15, 2026

More by Sukhveer Kaur

Guide · 8 minEvaluate an AI Agent on a Local LLM: Free, No API Key (2026)Sukhveer Kaur · Jul 18, 2026 Guide · 9 minAI Agent Guardrails in Python: Input & Output ValidationSukhveer Kaur · Jul 6, 2026 Comparison · 6 minAgentic Search vs RAG: Which One Do You Actually Need? (2026)Sukhveer Kaur · Jul 6, 2026

Continue the series

← Part 04

Build an Agentic AI App in Python: AI Agent Memory (Part 4)

Part 06 →

AI Agent Observability & Evals in Python (Part 6)

Get the next part the day it lands

One email per new part. No digest spam.