Model Context Protocol: The "USB-C" for AI

Every time you connect an LLM to a new database, API, or file system, someone writes bespoke glue. An organisation with N models and M tools ends up maintaining N×M custom integrations — a combinatorial mess that stifles exactly the tool-rich AI everyone wants to build. Anthropic’s Model Context Protocol (MCP) collapses that to N+M: one open standard, hailed as the “USB-C for AI,” that lets any model talk to any tool through a single plug. This post digs into how it works — the architecture, the core building blocks, and how it stacks up against context techniques like RAG.

Why We Need a Standard?

Before MCP, connecting an LLM to a new data source or tool was a one-off, custom endeavor. An organization with ‘$N$’ LLM applications and ‘$M$’ external systems (databases, APIs, file systems, etc.) faced the daunting task of building and maintaining N \times M unique integrations. This “N x M problem” stifled innovation, increased development costs, and created a fragmented and unscalable AI ecosystem.

MCP elegantly solves this by introducing a universal protocol. Instead of building a custom bridge for each connection, developers can now create a single MCP-compliant “server” for each external system and a single MCP “client” for each LLM application. This transforms the N \times M problem into a much more manageable $N + M$ scenario, fostering a more interoperable and efficient AI landscape.

Under the Hood: The Core Architecture of MCP

At its heart, the Model Context Protocol operates on a client-host-server architecture, communicating via JSON-RPC 2.0 messages. This design promotes a clear separation of concerns, enhances security, and allows for a modular and scalable approach to AI application development.

MCP High-Level Architecture

graph TD
    User[👤 User] --> Host[🖥️ Host Application<br/>Claude Desktop, VS Code, etc.]
    Host <--> Client[🔌 MCP Client<br/>JSON-RPC 2.0]
    Client <--> Server[⚙️ MCP Server<br/>Tools, Resources, Prompts]
    Server --> DB[(🗄️ Database)]
    Server --> API[🌐 REST API]
    Server --> Files[📁 File System]
    Server --> Tools[🔧 External Tools]

    subgraph "Host Environment"
        Host
        Client
    end

    subgraph "External Systems"
        DB
        API
        Files
        Tools
    end

The Host: The host is the user-facing application, such as an IDE, a chatbot interface like Claude Desktop, or any other AI-powered tool. It is responsible for managing the entire MCP lifecycle, including discovering and connecting to MCP servers, handling user permissions, and orchestrating the flow of information between the user, the LLM, and the various MCP servers.
The MCP Client: The client is a component within the host application that establishes and maintains a connection with a single MCP server. It acts as an intermediary, translating the host’s requests into MCP-compliant messages and forwarding them to the server. This one-to-one relationship between a client and a server ensures a secure and isolated communication channel.
The MCP Server: The server is a lightweight, standalone application that exposes the capabilities of an external data source or tool to the MCP ecosystem. It implements the MCP specification and responds to requests from the client, providing data, executing actions, and offering pre-defined prompts.

The MCP Workflow in Action

To truly understand how MCP works, let’s walk through a typical workflow, from a user’s query to the final, context-enriched response.

MCP Interaction Workflow

sequenceDiagram
    participant User
    participant Host as "Host (e.g., Claude)"
    participant LLM
    participant MCP Client as "MCP Client"
    participant MCP Server as "MCP Server"

    User->>Host: Enters prompt: "What's the weather like in Melbourne?"
    Host->>LLM: Sends prompt for analysis
    LLM-->>Host: Determines it needs a 'weather' tool
    Host->>MCP Client: Queries for available tools
    MCP Client->>MCP Server: Request available tools
    MCP Server-->>MCP Client: Returns schema for 'get_weather' tool
    MCP Client-->>Host: Provides tool schema
    Host->>LLM: Provides tool schema and user prompt
    LLM-->>Host: Generates request to call 'get_weather' with location="Melbourne"
    Host->>MCP Client: Send tool execution request
    MCP Client->>MCP Server: Execute get_weather(location="Melbourne")
    MCP Server->>MCP Server: Calls external weather API
    MCP Server-->>MCP Client: Returns result: {"temperature": "14°C", "condition": "Cloudy"}
    MCP Client-->>Host: Forwards tool result
    Host->>LLM: Provides tool result as context
    LLM-->>Host: Generates final response: "The weather in Melbourne is currently 14°C and cloudy."
    Host-->>User: Displays the final response

The Building Blocks of MCP: Primitives Explained

The power and flexibility of MCP stem from its core “primitives,” which define the types of capabilities that a server can offer.

Resources: Resources represent data that the LLM can access. This can be anything from a file on the local system (file:///path/to/document.pdf) to a record in a database or the content of a webpage. Resources are primarily used to provide the LLM with the necessary context to answer a query or complete a task.
Tools: Tools are executable functions that the LLM can invoke to perform actions in the real world. This is where MCP truly shines, enabling LLMs to go beyond simple text generation and interact with external systems. A tool could be used to send an email, create a calendar event, query a database, or even execute code.
Prompts: These are pre-defined, reusable templates for interacting with the LLM. They can be as simple as a fixed instruction (“Summarize the following resource”) or as complex as a multi-step workflow, often combining resources and tools. Prompts help guide the LLM’s behavior in a consistent and predictable manner.

Getting Your Hands Dirty: A Practical Implementation Guide

The best way to understand MCP is to see it in action. Let’s walk through a simplified example of how to create an MCP server using the official Python SDK. In this example, we’ll create a server with the simple “weather” tool from our workflow diagram.

First, ensure you have the Python SDK installed:

pip install model-context-protocol

Now, let’s write the server code:

weather_server.py

import asyncio
from model_context_protocol import (
    ContextProtocolServer,
    Tool,
    ToolCall,
    ToolResult,
    ToolSchema,
    Primitive,
)

# Define the schema for our tool. This tells the LLM what the tool is,
# what it does, and what parameters it accepts.
WEATHER_TOOL_SCHEMA = ToolSchema(
    name="get_current_weather",
    description="Get the current weather in a given location.",
    input_schema={
        "type": "object",
        "properties": {
            "location": {
                "type": "string",
                "description": "The city and state, e.g., San Francisco, CA",
            }
        },
        "required": ["location"],
    },
)

# This is the actual implementation of our tool.
# In a real application, this would call a weather API.
def get_weather_implementation(location: str) -> dict:
    """A mock implementation of the weather tool."""
    print(f"Fetching weather for {location}...")
    if "melbourne" in location.lower():
        return {"temperature": "14°C", "forecast": "Cloudy with a chance of rain"}
    else:
        return {"temperature": "22°C", "forecast": "Sunny"}

class WeatherTool(Tool):
    def get_schema(self) -> ToolSchema:
        return WEATHER_TOOL_SCHEMA

    async def execute(self, tool_call: ToolCall) -> ToolResult:
        location = tool_call.inputs.get("location")
        if not location:
            # Handle error case where location is not provided
            return ToolResult(
                tool_call_id=tool_call.tool_call_id,
                output={"error": "Location must be provided."},
                is_error=True
            )

        weather_data = get_weather_implementation(location)
        return ToolResult(tool_call_id=tool_call.tool_call_id, output=weather_data)


# Main function to set up and run the server
async def main():
    print("Starting weather MCP server...")
    server = ContextProtocolServer(
        # The list of primitives this server provides.
        primitives=[Primitive(id="weather_tool_1", implementation=WeatherTool())]
    )
    await server.run_forever()

if __name__ == "__main__":
    asyncio.run(main())

When you run this script (python weather_server.py), it starts a local MCP server. A host application like Claude Desktop can then discover this server, and the LLM can leverage the get_current_weather tool as described in the workflow diagram.

MCP vs. RAG: A Tale of Two Contexts

It’s common to wonder how MCP relates to Retrieval-Augmented Generation (RAG). While both aim to provide LLMs with external context, they operate differently and solve distinct parts of the problem.

Feature	Retrieval-Augmented Generation (RAG)	Model Context Protocol (MCP)
Core Idea	Retrieve relevant documents from a vector database before calling the LLM.	A real-time, interactive protocol for LLMs to use tools and access data during generation.
Workflow	1. User Query -> 2. Retrieve -> 3. Augment Prompt -> 4. LLM Call	1. User Query -> 2. LLM Call -> 3. LLM decides to use tool -> 4. MCP Call -> 5. LLM generates response.
Interaction	Static, one-way data push to the LLM.	Dynamic, two-way interaction. The LLM can decide which tools to use and when.
Use Case	Answering questions based on a large corpus of documents (e.g., a knowledge base).	Enabling AI agents to perform actions, query live APIs, and interact with local files.
Synergy	A RAG pipeline can be exposed as an MCP `Tool` or `Resource` for an agent to use.	MCP can provide the framework for an LLM to decide to use a RAG system.

In short, RAG is a technique for data retrieval; MCP is a protocol for interaction. They are not mutually exclusive and can be powerfully combined.

The Future is Composable: Implications of MCP

The standardization proposed by MCP has far-reaching implications:

A Composable AI Ecosystem: Developers will be able to build complex AI applications by assembling off-the-shelf MCP servers, much like building web applications with microservices.
An “App Store” for AI Tools: We can envision a marketplace where developers can publish and monetize MCP servers that provide access to proprietary data or specialized tools.
Enhanced Security and User Control: By standardizing the interaction layer, MCP provides clear points for implementing security policies, sandboxing, and fine-grained user consent for tool usage.
Democratized AI Development: MCP lowers the barrier to entry for building sophisticated, context-aware AI applications, empowering a broader range of developers to innovate.

Conclusion

The Model Context Protocol is more than just a new API; it’s a foundational piece of infrastructure for the next generation of AI. By solving the N x M integration problem and providing a standardized way for LLMs to interact with the outside world, MCP paves the way for more capable, secure, and composable AI systems. Just as USB-C unified the world of physical connectors, MCP is poised to unify the world of AI integrations, unleashing a new wave of innovation in the process.

References

Model Context Protocol Official Website: https://modelcontextprotocol.io/
Official MCP Specification: https://modelcontextprotocol.io/specification/2025-06-18
Anthropic’s MCP Documentation: https://docs.anthropic.com/en/docs/mcp
MCP GitHub Organization (SDKs and Examples): https://github.com/modelcontextprotocol

Model Context Protocol: The "USB-C" for AI

Why We Need a Standard?

Under the Hood: The Core Architecture of MCP

The MCP Workflow in Action

The Building Blocks of MCP: Primitives Explained

Getting Your Hands Dirty: A Practical Implementation Guide

MCP vs. RAG: A Tale of Two Contexts

The Future is Composable: Implications of MCP

Conclusion

References

Related Posts

From OpenAPI to MCP Server: Bridging REST APIs and AI Agents with Go

Building Robust Intent Classifiers with Generative AI

The Unseen Engine of Modern AI Agents: A Deep Dive into JSON-RPC

Model Context Protocol: The "USB-C" for AI

Why We Need a Standard?

Under the Hood: The Core Architecture of MCP

The MCP Workflow in Action

The Building Blocks of MCP: Primitives Explained

Getting Your Hands Dirty: A Practical Implementation Guide

MCP vs. RAG: A Tale of Two Contexts

The Future is Composable: Implications of MCP

Conclusion

References

// Related Posts

From OpenAPI to MCP Server: Bridging REST APIs and AI Agents with Go

Building Robust Intent Classifiers with Generative AI

The Unseen Engine of Modern AI Agents: A Deep Dive into JSON-RPC

Related Posts