Seenivasa Ramadurai

Posted on May 29

AI Agents Are the New Microservices & A2A Is Their HTTP(s)

#agents #ai #architecture #microservices

Introduction

As enterprises race to deploy generative AI Apps/Agents, the hardest question isn't "which foundation model do we use?." it's "how do they safely talk to each other?"

If you spent the 2010s building distributed systems, the architectural blueprints emerging for enterprise AI will feel strangely familiar. Bounded contexts, service registries, async message queues, and distributed tracing are all back. The vocabulary is almost identical except our "services" now reason in natural language, call tools, and produce probabilistic, context-aware outputs instead of deterministic ones.

The Agent-to-Agent (A2A) Protocol is the open-standard transport and interface layer that makes this architectural analogy concrete. And ,the protocol now has support from more than 150 organizations including Salesforce, Microsoft, SAP, Workday, PayPal, and LangChain.

Just as HTTP/REST became the lingua franca of Microservice communication, A2A (now hosted under the Linux Foundation) standardizes how autonomous agents discover capabilities, delegate tasks, and maintain security boundaries.

Defining the Ecosystem: A2A vs. MCP

To design an enterprise multi agent mesh, you must first separate agent orchestration from tool execution. A common architectural anti pattern is trying to force a single protocol to handle both.

Model Context Protocol (MCP): This handles the Agent-to-Tool layer. It standardizes how a single agent securely reads from local databases, hooks into enterprise storage, or accesses development environments.

Agent-to-Agent Protocol (A2A): This handles the Agent-to-Agent layer. It standardizes how separate, sovereign intelligent systems communicate with each other in their natural, semantic modalities (negotiating tasks, passing conversational state, or handing off workflows) across frameworks and lines of business.

The key distinction: MCP connects agents to tools (vertical integration). A2A connects agents to each other (horizontal integration). They are explicitly designed to be complementary, not competitive. Together, they form the two-layer interoperability stack for modern multi-agent systems.

Under the Hood: How A2A Actually Works

Before diving into communication styles, it helps to understand the technical foundation A2A is built on because it is deliberately not reinventing the wheel.

A2A leverages well established web technologies.

HTTP/HTTPS — primary transport layer (production deployments require HTTPS with modern TLS)
JSON-RPC 2.0 — structured data exchange format for all requests and responses
Server-Sent Events (SSE) — real-time, one-way streaming of updates from agent to client

Every A2A agent publishes a small JSON document called an Agent Card, typically served at /.well-known/agent.json. This file lists the agent's identity, skills, endpoint URL, and authentication requirements — enabling zero-configuration discovery between agents without any proprietary registry or coordination layer.

Security is baked in from the start. A2A incorporates enterprise-grade authentication and authorization mechanisms aligned with OpenAPI security schemes, including support for OAuth 2.0 and API keys passed via HTTP headers.

The Four A2A Communication Styles

The A2A standard defines clear execution modes that mirror the structural communication patterns distributed systems engineers have relied on for decades.

1. Synchronous (Blocking)

One agent sends a task and blocks its execution context until the responding agent returns a final artifact.

Microservices Analogy: A standard REST call (GET /resource).

AI Use Case: Fast, critical path dependency queries like an Orchestrator agent requesting a real time risk compliance score before formatting a customer response.

2. Asynchronous (Non-Blocking)

One agent dispatches a task object and immediately returns to other processing. The remote agent queues the work and processes it in the background.

Microservices Analogy: Message queues or event streams (Kafka, RabbitMQ).

AI Use Case: Long-running cognitive tasks such as a Legal Agent reading a 400-page corporate acquisition contract or a Data Agent running complex batch classification.

3. Streaming

Continuous data tokens or partial states flow dynamically between agents in real time, rather than waiting for a single completed payload.

Microservices Analogy: gRPC streaming or Server-Sent Events (SSE).

AI Use Case: Real-time speech transcription agents feeding an analysis agent, or interactive multi-agent chat interfaces where UX requires instant token delivery.

4. Push Notifications (Event-Driven)

An agent registers a web callback or subscription, receiving a proactive alert only when a specific upstream event or state change occurs. When significant task state changes happen such as completed, failed, or input-required the server sends an asynchronous HTTP POST notification to the client's provided web hook. This requires the server to declare push notification capability in its Agent Card.

Microservices Analogy: Web hooks or an Event Bus.

AI Use Case: Event-driven governance like an automated Compliance Agent waking up to audit a transaction only when an Account Agent drafts a contract exceeding $1M.

Key Architectural Insight: A mature multi-agent enterprise system never forces a single interaction pattern. It builds a mesh that combines all four, leveraging an internal API gateway plane to manage traffic, route tasks, and handle fallback strategies.

The Critical Shift: From Deterministic to Semantic Interfaces

In traditional microservices, the API contract is strictly deterministic: Send these exact bytes, receive those exact bytes.

In a multi-agent network, the interface is semantic: Send this intent, receive a reasoned response.

Instead of maintaining brittle endpoints for every hyper-specific query variation, an agent uses its Agent Card to advertise its overall "Skills" and expected structural input/output schemas. A Finance agent capable of calculating remaining Q3 headcount budgets does not require a new API endpoint deployment when business users slightly pivot the nuance of the request; it interprets the intent via the A2A task lifecycle.

The "beating heart" of this lifecycle is the task's input-required state, which allows agents to pause execution mid-task and request further information from clients or other agents something traditional REST APIs were simply never designed to do. This makes agent conversations stateful and adaptive in a way that static Microservice contracts are not.

Conclusion

The parallels between the microservices revolution of the 2010s and today's multi-agent AI ecosystem are not just cosmetic. The same hard-won lessons around service discovery, security boundaries, async communication, and composable architecture are being relearned and encoded into open standards like A2A and MCP.

A2A is an open standard that enables AI agents to discover, communicate, and transact with each other across different frameworks, vendors, and platforms. MCP handles how each of those agents connects to its tools. Together they give architects a principled, two-layer model for building AI systems that are modular, interoperable, and production-ready.

The momentum behind A2A growing from 50 launch partners to 150+ organizations in under a year underscores something simple fragmentation in AI agent ecosystems is a problem the industry is collectively choosing to solve. For engineers building in this space today, the question is no longer whether these protocols matter. It's whether your architecture is ready for the systems around you that already use them.

Thanks
Sreeni Ramadorai

Top comments (12)

TxDesk • May 31

The shift from deterministic to semantic interfaces is the part that excites me and worries me in equal measure.

Excites: agents can negotiate intent, adapt mid-task, ask for missing context. That's genuinely new.

Worries: in domains where the wrong action is irreversible (signing a transaction, executing a trade, sending funds), "send an intent, receive a reasoned response" is the wrong primitive. The reasoned response has variance. The downstream effect doesn't. A REST endpoint that moves money is brittle on purpose — you know exactly what bytes were sent and exactly what state will change. An A2A task with a stateful input-required loop has more degrees of freedom than the underlying state machine should allow.

The microservices analogy actually breaks here. HTTP earned its place partly because the safe methods (GET, HEAD) were genuinely safe and the unsafe ones (POST, DELETE) were explicitly marked. A2A doesn't have that distinction yet. Every task is implicitly POST. An Orchestrator agent delegating to a Compliance Agent that delegates to a Treasury Agent has no in-protocol way to say "this leg is read-only, retry freely; this leg moves real value, retry-with-idempotency-key-or-not-at-all."

We probably need an A2A equivalent of HTTP method semantics before this becomes infrastructure for anything money-shaped. Currently the only thing enforcing that is the developer of each individual agent, which is exactly the same coordination failure that distributed systems fixed 20 years ago by encoding it in the protocol.

Curious if any of the 150+ partners are pushing for explicit reversibility/idempotency semantics at the protocol layer, or if everyone's still treating it as application-level concern.

Seenivasa Ramadurai • May 31

You could strengthen your reply by connecting Agent Skills, Agent Harness, Human-in-the-Loop, guardrails, idempotency, and workflow orchestration together:

Hi TxDesk,

Excellent point. I think this is exactly why concepts such as Agent Skills, Agent Harness, Human-in-the-Loop (HITL), and workflow guardrails are becoming so important in Agentic AI.

We already know LLMs are inherently probabilistic. The challenge is not making the model deterministic; it is building a disciplined execution layer around it.

In my view, the Agent Harness serves a similar purpose to what orchestration, transaction management, and service meshes did for microservices. It provides:

• Structured execution flows
• State management and recovery
• Validation and verification gates
• Retry and idempotency controls
• Human approval checkpoints
• Observability and audit trails

For high-risk actions such as financial transactions, fund transfers, trading, or contract execution, the agent should not have unrestricted autonomy. The model can reason, gather context, and propose actions, but the Harness determines what can actually be executed.

This is where Agent Skills become valuable. Instead of allowing free-form tool usage, agents invoke well-defined skills with explicit contracts, permissions, inputs, outputs, and governance policies. The execution layer can then enforce whether an action is read-only, reversible, idempotent, or requires human approval.

I also agree that today's A2A protocols are still evolving. Much like early distributed systems, many of these concerns are currently handled at the application level rather than the protocol level. Over time, I expect standards to emerge around action classification, risk levels, reversibility, approval requirements, and idempotency semantics.

Ultimately, the future may not be about making probabilistic models deterministic. It may be about surrounding probabilistic reasoning with deterministic workflows, governance, and control layers.

Reasoning can remain probabilistic. Execution cannot.

TxDesk • Jun 2

Good framing on the harness as the orchestration / service-mesh analog. The "reasoning can remain probabilistic, execution cannot" line is the cleanest one-line statement of the problem I've seen. Will read your post.

Seenivasa Ramadurai • Jun 2

Thank you .
Ultimately, the "Harness" is the software discipline wrapped around the probabilistic nature of the AI. By controlling the context, isolating the tools, enforcing strict RBAC, and auditing every loop, we ensure the agent remains a reliable, predictable system in production.

Seenivasa Ramadurai • May 31

Hey , forgot to share the blog of mine which talks about Agent Harness

dev.to/sreeni5018/the-agent-harnes...

Harjot Singh • May 31

The microservices analogy is apt and worth pushing on, including the warning baked into it: microservices gave us scalability and also gave us distributed-systems pain - debugging across service boundaries, cascading failures, the "it's always the network" problem. If agents are the new microservices, we're about to re-learn all of that at the agent layer: non-deterministic services, opaque failure propagation, and coordination overhead. A2A as the protocol is necessary, but the hard-won lesson from microservices is that the protocol is the easy part - observability and failure isolation are what actually save you.

I think about this constantly building Moonshift (a multi-agent pipeline: prompt to a shipped SaaS on your own GitHub + Vercel) - the agents need clean contracts (your A2A point) but also verification gates so one agent's bad output doesn't cascade, plus routing so the "service mesh" isn't all premium-priced (full build ~$3 flat). First run's free, no card. Great framing - do you think A2A converges to a real standard, or do we get the gRPC-vs-REST fragmentation phase first? I'd bet fragmentation before consolidation.

Seenivasa Ramadurai • May 31

Hey Harjot ,
Excellent point. The microservices journey taught us that communication protocols are only a small part of the story. REST and gRPC solved connectivity, but production success came from observability, tracing, circuit breakers, retries, service meshes, and failure isolation.

I think Agentic AI is heading down a very similar path. A2A may become the equivalent of HTTP/gRPC for agent communication, but enterprises will still need an "Agent Mesh" layer that provides routing, governance, memory management, verification gates, cost controls, tracing, and agent-level observability.

Your point about preventing bad outputs from cascading is especially important. In multi-agent systems, validation and verification may play the same role that API contracts and schema validation played in microservices.

As for A2A standardization, I agree with your prediction. We'll likely see a period of fragmentation first—different agent frameworks, protocols, and vendor ecosystems—before the industry converges on a smaller set of standards. That's very similar to how distributed systems evolved over the last decade.

Appreciate you sharing your Moonshift experience. It's always valuable to hear lessons learned from teams already operating multi-agent systems in production.

NOVAInetwork • May 31

The microservices analogy is sharper than most people give it credit for, and the four communication patterns are well-mapped. One thing worth pulling on though: OAuth 2.0 and API keys handle "who is this agent" but not "what is this agent allowed to do once authenticated" or "what happens when two authenticated agents disagree about whether work was completed."

Microservices inside one org don't need to solve the second two problems because the org chart and the IAM admin do that work out-of-band. Agents transacting across vendor boundaries (which 150+ orgs adopting A2A is exactly the case for) don't have that shared out-of-band layer to fall back on. An authenticated Compliance Agent and an authenticated Account Agent can both be holding valid OAuth tokens and still produce two contradictory records of what happened in a $1M contract review, and neither HTTP nor OAuth tells you which one to believe.

The natural extension of the model you're describing is: keep A2A for the message-passing and capability discovery layer, keep MCP for tool access, but add a third layer below both for things HTTP wasn't designed to handle. Protocol-level identity bound to code, capability constraints that limit what an authenticated agent can call regardless of its OAuth scopes, and an append-only record of intents, deliveries, and acceptances that survives disputes between vendors. A2A's task lifecycle gives you the "what state is this conversation in" answer. It doesn't give you the "and three months from now when there's a dispute, what's the canonical record" answer.

The fragmentation point in your conclusion is the right one. The harder fragmentation problem isn't protocol-level (A2A solves that). It's trust-level, and that one doesn't get solved by getting more orgs onto A2A. It gets solved by adding the layer below A2A that nobody owns and everybody can read.

E Lion Reigns • Jun 3

A2A as HTTP(s) for agents is a useful frame. I run voice + web agents with one knowledge source — drift between channels was my hardest bug. Eric, solo builder; happy to compare notes.

Seenivasa Ramadurai • Jun 3

Well . I believe in your case there’s no A2A involved. Only I see web apps connecting to backend Agent Service via full duplex web socket over HTTP ..

View full discussion (12 comments)

Some comments may only be visible to logged-in visitors. Sign in to view all comments.