AI Agents Moving Decision-Making Up the Stack

F5 Ecosystem | July 10, 2025

You know that moment when you realize the architecture you've been perfecting for a decade is about to get steamrolled by something that wasn’t even on the radar two years ago?

Welcome to the age of agent architectures.

These aren’t your standard automation scripts or AI wrappers. Agents are goal-driven, generative, and increasingly self-directed. They don’t just call APIs; they chart their own path through them. And the kicker? They bring their own policy.

Every request an agent sends might carry:

Intent headers (X-Goal, X-Context, X-Route-Preference)Fallback instructions based on real-world SLA sensitivity A memory of what didn’t work last time Hints on what success even looks like (spoiler: it’s not “200 OK”)

This is decision-making in motion. It’s not centralized, preplanned orchestration. It’s runtime delegation and it’s going to change how infrastructure works.

Why this matters (sooner than you think)

Right now, most enterprise systems aren’t feeling the friction. Early agents are still chatbots, copilots, or isolated productivity enhancers.

But as they move into business workflows (you know, things like order resolution, claims processing, triage, fulfillment) they start interacting with real systems. That means:

Infrastructure needs to interpret agent-carried logic
Gateways need to parse more than just JWTs and paths
Load balancers need to stop thinking in terms of “fastest” and start thinking in terms of “fit for this task”

It’s not a crisis yet. But it’s coming. And when it hits, the issue won’t be lack of bandwidth. It’ll be a mismatch between how agents make decisions and how infrastructure handles traffic.

The shift: From enforcement to execution

Here’s the real architectural punchline: agents are moving decision-making up the stack.

The request becomes the control plane.

It doesn’t ask the infrastructure, “What should I do?” It tells it: “Here’s what I need. Here’s how I want you to behave. Now get it done.”

If your systems treat that like a normal request, just another GET or POST, then fallback logic will collide, retries will overlap, and agent performance will degrade for reasons that won’t show up in your dashboards.

Not because the infra failed. But because it wasn’t listening.

Where this is coming from

This shift isn’t theoretical, it’s emerging through real frameworks. Efforts like the Model Context Protocol (MCP), Agent-to-Agent (A2A) communication models, and even early work in policy-wrapped task routing are all pointing in the same direction:

MCP pushes full context—goals, constraints, fallback options—into the request payload itself.
A2A protocols enable agents to coordinate tasks and hand off decisions dynamically, without a central orchestrator.
Runtime task delegation via structured metadata lets the infrastructure interpret—not decide—at the point of execution.

They differ in syntax, structure, and abstraction—but they all land on the same architectural outcome: policy in the payload and purpose in the request.

And once the request becomes the carrier of logic, infrastructure has two choices: adapt or become little more than a pipeline moving payloads.

What you can do now (before it’s a problem)

This isn’t about ripping and replacing. It’s about getting ahead of the shift while it’s still early. Start here:

Capture context
Log X-Intent, X-Task-Profile, and any other metadata that might hint at what the agent is trying to accomplish. If your observability ends at the URI, you're already behind.
Think in terms of “task fitness,” not “availability”
A backend that’s up doesn’t mean it’s good enough. Start experimenting with health models that account for SLA, latency thresholds, and intent.
Get smart with your policies
Existing application delivery and security platforms have scripting layers. Use them. Start parsing intent headers. Modify routing rules based on request goals, not just service path.
Plan for adaptive enforcement
Instead of defining fallback logic in a static config file, begin exploring how infrastructure can enforce fallback instructions embedded in the request itself.
Build a feedback loop
If agents are rerouting traffic based on past performance, you want to be part of that loop. Feed outcome context back into your traffic and health systems.

Final thought: You're not being replaced, you're being repositioned

In an agent-driven architecture, the infrastructure doesn’t disappear. But it does change jobs. It stops being the one making decisions and becomes the one carrying them out, intelligently.

If you can make that shift early, you’ll be ready for the wave when it comes. And it’s coming.

Sooner than most people think.

Featured Blog Posts

F5 accelerates and secures AI inference at scale with NVIDIA Cloud Partner reference architecture

Securing AI models and agents without compromise: How F5’s acquisition of CalypsoAI will deliver end-to-end AI runtime protection

Quantum ready: A practical guide to enabling PQC with F5

Tags: Office of the CTO, 2025, Automation, Infrastructure, Operations

About the Author

Lori Mac VittiePrincipal Evangelist

More blogs by Lori Mac Vittie

Featured Blog Posts

F5 accelerates and secures AI inference at scale with NVIDIA Cloud Partner reference architecture

Securing AI models and agents without compromise: How F5’s acquisition of CalypsoAI will deliver end-to-end AI runtime protection

Quantum ready: A practical guide to enabling PQC with F5

Related Blog Posts

F5 Ecosystem | 10/22/2024

At the Intersection of Operational Data and Generative AI

Help your organization understand the impact of generative AI (GenAI) on its operational data practices, and learn how to better align GenAI technology adoption timelines with existing budgets, practices, and cultures.

F5 Ecosystem | 12/19/2022

Using AI for IT Automation Security

Learn how artificial intelligence and machine learning aid in mitigating cybersecurity threats to your IT automation processes.

Office of the CTO,

2022

F5 Ecosystem | 07/19/2022

The Commodification of Cloud

Public cloud is no longer the bright new shiny toy, but it paved the way for XaaS, Edge, and a new cycle of innovation.

Office of the CTO,

2022

F5 Ecosystem | 02/24/2022

Most Exciting Tech Trend in 2022: IT/OT Convergence

The line between operation and digital systems continues to blur as homes and businesses increase their reliance on connected devices, accelerating the convergence of IT and OT. While this trend of integration brings excitement, it also presents its own challenges and concerns to be considered.

Office of the CTO,

2022

F5 Ecosystem | 10/05/2020

Adaptive Applications are Data-Driven

There's a big difference between knowing something's wrong and knowing what to do about it. Only after monitoring the right elements can we discern the health of a user experience, deriving from the analysis of those measurements the relationships and patterns that can be inferred. Ultimately, the automation that will give rise to truly adaptive applications is based on measurements and our understanding of them.

2020,

Office of the CTO

F5 Ecosystem | 12/23/2019

Inserting App Services into Shifting App Architectures

Application architectures have evolved several times since the early days of computing, and it is no longer optimal to rely solely on a single, known data path to insert application services. Furthermore, because many of the emerging data paths are not as suitable for a proxy-based platform, we must look to the other potential points of insertion possible to scale and secure modern applications.

2019,

Office of the CTO

AI Agents Moving Decision-Making Up the Stack

Why this matters (sooner than you think)

The shift: From enforcement to execution

Where this is coming from

What you can do now (before it’s a problem)

Final thought: You're not being replaced, you're being repositioned

About the Author

Related Blog Posts

At the Intersection of Operational Data and Generative AI

Using AI for IT Automation Security

The Commodification of Cloud

Most Exciting Tech Trend in 2022: IT/OT Convergence

Adaptive Applications are Data-Driven

Inserting App Services into Shifting App Architectures

WHAT WE OFFER

RESOURCES

SUPPORT

PARTNERS

COMPANY