Main content

From Automation to Decisions: Autonomous Networks in the AI Supercycle

Abstarct image depicting decisions

The moment that changes everything in network operations is not dramatic. Something starts to drift. A capacity threshold edges over. A latency tail starts to fatten. By the time a ticket opens, the condition has already impacted operations, customers, and potentially revenue.

This is the gap that automation cannot close, not because automation has failed, but because the operating environment has fundamentally changed. For decades, automation gave us scripts, workflows, orchestration, zero-touch provisioning, closed loops, and increasingly intelligent assurance. It helped operators reduce manual effort, accelerate repetitive tasks, and improve consistency across large, complex networks. That work remains essential. But in the AI supercycle - the current era defined by the rapid industrialization of AI across infrastructure, enterprise, and physical systems - automation is no longer the full ambition.

AI is not simply adding another traffic curve to the network. It is changing the shape of traffic itself, and with it, the demands placed on the network. Traffic is becoming more interactive, bursty, and uplink-heavy, driven by real-time inference at the edge and agent-to-agent communication patterns that look nothing like prior workload profiles. Inference is becoming distributed; model placement, edge inference, and agent coordination are now part of the operational workload, not just applications running over the network. And the role of the network itself is changing, from a passive carrier of intelligence to an active host, coordinator, and producer of it.

AI will not just run on networks. It will also be defined by them

Once that happens, the central question changes. It is no longer only “what tasks can the network automate?” It becomes “what decisions can the network make; how well, how fast, across which systems, with what authority, with what governance, and with what ability to explain, secure, bound, and reverse the outcome?”

Answering that question is what makes a network autonomous.

Autonomy is not a choice

Autonomy is becoming mandatory because the operating environment has outpaced the operational model built to manage it.

The first driver is pace. Network conditions increasingly change faster than human operational cycles. Automation can execute a predefined response, but it cannot reason about conditions it was not designed to anticipate.

The second driver is scale. Network complexity is now beyond manual intervention. Hand-correlation across dashboards cannot keep up with the number of domains, devices, services, policies, telemetry streams, and customer-impacting dependencies that must be held in mind at once.

The third driver is coupling. Effects increasingly span systems. A local optimization can create instability somewhere else. Siloed actions may solve the immediate problem while amplifying the systemic one. A customer-impacting event may begin in access, surface in service experience, depend on transport conditions, be amplified by cloud placement decisions, and require coordinated action across multiple operational systems, none of which share a common view of the problem.

The question is not whether networks will become more autonomous. They will. The harder question is what autonomy actually means, and what must be built to make it real.

Automation was the path. Decisions are the destination

The distinction between automation and autonomy matters, and it is often collapsed in ways that obscure what is actually required.  Automation executes a known task. It follows predefined logic. It reduces human effort within a known operating envelope. It is extremely valuable, but it is not the same as autonomy. Autonomy begins when the system can adapt to changing conditions, reason through unexpected events, and determine the right course of action within defined goals and constraints. A network can automate the provisioning of a service, an alarm workflow, a configuration push, or a closed loop within a specific operating area. But an autonomous network must do more.

It must understand intent. It must reason across systems. It must evaluate tradeoffs. It must decide when to act, when to wait, when to escalate, when to roll back, and when a local optimization creates a broader system risk.

This is why higher levels of autonomy cannot be reduced to “more automation.” Higher autonomy means the system can make more decisions well - faster, across more systems, with governance that operators can trust. This reframing matters architecturally. Autonomous Networks should not be viewed only as a maturity ladder, though the ladder is important as a shared language for progress from manual operation toward full autonomy. The ladder describes the destination. It does not, by itself, define what must be built to get there. The operating system is the road. 

The operating system for the AI-native network

When we say operating system, we do not mean a device OS or a software product in the traditional sense. We mean the architectural contract through which humans, systems, agents, models, data, policies, and operational tools interact. Every major computing transition has created a new operating system abstraction, and each has changed the operator’s contract. The shift from mainframe to personal computing moved the contract from batch job submission to direct application interaction. The shift from physical servers to cloud-native infrastructure moved the contract from managing individual machines to declaring desired state across clusters. In knowledge work, we are now moving from managing individual applications for each task to expressing intent to AI systems that act across tasks.

In network operations, the current model is still largely organized around per-system management: EMS, domain controllers, OSS/BSS, and operational tools that expose different contracts for different parts of the network. Autonomous Networks change that contract fundamentally. The operator no longer moves from device to device, dashboard to dashboard, system to system. The operator expresses intent, defines outcomes, sets policy boundaries, and governs autonomous decisions across the full network stack. The previous operating system becomes substrate. That is the architectural significance of Autonomous Networks. They are not a collection of automation use cases. They are the operating system of the AI-native network.

The unit of value is the decision

Across every domain moving toward autonomy, the unit of value is the same. It is not the data product, the model, the agent, the workflow, or the closed loop in isolation. It is the decision. A decision is where context, intent, data, policy, authorization, confidence, trust, and action come together.

This is also why the AI-native network cannot be an AI assistant layered on top of legacy dashboards. That may help with summarization or troubleshooting. It does not create autonomy. Autonomy requires an operating system in which decisions are represented, evaluated, coordinated, executed, secured, and governed.

Eight measures of operational autonomy

If the decision is the unit of value, operational autonomy needs new measures.

Decision rate is how many decisions the operating system can make per unit of time. AI-native networks will operate at a pace that human operational cycles cannot match.

Decision quality is how often the chosen decision is the right one. A fast wrong decision is not autonomy. It is automated instability.

Decision blast radius is the bounded scope of effect when a decision is wrong. It defines which resources, services, customers, and adjacent outcomes a decision is allowed to affect, and how quickly the impact can be contained or reversed.

Decision reversibility is whether a decision can be safely rolled back. Autonomy without reversibility will not earn operational trust.

Decision lineage is whether the system can trace why a decision was made, using which data, which model, which agent, which policy, which authorization, and which intent.

Decision coupling is whether the decision remains coherent across related systems. A radio optimization can affect transport. A transport action can affect service experience. A cloud placement decision can affect latency, energy, and sovereignty. In an AI-native network, local decisions increasingly create broader consequences.

Decision security is whether the decision is authorized, integrity-preserved, and executed within least-privilege bounds. In autonomous operations, security is not only about protecting the network from external attack; it is also about ensuring that autonomous action itself cannot exceed its authority.

Decision trustworthiness is the system’s calibrated confidence in the decision. It is not a vague belief that the system is “smart.” It is the disciplined ability to know when a decision is high confidence, when it requires simulation, when it needs human approval, and when it should not be taken at all. A fast decision that cannot be trusted can be more disruptive than a slower decision that is explainable, authorized, bounded, and reversible.

Autonomous Network maturity is not a statement about how much automation exists. It is a statement about how well the operating system makes, secures, governs, and improves decisions.

Telecom is the hardest instance of the pattern

Telecom is not late to autonomy because it lacks ambition. It is hard because it combines constraints that most other industries face only in part.

Consider a representative scenario: a customer-impacting degradation begins as a capacity anomaly in the radio access layer. It surfaces in service experience monitoring as elevated latency. It is partially masked by a transport path that is already operating near threshold. And it cannot be resolved by any single domain acting alone. It requires coordinated action across radio, transport, and service operations, each governed by different vendors, different policies, and different operational contracts.

A local closed loop may make the right decision within its domain and still produce the wrong outcome for the customer. This is the heart of the Autonomous Networks challenge: decision-making across dynamically changing, deeply coupled systems under carrier-grade guarantees.

Carrier-grade autonomy means the network must meet availability, determinism, resilience, regulatory accountability, lawful operations, and customer trust obligations, not as aspirational properties, but as non-negotiable constraints on every autonomous decision.

The open frontier is not more automation within existing silos. It is decisions that hold across silos, under those constraints, at the pace AI-native operations demand.

Inside the operating system

The operating system for Autonomous Networks requires several essential capabilities working in combination. It starts with a common substrate: data, compute, model serving, agent runtime, and governance primitives that operate across systems and vendors. Built on that, it needs an ontology: a semantic and operational representation of the network that captures not just topology but services, customers, intents, SLOs, faults, policies, configurations, and dependencies. It needs agents that can reason over context, invoke tools, and act within explicit boundaries: observers, advisors, actuators, and coordinators. It needs expert models that understand the consequences of decisions across the full network stack. And it needs intent as the decision contract: the definition of desired outcomes and the constraints under which the system is allowed to act.

Across all of this, it needs Glass Box governance. Where a black box AI produces an outcome with no explanation, a Glass Box system can show what data triggered a decision, what policy bounded it, what authorization approved it, and what rollback is available if conditions change. Glass Box governance is what makes decisions safe enough to deploy, precise enough to audit, and bounded enough to trust. Trust is not a feeling operators have about autonomy. It is a property the system must continuously prove.

From network operations to decision operations

This shift changes how we think about network operations as a discipline. Traditional network operations organized around alarms, tickets, dashboards, workflows, and manual escalation. Autonomous Networks organize around decisions. And the operational design challenge is knowing which decisions belong where. Which decisions should remain human-led because the blast radius is too high, the security boundary is sensitive, or regulatory accountability requires a human in the loop? Which decisions can be system-recommended, with the operator approving before execution? Which can be system-executed under policy, with human oversight at the governance layer? And which decisions should not be made at all, because the system’s confidence is not yet calibrated?

The goal is not to remove humans from the network. The goal is to move human attention to the right layer, from operator of every task to architect and governor of the decision system.

The proving ground for AI-native infrastructure

Autonomous Networks are more than a telecom challenge. They are the proving ground for a broader shift in how AI-native infrastructure is governed.

AI-Grids will need decisions across energy, compute, memory, workload placement, data movement, and connectivity. AI-Factories will need decisions across compute, power, cooling, networking, storage, data pipelines, model lifecycle, security, and operational resilience. Sovereign AI estates will need decisions across national-scale infrastructure, data governance, models, jurisdictions, security, and operational accountability.

Different domains will require different expert models, different policies, and different safety constraints. But the architectural shape will be familiar: ontology, data products, expert models, agents, intent, closed loops, and governance. Solving this problem in telecom, where the network already spans domains, vendors, geographies, customers, services, and regulatory regimes creates the operating logic for adjacent AI-native infrastructure. 

Networks will compete on decisions

The next phase of network competition will not be defined only by speed, coverage, capacity, or cost. Those remain essential. But they will not be sufficient. As networks become more dynamic, more intelligent, more distributed, and more tightly coupled to the physical world, differentiation will increasingly come from the quality of the decisions the network can make. Can the network detect earlier? Can it reason across systems? Can it act faster? Can it contain mistakes? Can it secure autonomous action? Can it explain what happened? Can it reverse course safely? Can it know when it is confident, and when it is not? Can it learn from each decision? Can it coordinate local actions into global outcomes?

The operators, and the vendors who serve them, that master the science and governance of decisions will define what the network is capable of next. Networks will still compete on performance. But increasingly, they will also compete on decisions. At Nokia, this is the operating system we are building with the operators we serve: one that turns autonomy from more automation into trusted, governed decisions.
 

Pallavi Mahajan

About Pallavi Mahajan

Pallavi Mahajan, Nokia’s Chief Technology and AI Officer, leads Nokia Bell Labs, Technology and AI Leadership, and Group Security to drive innovation in core technologies, strengthen AI and security capabilities, and create differentiation through open ecosystems and strategic partnerships. With deep expertise in networks, software, and AI, she has scaled multi-billion-dollar portfolios and shaped industry-defining shifts at Intel, HPE, and Juniper Networks. An inventor on multiple patents and a passionate advocate for women in tech and grassroots sports, Pallavi champions collaboration to unlock the next wave of growth.

Connect with Pallavi on LinkedIn

Oğuz Sunay

About Oğuz Sunay

Oğuz is CTO, AI & Autonomous Networks, Nokia, with 25+ years of experience at the intersection of AI, edge cloud, and wireless networking. His work bridges research and real-world deployment at scale, from architecting Intel’s data center AI stack to co-founding Ananki (acquired by Intel) and serving as co-PI on the DARPA Pronto program. He is the sole inventor on 40+ awarded patents, co-author of two technical books on 5G and edge systems, and a contributor to 3GPP and O-RAN.