When Amazon Web Services Inc. held its Recent York Summit last week, Vice President of Agentic AI Swami Sivasubramanian as usual was the headline act, delivering the opening keynote.
Sivasubramanian made the case to enterprise leaders that the unreal intelligence conversation has moved beyond pilots and productivity hacks right into a world where the actual advantage lies in compounding momentum across work, security, software delivery and data. For IT pros, meaning your architectural decisions over the following 12 to 18 months will determine whether AI agents grow to be a force multiplier or a brand new source of chaos.
Listed here are five big ideas from Sivasubramanian’s keynote and what they mean for those accountable for constructing and operating enterprise technology:
1. From ‘faster search bars’ to compounding agents
Sivasubramanian’s most important critique of the primary generation of AI assistants is that they never broke out of chat-window gravity. They sit on top of tools, answer an issue after which forget. “We gave them chat windows and connected them to our tools,” he said. “They answer one query, after which they forget. The promise was intelligence, but what we got was a rather faster search bar. Faster search doesn’t compound; it flatlines.”
The choice he laid out is an agentic model during which every accomplished task feeds the following. “What you actually need is agents that truly change the way in which you’re employed, not only speed up the steps, but completely eliminate them,” Swami argued. “If humans are still forced to be the orchestration layer, your momentum actually has a ceiling.” In his framing, “every task that their agents complete makes the following one smarter,” creating “compounding momentum” and widening the gap between early adopters and those that wait.
That’s the design center for Amazon Quick, an AI assistant that “states the consequence you would like and figures out tips on how to get there across all of your systems, all of your data and all of your context,” powered by a knowledge graph that reasons across people, documents, communications and data lakes. Within the live demo, Quick assembled a marketing report by pulling data from Slack, Google Drive and OneDrive in about 20 seconds — work, Sivasubramanian said, “would have taken probably hours of actual research” before.
Implications for IT pros: This model assumes your collaboration and data platforms are open to agent access and governed by strong identity and policy controls. The job shifts from selecting yet one more assistant to curating an ecosystem where agents can safely traverse silos. Connectors, metadata and policy enforcement grow to be as vital as model selection. This can be a vastly different role for IT pros, but one which’s critical for corporations that succeed with their agentic initiatives.
2. Security: Ending the ‘walled garden vs. wild garden’ tradeoff
On security, Sivasubramanian highlighted a dilemma many chief information security officers will face. On one side, “agents that work inside their very own walled garden only see what’s inside their very own productivity suite. The moment you wish something outside the wall, you’re back to being the orchestrator.” However, open tools “don’t offer the extent of security, compliance and governance that enterprises demand. You traded the walled garden for the wild one.”
“This can be a false selection,” he said. “Quick doesn’t ask you to decide on. No partitions, no copy-and-paste bridges, and each motion it takes carries its own governance. Who acted on it, what data they touched, where it went, and whether the policy allowed it.” That theme continues with AWS Continuum, a set of agent-driven security capabilities spanning penetration testing, threat modeling and code vulnerability assessment. Chet Kapoor, who leads security, observability, search and governance products, described the shift from “telemetry, storage, query and dashboards for humans” to “telemetry to context to reasoning to actions for agents.” Telemetry without context is “noise,” he said; with context, it becomes a “signal” agents can act on.
Customer stories were included to make the stakes concrete. Swami cited GoDaddy using Amazon Quick to eliminate “15,000 hours of manual work annually.” He also highlighted the NBA’s use of Quick to structure 25 years of prospect data into interactive leaderboards and comparisons.
Implications for IT pros: Security operations are headed toward agents taking actions under policy, not analysts looking at dashboards. That raises the importance of policy as code, identity boundaries, least-privilege design, and clear “rails” for where agents can operate. The conversation with the CISO is not any longer “Should we use AI?” but “What is going to we allow AI to do, and under what guardrails?”
3. Software delivery as a closed loop
If the primary wave of generative AI was about coding copilots, this keynote reframed the narrative around end-to-end software delivery loops. “Write it right, ship it fast, keep it modern – not three tools, one continuous loop, all the time running, all the time compounding,” he said. That loop is already in production at Amazon Stores, where teams behind the retail experience saw a “median 4.5x improvement in how briskly correct code reaches production, with some teams hitting as much as 17x,” and “AI-generated code changes landing with 95% accuracy, higher than the human baseline.”
Kiro is the engineering agent that anchors the “write it right” a part of the loop. You give it a prompt, and it generates “clear requirements, structured design docs, implementation tasks, and validated tests before a single line of code is generated.” It then uses agents and property-based testing to implement and confirm. Swami pointed to fintech startup Dhan, which needed to support greater than 170 complex trading indicators. Without agents, it estimated “over a dozen engineers in a period of 12 to 24 months;” with Kiro, “all this was built by a single engineer in only eight weeks.”
The loop extends into operations. AWS DevOps Agent began as an incident-response companion utilized by customers like T-Mobile and United Airlines; now AWS is adding release management. It will probably project production risk from a code change, explore an application reminiscent of an end user, rating releases, and feed its report “on to your coding agent to start out implementing those fixes mechanically.”
On the opposite side of the loop, AWS Transform moves from one-time modernization projects to “continuous modernization,” performing “continuous state evaluation and remediation at machine speed, all the time watching, all the time fixing across every code base you own.” AWS says customers have already used Transform to eliminate 1.6 million hours of manual modernization work.
Implications for IT pros: That is an opinionated pipeline: spec, code, test, release, modernize, repeat, with agents in each phase. To learn, enterprises might want to standardize how they organize their Git repositories, pipelines and quality gates so agents can act safely across services and to make a cultural shift that treats modernization and reliability work as continuous flows, not project-of-the-year initiatives.
4. Southwest Airlines: A playbook for a ‘modern fleet’ of systems
Essentially the most compelling customer story got here from Lauren Woods, executive vp and chief information officer at Southwest Airlines. She linked technology selections on to lessons from Winter Storm Elliott. “It wasn’t our systems that were failing, but they weren’t designed to maintain up with the pace and the extent of complexity happening across the operation suddenly,” she said. To run like a contemporary airline, “we’d like technology that operates like a contemporary fleet.”
Southwest selected AWS as its preferred cloud partner for a “secure, scalable foundation” and access to innovation. Regarding AI, Woods said she uses Amazon Quick every single day, describing a shift from “ data after the actual fact to interacting with it in real time” across fare and revenue evaluation and call center behavioral trends. The impact has been faster decisions, closer to the purpose of motion.
For engineering, Southwest scaled Kiro to “greater than 2,700 developers, about two-thirds of our engineering organization,” using it for unit test generation, infrastructure as code, and faster onboarding. The Southwest.com platform, which is mission-critical and built on legacy architecture, had an extended modernization roadmap. Using Kiro, “our teams have accelerated that modernization significantly, pulling the unique timeline in by three years,” Lauren said. “We’re making it easier to construct on, evolve and scale as our business changes.”
Implications for IT pros: Southwest is a superb case study. AI-augmented decision-making across the business, agents embedded within the SDLC at scale, and modernization and transformation running in parallel. It’s also a reminder that the important thing performance indicator for AI initiatives will increasingly be operational resilience and customer satisfaction, not only developer productivity.
5. Agent platforms: Harness, guardrails and context as first-class primitives
The ultimate act of the keynote shifted from AWS-built agents to the agents that customers will construct themselves. Sivasubramanian noted that “the agents that can matter essentially the most are those for what you are promoting that only you’ll be able to create,” but many are “stuck between prototype and production” because teams are re-implementing basics: authentication, memory, tool access, security and governance.
Amazon’s answer is AgentCore, which provides “core components to construct agents” and features a managed runtime, built-in identity, session memory, observability, evaluations and access controls. It’s designed to work with any agent framework and model. Over the past six months, Swami said, “the variety of tasks performed by agents in AgentCore has grown by 15x,” and customers reminiscent of PGA TOUR, Nasdaq and Visa are constructing production agents in weeks as a substitute of months.
Two concepts are vital here. First, the harness. Sivasubramanian described the model because the “brain” and the harness because the “body” that gives “state persistence, error recovery, context management, [and] session isolation.” AgentCore Harness can turn a model into an agent in minutes with three application programming interface calls. Second, Agent Core Policies define what agents can and can’t do and are enforced “outside the agent’s code, where the agent can’t bypass it,” including detection of prompt attacks, harmful content, and sensitive data. AWS plans to ingest signals from third-party security providers into that policy layer.
Underpinning that is context. AWS Context mechanically builds a knowledge graph across structured and unstructured data and exposes it to agents at runtime. Swami identified that inside Amazon, the semantic knowledge store behind Q processes “over 1.8 million requests” per day, mapping business semantics (“escalations” vs. “tickets”) and relationships across systems. Within the enterprise, that graph spans public web data via managed search tools, organizational content in S3, SharePoint, Confluence, and Google Drive, and structured data in lakes and warehouses.
Implications for IT pros: That is the AI platform north star: an agent runtime/harness, a policy and guardrail layer outside prompts, and a governed context service — often graph-based — that encodes how what you are promoting works. Whether you adopt AWS’ stack or assemble your personal, success will come down less to prompt engineering and more to how well you design skills, policies and knowledge graphs that reflect your domain.
Final thoughts
Sivasubramanian’s core point is that agents aren’t a feature toggle but an architectural selection. The advantage goes to organizations that design for compounding momentum across work, security, software delivery and data, reasonably than to those who simply turn on Amazon Quick, Kiro or DevOps Agent.
For information technology leaders, meaning treating agent access, guardrails and context as platform services, embedding AI more deeply in delivery and operations, and copying the Southwest playbook: Start with a high-impact domain, align business and engineering on outcomes, and let agents handle the undifferentiated heavy lifting while your teams deal with domain-specific decisions.
Zeus Kerravala is a principal analyst at ZK Research, a division of Kerravala Consulting. He wrote this text for SiliconANGLE.
Photo: Zeus Kerravala
Support our mission to maintain content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.
- 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
- 11.4k+ theCUBE alumni — Connect with greater than 11,400 tech and business leaders shaping the longer term through a singular trusted-based network.
About SiliconANGLE Media
Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our latest proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to assist technology corporations make data-driven decisions and stay on the forefront of industry conversations.

