AI agents are transforming industries by automating workflows, enhancing productivity, and enabling intelligent decision-making. Businesses are using AI agents to process insurance claims, manage IT service desks, optimize supply chain logistics, and even assist healthcare professionals in analyzing medical data. The potential is vast, and we’re excited to introduce two powerful innovations in Azure AI Foundry:
- Responses API: A powerful API that enables AI-powered applications to retrieve information, process data, and take action seamlessly.
- Computer-Using Agent (CUA): A breakthrough AI model that navigates software interfaces, executes tasks, and automates workflows.
Together, these capabilities empower businesses to reimagine AI not just as an assistant but as an active digital workforce. Enterprise customers will soon gain access to these innovations, driving automation, efficiency, and intelligence at scale.
Enhancing AI Agents with the Responses API
The Responses API is the key to unlocking agentic AI in Azure AI Foundry, transforming how enterprises harness AI for real-world impact. It’s the new foundation for using Azure OpenAI Service’s powerful built-in tools, combining the simplicity of the Chat Completions API with the advanced capabilities available through the Assistants API and Azure AI Agent Service. The Responses API enables seamless interaction with tools like CUA, code interpreter, function calling, and file search, all in a single API call. It lets AI systems retrieve data, process information, and take actions, connecting agentic AI with enterprise workflows.
How the Responses API Works
The Responses API provides a structured response format that allows AI to interact with multiple tools while maintaining context across interactions. It supports:
- Tool calling in a single API call: Developers can now integrate AI tools seamlessly, making execution more efficient.
- Computer use: Use the computer use tool within the Responses API to drive automation and execute software interactions.
- File search: Interact with enterprise data dynamically and extract relevant information.
- Code interpreter: Create and execute Python code effortlessly within AI-powered applications.
- Function calling: Develop and invoke custom functions to enhance AI capabilities.
- Chaining responses into conversations: Keep track of interactions by linking responses together using unique response IDs, ensuring continuity in AI-driven dialogues.
- Enterprise-grade data privacy: Built with Azure’s trusted security and compliance standards, ensuring data protection for organizations.
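To make the idea of chaining responses by ID concrete, here is a minimal, self-contained Python sketch. It does not call the Responses API. `ResponseStore`, `StoredResponse`, the `resp_N` ID format, and the echoed output are all hypothetical stand-ins chosen for illustration. It only shows how linking each response to the ID of the previous one lets a conversation be reconstructed from its latest response.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class StoredResponse:
    """One turn in a conversation (hypothetical record, not the service schema)."""
    id: str
    user_input: str
    output_text: str
    previous_response_id: Optional[str] = None


class ResponseStore:
    """In-memory stand-in for response storage (an assumption for this sketch)."""

    def __init__(self) -> None:
        self._responses: Dict[str, StoredResponse] = {}
        self._next_number = 1

    def create(self, user_input: str,
               previous_response_id: Optional[str] = None) -> StoredResponse:
        if (previous_response_id is not None
                and previous_response_id not in self._responses):
            raise KeyError(f"unknown response id: {previous_response_id}")
        # A real service would call a model here; this sketch echoes the input.
        response = StoredResponse(
            id=f"resp_{self._next_number}",
            user_input=user_input,
            output_text=f"echo: {user_input}",
            previous_response_id=previous_response_id,
        )
        self._next_number += 1
        self._responses[response.id] = response
        return response

    def conversation(self, response_id: str) -> List[StoredResponse]:
        """Follow the chain back from response_id; return turns oldest first."""
        turns: List[StoredResponse] = []
        current: Optional[str] = response_id
        while current is not None:
            turn = self._responses[current]
            turns.append(turn)
            current = turn.previous_response_id
        turns.reverse()
        return turns


store = ResponseStore()
first = store.create("Summarize the open claims.")
second = store.create("Now list only the urgent ones.",
                      previous_response_id=first.id)
history = store.conversation(second.id)
print([turn.user_input for turn in history])
```

In this sketch the caller only needs to keep the most recent response ID; the earlier turns are recovered by following the links.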
By consolidating retrieval, reasoning, and action execution into a single API, the Responses API simplifies AI agent development, reducing the complexity of orchestrating multiple AI tools within an automation pipeline.
This scalability makes it well suited to enterprise use cases across industries such as customer service, IT operations, finance, and supply chain management, where AI-powered automation can streamline workflows and improve efficiency. For even greater flexibility and control, organizations can explore Azure AI Agent Service, which offers additional tools and models for building and scaling AI agents. Azure AI Agent Service integrates with Semantic Kernel and AutoGen, enabling multi-agent orchestration for more complex scenarios that require multiple agents to collaborate on tasks.
Empowering AI Agents with the Computer-Using Agent
The Computer-Using Agent (CUA) is a specialized AI model in Azure OpenAI Service that allows AI to interact with graphical user interfaces (GUIs), navigate applications, and automate multi-step tasks, all through natural language instructions. Unlike traditional automation tools that rely on predefined scripts or API-based integrations, CUA can interpret visual elements, adapt dynamically, and take action based on on-screen content.
What makes the Computer-Using Agent unique?
- Autonomous UI navigation: Can open applications, click buttons, fill out forms, and navigate multi-page workflows.
- Dynamic adaptation: Interprets UI changes and adjusts actions accordingly, reducing reliance on rigid automation scripts.
- Cross-application task execution: Operates across web-based and desktop applications, integrating disparate systems without API dependencies.
- Natural language command interface: Users can describe a task in plain language, and CUA determines the correct UI interactions to execute.
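The observe, decide, act cycle implied by these capabilities can be sketched in a few lines of Python. This is an illustration under loud assumptions, not how CUA is implemented: `FakeForm` simulates a UI, `propose_action` is a scripted stand-in for the model, and the `Action` type and its `kind` values are hypothetical. The point it shows is that re-observing the screen before every step lets the agent choose the next action from what is actually displayed instead of replaying a fixed script.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Action:
    kind: str          # "type", "click", or "done" (hypothetical action kinds)
    target: str = ""
    text: str = ""


@dataclass
class FakeForm:
    """Simulated UI with two text fields and a submit button."""
    fields: Dict[str, str] = field(
        default_factory=lambda: {"name": "", "email": ""})
    submitted: bool = False

    def observe(self) -> Dict[str, object]:
        # Stand-in for a screenshot: a snapshot of what is on screen.
        return {"fields": dict(self.fields), "submitted": self.submitted}

    def apply(self, action: Action) -> None:
        if action.kind == "type":
            self.fields[action.target] = action.text
        elif action.kind == "click" and action.target == "submit":
            self.submitted = True
        else:
            raise ValueError(f"unsupported action: {action}")


def propose_action(observation: Dict[str, object],
                   goal: Dict[str, str]) -> Action:
    """Scripted stand-in for the model: choose the next step from the screen."""
    shown = observation["fields"]
    for name, wanted in goal.items():
        if shown.get(name) != wanted:
            return Action("type", target=name, text=wanted)
    if not observation["submitted"]:
        return Action("click", target="submit")
    return Action("done")


def run_agent(ui: FakeForm, goal: Dict[str, str],
              max_steps: int = 10) -> List[Action]:
    """Observe, decide, act, and repeat until done or the step budget runs out."""
    trace: List[Action] = []
    for _ in range(max_steps):
        action = propose_action(ui.observe(), goal)
        if action.kind == "done":
            break
        ui.apply(action)
        trace.append(action)
    return trace


form = FakeForm()
trace = run_agent(form, {"name": "Ada", "email": "ada@example.com"})
print([a.kind for a in trace], form.submitted)
# prints: ['type', 'type', 'click'] True
```

The `max_steps` budget is a design choice in this sketch: it bounds how long the loop can run if the stand-in planner never reaches its goal.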
With today’s announcement, developers can start building additional agentic capabilities directly with CUA. As enterprises look to deploy this technology at scale, we’re evaluating integration with Windows 365 and Azure Virtual Desktop to enable CUA automation to run seamlessly in a managed host environment on Cloud PCs or virtual machines (VMs), ensuring consistent performance while maintaining enterprise compliance and security standards.
Ensuring secure and trustworthy AI automation
As AI systems become more autonomous, ensuring security, reliability, and alignment with human intent is critical. The CUA model is one of the first agentic AI models capable of directly interacting with software environments, bringing new challenges in misuse prevention, unintended actions, and adversarial risks. To address these, Microsoft and OpenAI have implemented a multi-layered safety approach spanning the model, system, and deployment levels.
The CUA model is developed with safeguards to refuse harmful tasks, reject unauthorized actions, and prevent misuse. At the system level, Microsoft implements enterprise-grade content filtering and execution monitoring to help detect and prevent policy violations. To minimize unintended actions, CUA is designed to request user confirmations before executing irreversible tasks and to restrict high-risk actions such as financial transactions.
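A confirmation gate of this kind might look like the following Python sketch on the application side. It is an assumption-laden illustration, not the safeguard built into CUA: the category names, `ProposedAction`, and `gate` are hypothetical. Actions in a blocked category are refused outright, irreversible ones run only if a confirmation callback approves, and everything else proceeds.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical policy: these category names are illustrative only.
BLOCKED_CATEGORIES = {"financial_transaction"}
IRREVERSIBLE_CATEGORIES = {"delete_record", "send_email"}


@dataclass(frozen=True)
class ProposedAction:
    category: str
    description: str


def gate(action: ProposedAction,
         confirm: Callable[[ProposedAction], bool]) -> str:
    """Return 'blocked', 'declined', or 'executed' for a proposed action."""
    if action.category in BLOCKED_CATEGORIES:
        # High-risk actions are never run, regardless of confirmation.
        return "blocked"
    if action.category in IRREVERSIBLE_CATEGORIES:
        # Irreversible actions need explicit approval from the user.
        return "executed" if confirm(action) else "declined"
    return "executed"


def always_yes(action: ProposedAction) -> bool:
    return True


def always_no(action: ProposedAction) -> bool:
    return False


print(gate(ProposedAction("financial_transaction", "wire funds"), always_yes))
print(gate(ProposedAction("delete_record", "delete claim"), always_no))
print(gate(ProposedAction("delete_record", "delete claim"), always_yes))
print(gate(ProposedAction("read_page", "open dashboard"), always_no))
```

In a real deployment the `confirm` callback would prompt a person; here it is a plain function so the decision logic can be exercised without user input.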
Microsoft’s Trustworthy AI framework further ensures real-time observability, logging, and compliance auditing for enterprise deployments. Automated and human-in-the-loop detection systems monitor execution patterns, identifying anomalous behaviors and enforcing governance policies. These safeguards are continuously refined based on internal red-teaming, external audits, and real-world testing to strengthen protection against prompt injections, adversarial manipulations, and unauthorized access. Given the current reliability level of the CUA model, particularly in non-browser environments, human oversight remains strongly recommended for sensitive operations.
As AI agents evolve, Microsoft is committed to transparency, security, and ongoing risk mitigation. By combining CUA’s built-in safeguards with Azure’s enterprise compliance and governance tools, organizations can deploy AI-powered automation with confidence, ensuring safe and responsible AI adoption at scale.
Getting started with CUA and Responses API
Azure AI Foundry continues to push the boundaries of AI-powered automation. Enterprise customers will gain access to the Responses API and CUA in Azure OpenAI Service in the coming weeks.
We’re excited to see how developers and businesses innovate with these new capabilities.