Archives

Google Expands Gemini Capabilities with Advanced Deep Research Agent and Interactions API

Google

Google announced a major expansion of its AI platform with the launch of an enhanced Gemini Deep Research agent accessible through the new Interactions API. This update empowers developers to integrate advanced autonomous research functions directly into their own applications, enabling richer, more comprehensive insights drawn from multi-step web exploration and synthesis.

The upgraded Deep Research agent leverages Gemini 3 Pro, Google’s most factual reasoning model to date, and is optimized to deliver highly accurate, context-rich research outputs across complex tasks. It also debuts DeepSearchQA, an open-source benchmark designed to measure the comprehensiveness and depth of agentic search results.

Transformative Research Intelligence for Developers

Google describes the Deep Research offering as an agent capable of autonomously planning, executing, and synthesizing extensive research activities. Through iterative querying, reading, gap identification, and refined search techniques, the agent navigates deep into the web to generate reports that are validated against benchmarks such as Humanity’s Last Exam, DeepSearchQA, and BrowseComp.

The agent’s performance represents a significant advance in AI-driven research:

  • It achieves state-of-the-art results on benchmark tasks, including strong scores on Humanity’s Last Exam (HLE), DeepSearchQA, and BrowseComp.
  • It produces well-researched, evidence-backed reports at lower cost than prior models.
  • It will soon be integrated across Google Search, NotebookLM, Google Finance, and the enhanced Gemini app.

Also Read: IBM Launches watsonx.governance 2.1 to Revolutionize AI Governance for Enterprises

“Build with Gemini Deep Research”

Google positions the Deep Research agent as a powerful tool for next-generation research applications. Its key developer-centric capabilities include:

  • Unified Information Synthesis – Combines user documents (PDFs, CSVs, Docs) and public web data to produce aggregated insights.
  • Report Steerability – Enables users to define structure, headings, and output format through prompt controls.
  • Detailed Citations – Provides precise sourcing to verify claims and data origins.
  • Structured Output Support – Offers JSON schema outputs for seamless integration into other systems.

Interactions API: Simplifying Advanced Agent Integration

The Interactions API serves as the unified interface for both models and agents, enabling developers to work with complex, stateful AI interactions and long-running agentic workflows. With this API, developers can:

  • Call the Deep Research agent using a single REST endpoint.
  • Manage asynchronous tasks and background execution for long-form research.
  • Build interactive, agentic apps that go beyond simple prompt-and-response patterns.

Developers can begin building with the Deep Research agent via their Gemini API key on Google AI Studio, with detailed documentation available for setup and implementation.

Driving Innovation Across Industries

Early use cases demonstrate that Deep Research is already making an impact in fields such as finance, biotechnology, and market research particularly for tasks that involve detailed due diligence, competitor analysis, and scientific literature exploration.

By integrating agentic reasoning with web search and document analysis, Google’s expanded Gemini platform further accelerates the development of tools capable of deep analytical tasks once reserved for human research teams.