Google I/O 2026, Gemini AI arrives with Gmail Live, Docs Live and Keep: the new voice functions

Credit: Google

Google announced during the Google I/O 2026 conference a major update to its Workspace ecosystem, introducing Docs Live and Gmail Live, conversational tools based on Gemini artificial intelligence that allow you to create, edit and interact with documents, notes and emails using only your voice. The Mountain View giant has shown – in addition to the new agent functionality of Gemini Spark – other tools such as voice interaction is evolving well beyond simple dictation, transforming into a dynamic and multi-step collaboration with AI.

The new features, along with a major overhaul of Google Keep, allow you to issue complex, structured commands in natural language. The system is able to manage changes of direction in speech and corrections on the fly, integrating data from different sources such as Drive or the Web.

The new conversational features of Gmail, Docs and Keep will initially be available in preview for Google Workspace enterprise customers, before rolling out globally to users enrolled in the Google AI Pro and Ultra plans.

Beyond dictation: what’s new with Docs Live

The goal declared by Google CEO, Sundar Pichai, is to allow the complete creation and modification of entire documents without touching the screen and keyboard. With Docs Live, for example, you don’t just convert speech into text: the system acts as a real editorial assistant, collecting ideas, organizing the logical structure of the text and, if authorized by the user, safely drawing on external sources. In practical demonstrations, it was shown how AI can extract data from a CV saved on Drive, retrieve the logistical details of an event from an email and combine everything into a coherent draft, refining style and syntax.

This quantum leap is made possible by advances in natural language processing, or NLP, models (Natural Language Processing), which allow systems to understand the structure and context of human sentences. Google’s new algorithms analyze the global context and manage so-called “changes of intention” in mid-sentence: if during speech you decide to change a detail that has just been mentioned, the AI ​​recognizes the self-correction and updates the request without getting confused.

Commenting on the introduction of Docs Live, Yulie Kwon Kim, Vice-President of Product at Google Workspace, explained:

Docs Live serves as a thought partner and co-author to help you get a first draft faster using just your voice. You can just talk, and Docs Live does the heavy lifting – organizing your thoughts, structuring the document, and, with your permission, securely retrieving relevant details from Gmail, Drive, and the web.

Keep and Gmail Live, the mail that responds by voice and searches through emails

This fluidity also extends to email management via Gmail Live, where voice interaction becomes a tool for immediate search and synthesis of information. It is possible to ask explicit questions to trace data buried among thousands of messages – the time of an appointment, the code of a reservation, the gate of a flight – obtaining a practically instantaneous response.

Keep, Google’s famous notes and notes app, also takes advantage of this technology, in a segment where apps like Voicenotes or AudioPen had already paved the way. When recording a disorganized thought, the AI ​​does not simply generate a literal transcription, but reworks the text in the background, structuring it autonomously into bulleted lists or schematic notes. This innovation comes alongside the recent release of Rambler, the voice typing system integrated into the Gboard keyboard and compatible with various apps.