ChatGPT launches new voice mode integrated into chat

  • ChatGPT's new voice mode is integrated directly into the chat window, without switching to a separate screen.
  • Users can speak, read the real-time transcript, and view images, maps, or other visual elements simultaneously.
  • The feature is available on web, Android and iOS, with the option to revert to the previous "Separate Mode" from the settings.
  • Advanced voice mode improves naturalness, reduces latency, and offers more personal voices supported by models such as GPT-5.1.

ChatGPT voice mode in chat

The way of speaking with ChatGPT It has just changed significantly. OpenAI has decided to simplify access to one of its most used features, the voice modewhich is now integrated directly into the same chat screen as always, without needing to jump to another view or open parallel interfaces.

With this update, the OpenAI assistant combines text and voice conversation into a single experience. Users can talk to the AI ​​while watching the screen real time transcriptionIn addition to maps, images, or other visual elements that the system displays depending on the context of the query, everything happens within the same thread, in a more fluid and natural way.

What changes with the new integrated voice mode

New voice mode integrated into ChatGPT

Until now, to use voice with ChatGPT on mobile or web, you had to switch to a stand-alone interfaceTapping the corresponding icon took us to a screen dominated by the classic blue orb, focused solely on audio. It was immersive, but it required switching environments every time we wanted to review previous messages or check something visual.

That functionality is outdated. With the new version, when you touch the wave icon Located next to the text input bar, ChatGPT activates voice mode within the chat itself. The user can continue viewing the entire message history while simultaneously initiating a spoken conversation with the assistant without leaving the main window.

During the conversation, the system displays a live transcription of what the user says and the AI's responsesThe idea is that the experience should be more like a face-to-face conversation, but with the added benefit of having a written record and being able to return to any point effortlessly.

In addition to text, the interface can incorporate real-time visual contentThis includes AI-generated images, screenshots, web page snippets, or maps, depending on the question asked. This way, you don't need to leave voice mode to see relevant visual information while continuing to speak to the assistant.

Another practical detail is that it can switch between writing and speaking at any timeEven when voice mode is active, if the user types a question, the answer can still be delivered in voice format, maintaining the continuity of the conversation.

Advanced voice mode: more natural, faster, and personal

Advanced ChatGPT Voices

Interface integration doesn't happen in isolation. OpenAI has taken the opportunity to introduce Improvements to advanced voice modeIts most sophisticated option for real-time spoken conversations. This mode offers more natural-sounding voices, with intonations closer to a person and a certain ability to convey emotional nuances.

According to the company, the AI ​​models have been adjusted to reduce latency and make the conversation smootherThe assistant can respond in just a few hundred milliseconds, approaching the speed of a conversation between two people. The goal is to reduce the feeling of interacting with a machine and lessen friction in daily life.

OpenAI has also incorporated support in this context for more recent models, such as GPT-5.1This allows for better control of voice tone, adaptation of response style, and management of more complex dialogues without disrupting the flow of the conversation. For those who use ChatGPT as a work, study, or personal assistant, this improvement can make all the difference.

In practice, this translates into The available voices are less robotic and more pleasant to listen to for extended periods. Although this approach to "humanizing" AI has received criticism in some specialized circles, OpenAI argues that it helps people feel more comfortable and makes the interaction less impersonal.

It's important to know that There are two levels of voice experienceOn one hand, there's the free, standard mode, which already allows voice chats and is available to everyone. On the other hand, there's the advanced mode, with more powerful audio capabilities and additional voices, which is accessible to those with paid plans like ChatGPT Plus, Pro, or Teams. In both cases, the chat integration is the same.

How to activate or deactivate the new voice mode in ChatGPT

Accessing the new functionality is quite simple. In the ChatGPT application, both in Android, iOS, and web versionSimply make sure you have the app updated. Once you do, a wave or speech bubble icon will appear on the right side of the message bar.

Pressing it immediately activates the voice conversation within the chat itselfThe user can start speaking and see how the AI ​​displays its responses in real time, in text form and, when appropriate, with images, maps, or other visual resources. There are no additional steps to accept or need to navigate to another menu.

If at any point you prefer to rewrite, you can press the same button again or simply start typing. While voice mode is on, even if you send text messages, the assistant can still reply verbally, maintaining the "hands-free" experience if desired.

For those who aren't entirely comfortable with this change, OpenAI offers a way to revert. Within the app's settings, in the section dedicated to voice mode, you can activate the setting called "Separate mode"By doing so, the tool reverts to its previous behavior, in which the user is taken to an audio-only interface whenever they want to talk to ChatGPT.

This "Separate Mode" can activate and deactivate as many times as desiredWithout limit. It's a way for each person to choose whether they prefer the more immersive, audio-focused experience, or the new unified interface that combines text, voice, and visuals.

Global availability and use in Spain and Europe

OpenAI has indicated that the Voice mode integration within chat is being rolled out globally This applies to both the web version and the mobile apps. In practice, in Spain and the rest of Europe, users only need to update the app from their mobile device's official app store or refresh the website to start seeing the new behavior.

The company points out that the Basic access to the voice assistant remains freeHowever, for accounts without a subscription, there may be limits on usage minutes or daily intensity, which are managed dynamically based on service load. Those with paid plans have greater flexibility and premium voice features integrated directly into the main chat window.

In the European context, this update comes at a time when the competition between AI-based voice assistants The race is intensifying, with offerings like Google's Gemini Live and tools integrated into mobile and desktop ecosystems. ChatGPT's full voice and text integration puts the service in a strong position in this competition.

For the average user in Spain, this means that they can consult routes on a map, ask for recommendations, review tasks, or resolve complex doubts by speaking naturally., while simultaneously viewing the information organized on the screen, without cuts between modes or abrupt interface changes.

In the professional and educational fields, this new form of interaction can facilitate voice summary generationAI-assisted meetings, correction of dictated texts, or support in language studies, taking advantage of both the auditory and visual aspects within the same workflow.

With this move, ChatGPT is moving towards a more unified conversational experience, in which Voice, text, and visual content coexist on a single screenThe option to return to the old "Separate Mode" leaves room for more traditional users, but OpenAI's main focus is clearly on a continuous interaction model, closer to how we talk and consult information in our daily lives, whether from Spain, the rest of Europe, or anywhere else on the map.

Using ChatGPT as your primary assistant on Android 3
Related article:
Complete Guide to Using ChatGPT as a Voice Assistant on Android