OpenAI’s ChatGPT Plus Introduces Advanced Voice Mode

OpenAI has launched an alpha version of Advanced Voice Mode for ChatGPT Plus users, available on the ChatGPT mobile app for iOS and Android. This feature offers real-time conversations with AI-generated voices, enhancing user interaction. Initially, it's accessible to a select group, with a broader rollout planned by fall.

OpenAI has initiated the alpha rollout of its new Advanced Voice Mode for a select group of ChatGPT Plus users, allowing them to engage in more natural conversations with the AI chatbot on the official ChatGPT mobile app for iOS and Android.

Limited Rollout to ChatGPT Plus Users


The company announced on X that the mode would be available to “a small group of ChatGPT Plus users,” adding that more users would be added on a rolling basis, with plans for all ChatGPT Plus subscribers to have access by fall. ChatGPT Plus, priced at $20 per month, provides enhanced access to OpenAI’s large language model (LLM)-powered chatbot and sits alongside the company’s other subscription tiers: Free, Team, and Enterprise.

It remains unclear how OpenAI is selecting users for the initial access to Advanced Voice Mode. However, the company noted that “users in this alpha will receive an email with instructions and a message in their mobile app” for ChatGPT, so interested users should check their inboxes and app notifications.

Advanced Voice Mode Features

The Advanced Voice Mode, showcased at OpenAI’s Spring Update event in May 2024, enables real-time conversation with four AI-generated voices on ChatGPT. The chatbot can handle interruptions and detect, respond to, and convey different emotions in its utterances and intonations.

OpenAI demonstrated various potential use cases for this conversational feature, including acting as a tutoring aid, fashion adviser, and guide for the visually impaired, especially when combined with its Vision capabilities.

System Requirements and Usage

Advanced Voice Mode is currently available on the iOS and Android ChatGPT apps. To use this feature, Android users need app version 1.2024.206 or later, and iOS users need app version 1.2024.205 or later with iOS 16.4 or later.
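
For readers who think in code, here is a minimal sketch of the version gate these requirements imply. The helper names and the idea of a client-side check are illustrative assumptions, not OpenAI’s actual implementation; only the version numbers come from the requirements above.

```python
# Minimal sketch of gating Advanced Voice Mode on the documented minimum
# app versions. Function names are hypothetical; only the version numbers
# are taken from the article.

MIN_VERSIONS = {
    "android": (1, 2024, 206),  # 1.2024.206 or later
    "ios": (1, 2024, 205),      # 1.2024.205 or later (also requires iOS 16.4+)
}

def parse_version(version: str) -> tuple[int, ...]:
    """Turn a dotted version string like '1.2024.206' into a comparable tuple."""
    return tuple(int(part) for part in version.split("."))

def supports_advanced_voice(platform: str, app_version: str) -> bool:
    """Return True if the installed app version meets the documented minimum."""
    return parse_version(app_version) >= MIN_VERSIONS[platform]

print(supports_advanced_voice("android", "1.2024.206"))  # True
print(supports_advanced_voice("ios", "1.2024.204"))      # False
```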

To start a conversation, users should select the Voice icon at the bottom-right of the screen. During the conversation, users can mute or unmute the microphone by selecting the microphone icon at the bottom-left of the screen and end the conversation by pressing the red icon at the bottom-right. Users need to provide the ChatGPT app with microphone permission to use this feature.

Usage Limits and Current Constraints

Advanced Voice Mode is currently in a limited alpha and may make mistakes. Usage of Advanced Voice Mode (audio inputs and outputs) is limited daily, with precise limits subject to change. The ChatGPT app will issue a warning when three minutes of audio usage remain. Once the limit is reached, the conversation will end, and users will be invited to switch to standard voice mode.
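
As a rough illustration, the limit behavior described above maps to a simple countdown. The daily limit value and helper names below are assumptions for illustration; only the three-minute warning and the end-of-conversation behavior come from the article.

```python
# Illustrative sketch of the daily audio-limit behavior: warn when three
# minutes remain, end the session at the limit. The limit itself is subject
# to change, so it is passed in rather than hard-coded.

WARNING_THRESHOLD_SECONDS = 3 * 60  # app warns at three minutes remaining

def check_audio_budget(used_seconds: int, daily_limit_seconds: int) -> str:
    """Map remaining audio time to the behavior the article describes."""
    remaining = daily_limit_seconds - used_seconds
    if remaining <= 0:
        return "end conversation; offer standard voice mode"
    if remaining <= WARNING_THRESHOLD_SECONDS:
        return f"warn user: {remaining // 60} minute(s) of audio remaining"
    return "continue conversation"
```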

Advanced Voice Mode cannot create new memories or access previous memories or custom instructions. Conversations started in this mode can be resumed in Advanced Voice, text, or standard voice. However, because Advanced Voice lacks memory and custom instruction support, conversations started in text or standard voice cannot be resumed in Advanced Voice Mode, as the sketch below illustrates.
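
These resumption rules form a small compatibility matrix. The enum and function below are a hypothetical rendering of that matrix, not OpenAI’s code: a conversation started in Advanced Voice Mode can be resumed anywhere, but only conversations that began there can return to it.

```python
# Sketch of the resumption rules: Advanced Voice Mode only accepts
# conversations that began in Advanced Voice Mode. Names are ours.

from enum import Enum

class Mode(Enum):
    ADVANCED_VOICE = "advanced_voice"
    STANDARD_VOICE = "standard_voice"
    TEXT = "text"

def can_resume(started_in: Mode, resume_in: Mode) -> bool:
    """Return True if a conversation started in one mode can resume in another."""
    if resume_in is Mode.ADVANCED_VOICE:
        return started_in is Mode.ADVANCED_VOICE
    return True

print(can_resume(Mode.ADVANCED_VOICE, Mode.TEXT))      # True
print(can_resume(Mode.TEXT, Mode.ADVANCED_VOICE))      # False
```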

How to Minimize Conversation Interruptions

To minimize interruptions during conversations in Advanced Voice Mode, OpenAI recommends using headphones. iPhone users can enable Voice Isolation mic mode by opening Control Center, selecting Mic Mode, and switching to Voice Isolation. If issues persist, restarting the app, increasing the assistant’s volume, or moving to a quieter environment may help. The feature is not optimized for use with in-car Bluetooth or speakerphone.

Data Usage and Privacy Considerations

During the alpha phase, audio from Advanced Voice Mode conversations will be used to train OpenAI’s models if users have shared their audio. Users can opt out by disabling “Improve voice for everyone” in their Data Controls settings. If this setting is not visible, it means the user hasn’t shared their audio, and it will not be used for training.

With standard voice mode, if users share their audio, OpenAI will store audio from voice chats rather than deleting clips after transcription. Efforts will be made to reduce personal information in the audio used for training, and the team may review shared audio.
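
The training-data rule described in this section reduces to a two-flag decision. The field and function names below are assumptions for illustration, and the real data pipeline is certainly more involved; the sketch only captures the stated policy that audio is trained on when it has been shared and the opt-out is left enabled.

```python
# Sketch of the stated policy: audio is eligible for training only when the
# user shared it and has not disabled "Improve voice for everyone".
# Field names are hypothetical.

from dataclasses import dataclass

@dataclass
class DataControls:
    shared_audio: bool                 # user opted to share audio
    improve_voice_for_everyone: bool   # toggle in Data Controls settings

def audio_used_for_training(controls: DataControls) -> bool:
    """Audio is used for training only if shared and not opted out."""
    return controls.shared_audio and controls.improve_voice_for_everyone

print(audio_used_for_training(DataControls(True, False)))   # False: opted out
print(audio_used_for_training(DataControls(False, True)))   # False: never shared
```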

No Support for GPTs, Music, and Video

Advanced Voice Mode is not yet available for use with GPTs and cannot generate musical content due to protections for creators’ rights. Video and screen-sharing support are also not part of the current alpha but will be available in future updates.

How Advanced Voice Mode Stands Out

The release of ChatGPT Advanced Voice Mode differentiates OpenAI from competitors such as Meta, with its new Llama models, and Anthropic, with Claude, and puts pressure on emotive voice-focused AI startups like Hume. In recent months, OpenAI has released numerous papers on safety and AI model alignment, following the disbanding of its superalignment team and criticism from former and current employees that the company has prioritized new products over safety.

Future Availability

OpenAI plans for all ChatGPT Plus users to have access to Advanced Voice Mode by fall, contingent on meeting safety and reliability standards. The company is also working on rolling out the video and screen-sharing capabilities, which were demoed separately, and will keep users updated on the timeline.

The cautious rollout of Advanced Voice Mode appears aimed at addressing these criticisms and reassuring users, regulators, and lawmakers that OpenAI is committed to prioritizing safety alongside innovation and profitability.

