Home > AI

ChatGPT voice control – how ChatGPT talks

What is ChatGPT voice mode?

Reviewed By: Kevin Pocock

Last Updated on February 1, 2024
How to use ChatGPT voice control and voice commands with OpenAI's GPT-4V multimodal AI model.
You can trust PC Guide: Our team of experts use a combination of independent consumer research, in-depth testing where appropriate - which will be flagged as such, and market analysis when recommending products, software and services. Find out how we test here.

The OpenAI AI chatbot just got a text-to-speech upgrade. On September 25th, 2023, OpenAI CEO Sam Altman announced ChatGPT voice control (or simply “ChatGPT Voice”). Not only can you now chat to ChatGPT in natural language through voice input, but thanks to OpenAI’s Whisper AI model, it can also talk back!

What is ChatGPT voice control?

Announced on September 25th, OpenAI co-founder Sam Altman confirmed ChatGPT voice control via Twitter. Starting early next month, you will be able to “voice type”, meaning ChatGPT will transcribe your voice commands into a text prompt, and then act on them as if you’d typed it. The ‘voice and images’ update was initially rolled out to ChatGPT Plus and Enterprise users, because the ChatGPT Teams plan didn’t exist at the time.

How to use ChatGPT voice control

OpenAI has since confirmed that “ChatGPT with voice is now available to all free users.” To use it, you can “Download the app on your phone and tap the headphones icon to start a conversation.”

✓ Steve says

Strong competition for ElevenLabs

OpenAI has filed to trademark the logo for ChatGPT Voice. You can find the USPTO Trademark & Patent filing, dated October 5th, by searching for Openai Opco L L C Trademarks & Logos.

ChatGPT voice control is exclusive to the ChatGPT app, as mobile devices are all microphone-enabled (a requirement of voice commands). In the mobile app for Android or iOS, access your OpenAI account and follow these instructions:

To get started with voice, head to Settings → New Features on the mobile app and opt into voice conversations. Then, tap the headphone button located in the top-right corner of the home screen and choose your preferred voice out of five different voices.

OpenAI.com

Essential AI Tools

Editor’s pick
Only $0.00019 per word!

Content Guardian – AI Content Checker – One-click, Eight Checks

8 Market leading AI Content Checkers in ONE click. The only 8-in-1 AI content detector platform in the world. We integrate with leading AI content detectors to give unparalleled confidence that your content appear to be written by a human.
EXCLUSIVE DEAL 10,000 free bonus credits

Jasper AI

On-brand AI content wherever you create. 100,000+ customers creating real content with Jasper. One AI tool, all the best models.
TRY FOR FREE

WordAI

10x Your Content Output With AI. Key features – No duplicate content, full control, in built AI content checker. Free trial available.
TRY FOR FREE

Copy.ai

Experience the full power of an AI content generator that delivers premium results in seconds. 8 million users enjoy writing blogs 10x faster, effortlessly creating higher converting social media posts or writing more engaging emails. Sign up for a free trial.
TRY FOR FREE

Writesonic

Create SEO-optimized and plagiarism-free content for your blogs, ads, emails, and website 10X faster. Start for free. No credit card required.

The new voice recognition input field will be powered by “a new text-to-speech model, capable of generating human-like audio from just text and a few seconds of sample speech.” OpenAI also employed the use of its proprietary and open-source speech recognition system, Whisper, for transcribing spoken words into text.

To use speech input on iPhone or Android device:

Step

1

Open ChatGPT

Open ChatGPT on your mobile device. This can be either via web browser or via the ChatGPT app for iOS and Android.

How to access ChatGPT voice control.
How to access ChatGPT voice control.

Step

2

Allow audio recording

In the text prompt window, tap the microphone icon to the left of “Message”. You will need to “Allow ChatGPT to record audio” if you’ve not used the feature since the last time you installed the app on this device.

Allow ChatGPT to record audio.
Allow ChatGPT to record audio.

Step

3

Voice prompt ChatGPT

Speak into your phone as you would on a call. ChatGPT will listen, and once you’re done speaking, tap the screen where indicated. It will transcribe your speech to text, which you can edit via on-screen keyboard (GBoard on Android devices) before submitting it.

Voice transcription with ChatGPT voice mode.
Voice transcription with ChatGPT voice mode.

This can be used as voice dictation software, or to transcribe podcasts, among countless emerging use cases. No longer will you have to rely on voice control extensions like the talk-to-ChatGPT Chrome extension from the Google Chrome web store. You will not need a browser extension or plugin to use ChatGPT voice control. You will, of course, need an external microphone to use any form of speech recognition software on a Mac or PC.

ChatGPT audio input & audio output

This pits it head-to-head with Apple’s Siri and Amazon’s Alexa (who just announced their opposing developments from voice assistant into artificial intelligence). ChatGPT voice control also poses a competitive threat to tech giant Google, currently breaking new ground in AI audio.

Now that ChatGPT can see, hear, and speak, OpenAI hired professional voice actors to record the 5 realistic synthetic voices you can choose from.

When can you use ChatGPT’s voice feature?

All users now have free access to speech-to-text with ChatGPT Voice. It will be accessible through the app for iOS and Android. OpenAI’s focus on mobile is due to the necessity of a microphone – which is an uncommon standalone accessory on desktops but comes built-in on mobile devices.

Steve is the AI Content Writer for PC Guide, writing about all things artificial intelligence. He currently leads the AI reviews on the website.