OpenAI’s ChatGPT Can Now Speak, Hear, and See: A Historic Multimodal Upgrade

By adding new voice and image capabilities to ChatGPT, OpenAI is once again pushing the boundaries of AI technology. These features are expected to change how users engage with the model, offering a more intuitive and immersive experience.

Voice Conversations with ChatGPT

One of the most notable additions in this release is the ability to hold voice conversations with ChatGPT. Users can now converse with the assistant in real time, which opens up a range of uses. The voice feature can help whether you’re on the go, want a bedtime story for your family, or need to settle a dinner-table debate.

To get started with voice, go to Settings in the mobile app, select “New Features,” and opt in to voice conversations. Then tap the headphone icon in the top-right corner of the home screen and choose one of five voices. The voices are produced by a text-to-speech model that generates human-like audio, and each was created in collaboration with professional voice actors. On the input side, Whisper, OpenAI’s open-source speech recognition system, transcribes your spoken words into text.

Image Interaction with ChatGPT

The ability to share images with ChatGPT is another major addition. Users can now show ChatGPT one or more images to troubleshoot problems, explore content, or analyze complex data. For example, ChatGPT can help you figure out why your grill won’t start, plan a meal based on the contents of your fridge, or interpret a graph of work-related data.

To use this feature, tap the photo button to capture or choose an image. On iOS or Android, tap the plus button first. You can also discuss multiple images or use the drawing tool to guide your assistant. Image understanding is powered by multimodal versions of GPT-3.5 and GPT-4, which apply their language reasoning skills to a wide range of visual input, such as photographs, screenshots, and documents containing both text and images.

Gradual Deployment for Security and Resilience

Voice and image capabilities will roll out gradually to Plus and Enterprise subscribers over the next two weeks. Voice will be available on iOS and Android as an opt-in setting, while image input will be available on all platforms.

OpenAI acknowledges the risks that come with these expanded capabilities. Because realistic synthetic voices could be misused for impersonation or fraud, the voice technology is limited to a specific use case, voice chat, and the voices were created with voice actors OpenAI worked with directly. Notably, Spotify is using the same technology to pilot its Voice Translation feature, which helps podcasters reach a wider audience by translating their podcasts into additional languages in their own voices.

To protect individuals’ privacy, OpenAI has limited ChatGPT’s ability to analyze people in images and make direct statements about them. Real-world usage and user feedback will be critical to refining these safeguards while keeping the tool useful.

