OpenAI’s ChatGPT Now Can Speak, Hear, and See: a Multimodal Upgrade in History

ChatGPT Now Can Speak Hear and See

With the addition of additional speech and image capabilities in ChatGPT, OpenAI is once again pushing the boundaries of AI technology. These features are expected to change the way users engage with the AI model, providing a more intuitive and immersive experience.

Voice Conversations with ChatGPT

One of the most notable aspects of this version is the ability to do voice conversations using ChatGPT. Users may now converse with their AI helper in real time, bringing up a world of possibilities. ChatGPT’s voice skills are ready to assist you whether you’re on the go, looking for a bedtime story for your family, or settling a dinner table disagreement.

To begin using voice, go to the Settings menu in the mobile app, pick “New Features,” and enable voice conversations. Once activated, press the headphone icon in the top-right corner of the home screen to select one of five voices. Professional voice actors have meticulously developed these voices to provide a human-like audio experience. Furthermore, Whisper, OpenAI’s open-source speech recognition system, transcribes spoken words into text, improving the overall quality of the conversation.

Image Interaction with ChatGPT

The ability to share photos with ChatGPT is another game changer. Users can now use ChatGPT to troubleshoot difficulties, explore material, and evaluate complex data by displaying one or more photos. ChatGPT can help you figure out why your grill won’t start, design a dinner based on the contents of your fridge, or analyze a data graph for work.

Tap the photo button to capture or select an image to use this function. Tap the addition button first on iOS or Android to upload several photographs, or use the sketching tool to lead your assistant. Multimodal models, such as GPT-3.5 and GPT-4, power these picture capabilities by applying language reasoning skills to a wide range of visual input, such as photos, screenshots, and documents comprising text and images.

Gradual Deployment for Security and Resilience

Voice and image capabilities will be gradually handed out to Plus and Enterprise subscribers over the next two weeks. Voice will be available on both the iOS and Android platforms, with the option to opt in via settings, while photos will be available on all devices.

OpenAI recognizes the hazards involved with these increased capabilities. The emphasis for voice is on voice chat, and the technology was created in partnership with voice actors to assure authenticity and safety. Notably, Spotify is leveraging this technology for its Voice Translation service, which allows podcasters to increase their audience by translating content into several languages using their own voices.

To protect people’s privacy, OpenAI has limited ChatGPT’s capacity to analyze and make direct statements about them using image input. Real-world usage and user input will be critical in further improving these safeguards while ensuring the tool’s usability.


Subscribe to Our Newsletter

Related Articles

Top Trending

Jennifer Lopez and Ben Affleck Reportedly File for Divorce
Jennifer Lopez and Ben Affleck Reportedly Headed for Divorce: A Closer Look at Their Relationship Struggles
billie eilish new music hit me hard and soft
Billie Eilish: Rekindling a Love for Music with "Hit Me Hard and Soft"
Highest Paid Athlete Cristiano Ronaldo Forbes
Cristiano Ronaldo Earns $260M, Forbes' Highest-Paid Athlete
xander schauffele record round at pga championship
Xander schauffele's Record-Equalling Round at PGA Championship: A Stellar Performance
Reddit Shares Jump Openai Chatgpt Deal
Reddit Shares Surge Over 10% After Partnership Deal with OpenAI

LIFESTYLE

Creative Ways to Show Appreciation for Mothers
Creative Ways to Show Appreciation for Mothers on Mother's Day
Mothers Day Speech Ideas
Inspiring Mother's Day Speech Ideas for a Memorable Tribute
Rabindra Jayanti 2024
Rabindra Jayanti 2024: Celebrating the Life and Legacy of Rabindranath Tagore
May 6 Zodiac
May 6 Zodiac: Positive Traits, Compatibility and More about Taurus
why initial bracelets perfect personalized gifts
Why Initial Bracelets Make the Most Personalized Gifts

Entertainment

Jennifer Lopez and Ben Affleck Reportedly File for Divorce
Jennifer Lopez and Ben Affleck Reportedly Headed for Divorce: A Closer Look at Their Relationship Struggles
billie eilish new music hit me hard and soft
Billie Eilish: Rekindling a Love for Music with "Hit Me Hard and Soft"
Guruvayoor Ambalanadayil
Guruvayoor Ambalanadayil: A Delightful Comedy-Drama with a Touch of Nostalgia
Atila Altaunbay
Atila Altaunbay’s Mysterious Disappearance and Shocking Life Details
silvana prince
Silvana Prince's Bio, Height, Weight, Family, Net Worth, Career, and More in 2024

GAMING

How to Save Money on Video Games
How to Save Money on Video Games
ghost of tsushima pc preorders canceled
Ghost of Tsushima PC Pre-Orders Canceled in Non-PSN Countries
Tips and strategies for winning the feudle
A Step-By-Step Guide and Strategies for Winning the Feudle Word Game in 2024
Blackjack Basic Strategy
Blackjack Basic Strategy: Your FAQ Roadmap
Roulette
Roulette Bets Explained: No More Spinning Confusion

BUSINESS

Reddit Shares Jump Openai Chatgpt Deal
Reddit Shares Surge Over 10% After Partnership Deal with OpenAI
taylor swift eras tour boosts uk economy
Taylor Swift's Tour Hands UK Economy £1 Billion Boost: Study
Morning Star Candlestick Pattern
Unveiling the Morning Star Candlestick Pattern: A Comprehensive Guide for New Investors
dogecoin wallet sells early misses million profit
10-Year-Old Dogecoin Wallet Sells Early, Misses $1 Million Profit Opportunity
how credit score affects loan eligibility
Does Your Credit Score Affect Your Personal Loan Eligibility?

TECHNOLOGY

Google IO Gemini 1.5 AI Innovations
Gemini 1.5 & Beyond: Top AI Breakthroughs from Google I/O
viralyft
Viralyft Service Analysis, Case Study, and Price Comparison in 2024
Google AI Finds Lost Glasses
Google's New AI Helps You Find Lost Glasses
chatgpt ai voice assistant daily show humor
ChatGPT's AI Voice Assistant Hilarity on 'The Daily Show
Android 15 Beta 2 Coming Wednesday
Android 15 Beta 2 Launches Today: What to Expect?

HEALTH

massachusetts man dies after pig kidney transplant
Massachusetts Man Dies After First Successful Pig Kidney Transplant
International Nurses Day 2024
The Heart of Healthcare: Celebrating International Nurses Day 2024
Semaglutide Weight Management Revolution
Semaglutide Revolution: Shaping the Future of Medical Weight Management
neuralink brain implant fix update
Neuralink Update: Elon Musk Announces Fix for Brain Implant Issue
how to recover after car accident
Tips for a Comprehensive Recovery After a Car Accident