
OpenAI’s ChatGPT Can Now Speak, Hear, and See: A Major Multimodal Upgrade


With the addition of new voice and image capabilities in ChatGPT, OpenAI is once again pushing the boundaries of AI technology. These features are expected to change the way users engage with the AI model, providing a more intuitive and immersive experience.

Voice Conversations with ChatGPT

One of the most notable features of this release is the ability to hold voice conversations with ChatGPT. Users can now talk back and forth with their AI assistant, opening up a world of possibilities. ChatGPT’s voice capabilities are ready to help whether you’re on the go, looking for a bedtime story for your family, or settling a dinner table debate.

To begin using voice, go to the Settings menu in the mobile app, select “New Features,” and opt in to voice conversations. Once the feature is enabled, tap the headphone icon in the top-right corner of the home screen and choose one of five voices. OpenAI created each voice in collaboration with professional voice actors to provide a human-like audio experience. Whisper, OpenAI’s open-source speech recognition system, transcribes your spoken words into text so that ChatGPT can respond to them.

Image Interaction with ChatGPT

The ability to share images with ChatGPT is another game changer. By showing ChatGPT one or more images, users can troubleshoot problems, explore content, and analyze complex data. ChatGPT can help you figure out why your grill won’t start, plan a meal based on the contents of your fridge, or analyze a complex graph for work.

To use this feature, tap the photo button to capture or select an image. On iOS or Android, tap the plus button first. You can also upload several images or use the drawing tool to guide your assistant. Multimodal versions of GPT-3.5 and GPT-4 power these image capabilities by applying their language reasoning skills to a wide range of visual input, such as photos, screenshots, and documents containing both text and images.

Gradual Deployment for Security and Resilience

Voice and image capabilities will be gradually rolled out to Plus and Enterprise subscribers over the next two weeks. Voice will be available on both iOS and Android, with the option to opt in via settings, while images will be available on all platforms.

OpenAI recognizes the risks that come with these expanded capabilities. For voice, the technology is limited to a specific use case, voice chat, and the voices were created in partnership with professional voice actors. Notably, Spotify is also using this technology for its Voice Translation feature, which helps podcasters reach a wider audience by translating their content into additional languages in their own voices.

To protect people’s privacy, OpenAI has limited ChatGPT’s ability to analyze and make direct statements about people in images. Real-world usage and user feedback will be critical in further improving these safeguards while keeping the tool useful.

