By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
Times CatalogTimes CatalogTimes Catalog
  • Home
  • Tech
    • Google
    • Microsoft
    • YouTube
    • Twitter
  • News
  • How To
  • Bookmarks
Search
Technology
  • Meta
Others
  • Apple
  • WhatsApp
  • Elon Musk
  • Threads
  • About
  • Contact
  • Privacy Policy and Disclaimer
© 2025 Times Catalog
Reading: ChatGPT now understands real-time video, seven months after OpenAI first demoed it
Share
Notification
Font ResizerAa
Font ResizerAa
Times CatalogTimes Catalog
Search
  • News
  • How To
  • Tech
    • AI
    • Apple
    • Microsoft
    • Google
    • ChatGPT
    • Gemini
    • YouTube
    • Twitter
  • Coming Soon
Follow US
  • About
  • Contact
  • Privacy Policy and Disclaimer
© 2025 Times Catalog
Times Catalog > Blog > Tech > AI > ChatGPT > ChatGPT now understands real-time video, seven months after OpenAI first demoed it
AIChatGPTTech

ChatGPT now understands real-time video, seven months after OpenAI first demoed it

Usama
Last updated: December 13, 2024 1:07 pm
Usama
Share
6 Min Read
ChatGPT now understands real-time video, seven months after OpenAI first demoed it
SHARE

Nearly seven months after OpenAI first teased the feature, ChatGPT now supports real-time video analysis, marking a significant milestone in the evolution of AI-driven interactions. On Thursday, during a livestream, OpenAI unveiled this highly anticipated update, which integrates video capabilities into ChatGPT’s Advanced Voice Mode. Here’s how it works, who can access it, and what it means for the future of conversational AI.

Contents
Bringing Vision to Advanced Voice ModeA Staggered Rollout with LimitationsA Glimpse Into the Future of AI: The CNN DemoA Long Road to LaunchCompetition Heats UpA Touch of Holiday Cheer: Santa ModeWhat’s Next for ChatGPT?

Bringing Vision to Advanced Voice Mode

Advanced Voice Mode, already celebrated for its human-like conversational abilities, is now enhanced with visual understanding. Subscribers to ChatGPT Plus, Team, or Pro plans can use the ChatGPT app to point their phone cameras at objects and receive near real-time feedback. The feature expands the potential applications of ChatGPT, allowing it to interpret visual inputs as well as understand what’s happening on a device’s screen.

For instance, the app can help users decipher settings menus, solve math problems, or even explain the contents of an image. To enable this feature, users simply need to:

  1. Tap the voice icon next to the chat bar in the ChatGPT app.
  2. Select the video icon on the bottom left to activate video mode.
  3. For screen sharing, tap the three-dot menu and choose “Share Screen.”

This multimodal experience sets a new standard for AI interactivity by blending auditory, textual, and visual inputs.

A Staggered Rollout with Limitations

OpenAI plans to roll out Advanced Voice Mode with vision gradually, starting Thursday and completing the process within the next week. However, not all users will gain immediate access.

  • Delayed for Enterprise and Education Plans: ChatGPT Enterprise and Edu subscribers will have to wait until January to access the feature.
  • Unavailable in Certain Regions: Users in the European Union, Switzerland, Iceland, Norway, and Liechtenstein won’t see the update for now, with no confirmed timeline for these regions.

These rollout constraints reflect OpenAI’s ongoing efforts to address compliance, technical readiness, and user feedback as it scales the feature globally.

A Glimpse Into the Future of AI: The CNN Demo

During a recent appearance on CNN’s 60 Minutes, OpenAI President Greg Brockman showcased the capabilities of Advanced Voice Mode with vision. In an engaging segment, Brockman quizzed Anderson Cooper on anatomy using ChatGPT. As Cooper sketched body parts on a blackboard, ChatGPT analyzed his drawings in real-time.

ChatGPT now understands real-time video, seven months after OpenAI first demoed it
OpenAI employees demo ChatGPT’s Advanced Voice Mode with vision during a livestream. Image Credits: OpenAI

“The location is spot on,” ChatGPT commented, acknowledging that Cooper had correctly placed the brain. It added, “As for the shape, it’s a good start. The brain is more of an oval.”

The demo highlighted both the potential and limitations of the technology. While ChatGPT’s analysis impressed viewers, it made a notable error on a geometry problem, underscoring that it remains susceptible to hallucinations and inaccuracies.

A Long Road to Launch

The journey to this release has been anything but smooth. OpenAI initially promised Advanced Voice Mode with vision back in April, suggesting a rollout “within a few weeks.” However, the feature faced repeated delays, reportedly due to premature announcements and technical hurdles. When Advanced Voice Mode finally arrived in early fall, it debuted without visual capabilities, focusing solely on voice interactions. Since then, OpenAI has worked diligently to refine the technology, expand its availability, and address regional compliance issues, particularly in the EU.

Competition Heats Up

OpenAI isn’t alone in the race to develop real-time, video-capable AI. This week, Google announced a significant step forward with its conversational AI feature, Project Astra, which enables real-time video analysis. Currently, Project Astra is being tested by a select group of Android users. Meta is also reportedly working on similar technology, further fueling competition in the AI space.

A Touch of Holiday Cheer: Santa Mode

In addition to the high-tech rollout, OpenAI added a playful twist to ChatGPT with the launch of “Santa Mode.” This festive feature allows users to interact with ChatGPT in Santa’s voice, bringing a dose of holiday spirit to their conversations. To activate Santa Mode, simply tap or click the snowflake icon next to the prompt bar in the ChatGPT app.

What’s Next for ChatGPT?

With the introduction of Advanced Voice Mode with vision, OpenAI has once again raised the bar for AI-driven tools. By enabling real-time video analysis, the company is opening up new possibilities for education, accessibility, and productivity. However, the technology’s limitations—including occasional errors and a delayed rollout in certain regions—serve as reminders of the challenges inherent in pushing the boundaries of AI.

As OpenAI continues to refine its offerings, the competition from industry giants like Google and Meta will undoubtedly keep the pressure high. For now, though, the addition of vision capabilities cements ChatGPT’s status as a frontrunner in the ever-evolving AI landscape.

You Might Also Like

Logitech’s MX Creative Console now supports Figma and Adobe Lightroom

Samsung resumes its troubled One UI 7 rollout

Google Messages starts rolling out sensitive content warnings for nude images

Vivo wants its new smartphone to replace your camera

Uber users can now earn miles with Delta Air Lines

Share This Article
Facebook Twitter Pinterest Whatsapp Whatsapp Copy Link
What do you think?
Love0
Happy0
Sad0
Sleepy0
Angry0
Previous Article Google built an AI tool that can do research for you Google built an AI tool that can do research for you
Next Article WhatsApp lets you select specific people within a group to start a group call without disturbing anyone WhatsApp lets you select specific people within a group to start a group call without disturbing anyone
Leave a comment Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

144FollowersLike
23FollowersFollow
237FollowersPin
19FollowersFollow

Latest News

Pinterest is prompting teens to close the app at school
Pinterest is prompting teens to close the app at school
News Tech April 22, 2025
ChatGPT search is growing quickly in Europe, OpenAI data suggests
ChatGPT search is growing quickly in Europe, OpenAI data suggests
AI ChatGPT OpenAI April 22, 2025
social-media-is-not-wholly-terrible-for-teen-mental-health-study-says
Social media is not wholly terrible for teen mental health, study says
News April 22, 2025
Google is trying to get college students hooked on AI with a free year of Gemini Advanced
Google is trying to get college students hooked on AI with a free year of Gemini Advanced
AI Gemini Google Tech April 19, 2025
Times CatalogTimes Catalog
Follow US
© 2025 Times Catalog
  • About
  • Contact
  • Privacy Policy and Disclaimer
Welcome Back!

Sign in to your account

Lost your password?