×
Google’s Gemini AI now turns photos into 8-second videos with audio
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

Google has launched a new Gemini AI feature that transforms photos into video clips using its Veo 3 video model. The feature generates eight-second videos complete with AI-generated audio, including background noises, environmental sounds, and speech that sync to the visuals, expanding Google’s AI capabilities beyond text and static images.

What you should know: The photo-to-video capability is now available to Google AI Ultra and Pro subscribers in select regions, rolling out on web today and mobile devices throughout the week.

  • Users access the feature by clicking “tools” in the prompt bar, selecting “video,” and uploading their photo alongside a text description of desired movement.
  • Audio descriptions can be included for dialogue, sound effects, and ambient noise, which Google says will be “perfectly synced with the visuals.”
  • Finished videos are delivered as MP4 files at 720p resolution in 16:9 landscape format.

How it works: The feature leverages Google’s Veo 3 video model to animate static images based on user prompts and descriptions.

  • “You can get creative by animating everyday objects, bringing your drawings and paintings to life, or adding movement to nature scenes,” Google said.
  • All generated videos include both visible watermarks showing they are AI-generated and invisible SynthID digital watermarks for authenticity tracking.

Competitive landscape: This capability was previously available in Flow, Google’s generative AI filmmaking tool launched in March, but now integrates directly into Gemini for easier access.

  • Flow is also expanding to “an additional 75 countries” today alongside the Gemini video feature rollout.
  • The integration eliminates the need for users to open separate applications to animate their photographs.
Gemini AI can now turn photos into videos

Recent News

Moonvalley raises $84M for ethical AI video generation platform

CAA and Comcast Ventures bet on a model that pays creators instead of scraping their work.

Why AI success requires more human work, not less

Top-performing teams spend 90 minutes daily training AI systems for optimal results.