Ptechhub
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs
No Result
View All Result
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs
No Result
View All Result
PtechHub
No Result
View All Result

How To Use Gemini AI To Summarize YouTube Videos

By Wired by By Wired
April 27, 2025
Home AI & ML
Share on FacebookShare on Twitter


A follow-up question about the final score was answered correctly, but Gemini got the name of the scorer of the first touchdown wrong: The AI suggested it was Johan Dotson. Dotson was shown getting a touchdown in the highlights with the scores at 0-0, but it was ruled out—an example of the nuances that AI doesn’t necessarily pick up on.

Gemini did successfully identify when the Kansas City Chiefs got their first points, and even included a timestamp linking straight to the touchdown in the YouTube clip. It also got the name of the scorer right. It seems Gemini is heavily reliant on the commentary for sports clips, which isn’t surprising.

Summarize Video Contents

The AI can pick out video details—if they’re mentioned in the audio.

Photograph: David Nield

Next, we tried putting Gemini up against a behind-the-scenes featurette for The Grand Budapest Hotel, directed by Wes Anderson. The clip runs to four-and-a-half minutes, and Gemini fired back some replies almost instantly: It identified the name of the film being talked about, and the main beats of the clip’s narrative.

However, it’s all reliant on the audio (or the transcript) again—there doesn’t seem to be any analysis of the actual video contents. The AI couldn’t say who the talking heads were in the video, even though their names were shown on screen, and wasn’t able to say who the director was (even though this was also mentioned in the video description).

On the plus side, Gemini did do an impressive job of summing up the audio of the video. It correctly identified some of the filmmaking challenges that were mentioned throughout, and provided timestamps to them — from looking for a set to represent the Grand Budapest, to filling it with extras.

Summarize Interviews

Image may contain Page Text File and Webpage

Gemini can provide timestamps for the specified video.

Photograph: David Nield

Finally, we tried Google Gemini with an interview: Channel 4 in the UK speaking to Charlie Brooker and Siena Kelly about the latest series of Black Mirror (perhaps appropriate for an article on AI). Gemini proved itself very capable at picking out the talking points, and adding timestamps, though of course the whole video is mostly talking.

Again though, there’s no context about anything outside of the audio or the transcript. Gemini AI couldn’t say where the interview took place, or how the participants were acting, or anything else about the visuals of the video—which is worth bearing in mind if you use it yourself.

For videos where the answers you want are in the audio of a YouTube video, and its associated transcript, Gemini works really well at summarizing and providing accurate answers (provided the commentators mention when a touchdown is ruled out, as well as when one is scored). For any kind of visual information, you’re still going to have to watch the video yourself.



Source link

Tags: appsArtificial Intelligencegooglegoogle geminiVideoyoutube
By Wired

By Wired

Next Post
These U.S. consumer stocks face higher China risks, TD Cowen survey finds

These U.S. consumer stocks face higher China risks, TD Cowen survey finds

Recommended.

Ericsson, Nokia and Fraunhofer HHI join forces to drive 6G-era video coding standardization

Ericsson, Nokia and Fraunhofer HHI join forces to drive 6G-era video coding standardization

October 27, 2025
Stocks making the biggest moves midday: CoreWeave, Fermi, Paramount Skydance, Gemini and more

Stocks making the biggest moves midday: CoreWeave, Fermi, Paramount Skydance, Gemini and more

November 11, 2025

Trending.

Spirit of openness helps banks get serious about stopping scams | Computer Weekly

Spirit of openness helps banks get serious about stopping scams | Computer Weekly

April 10, 2025
Weibo Publishes 2025 Environmental, Social and Governance Report

Weibo Publishes 2025 Environmental, Social and Governance Report

April 28, 2026
It Takes 2 Minutes to Hack the EU’s New Age-Verification App

It Takes 2 Minutes to Hack the EU’s New Age-Verification App

April 18, 2026
Chunghwa Telecom 2025 Form 20-F filed with the U.S. SEC

Chunghwa Telecom 2025 Form 20-F filed with the U.S. SEC

April 15, 2026
2025 Wired, WLAN Gartner Magic Quadrant: Cisco Drops To Challenger, NaaS Specialists Join

2025 Wired, WLAN Gartner Magic Quadrant: Cisco Drops To Challenger, NaaS Specialists Join

July 14, 2025

PTechHub

A tech news platform delivering fresh perspectives, critical insights, and in-depth reporting — beyond the buzz. We cover innovation, policy, and digital culture with clarity, independence, and a sharp editorial edge.

Follow Us

Industries

  • AI & ML
  • Cybersecurity
  • Enterprise IT
  • Finance
  • Telco

Navigation

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Subscribe to Our Newsletter

  • About
  • Advertise
  • Privacy & Policy
  • Contact

Copyright © 2025 | Powered By Porpholio

No Result
View All Result
  • News
  • Industries
    • Enterprise IT
    • AI & ML
    • Cybersecurity
    • Finance
    • Telco
  • Brand Hub
    • Lifesight
  • Blogs

Copyright © 2025 | Powered By Porpholio