Google is using YouTube videos to train its AI video generator

27 rntn 33 6/19/2025, 3:44:00 PM cnbc.com ↗

Comments (33)

pier25 · 4h ago
To no one's surprise. If you're not the customer you're the product.
emodendroket · 4h ago
I think that phrase ought to be retired simply because even if you are paying money you often still are “the product.”
kube-system · 4h ago
If you're paying money, you still might not be a company's real customer: https://www.statista.com/statistics/1093781/distribution-of-...
emodendroket · 2h ago
I don’t think it’s even right to think that there’s one “real” customer and one “fake” one really. It seems like an oversimplified model that doesn’t accurately describe how anybody operates besides a mom-and-pop.
kube-system · 2h ago
The "unsimple" answer is:

1. if you don't have leverage with your vendors, they will not bend over backwards for you

2. companies are not incentivized to respond to complaints with no revenue at risk (e.g. you're going to use youtube anyway)

nuodag · 3h ago
Yes, and a way out of that is open source, where you aren't a customer…
pier25 · 3h ago
Absolutely. See TVs for example. Price has gone down because they sell the data of what you're watching.
techjamie · 2h ago
Drop tens of thousands on a new vehicle at a stealership and you'll get sketchy companies offering warranties on your exact vehicle within a week and then forever-more afterwards.

The way I understand it, usually either the dealership, the software they take your information in, or both typically sell off your data after the sale.

Also I get calls and letters from places asking to buy a vehicle I haven't owned since 2018 regularly.

josefritzishere · 2h ago
This is the most important thread here. I don't think even Andrew Lewis saw this coming. Now we are always the product because we lack digital rights. It's all been legislated away.
echelon · 4h ago
This should have resulted in an antitrust dismantlement by now. Google has every structural advantage in the world.

Years ago, Google would have been worth more if sold for parts. They were giving away far too much (and pissing on entire industries while doing so). Now they're activating all of those assets for strong, explosive incremental growth. It's hard to even call it incremental. More like checkmate world.

They're going to off so many businesses this decade and collect all the money.

They own the web, they own most of mobile, they control the other half of mobile, they own search, they own media, they own advertising. There's not a dollar that gets made that doesn't flow through Google somehow.

You can't even build a brand anymore without getting extorted by Google. You'll have your competitors paying to trademark squat you, and the browser itself defaults to Google search.

Google really needs to be split into about a half dozen companies. This is way bigger and way worse than Ma Bell.

bgwalter · 4h ago
Here is a free business idea: Create an agentic "AI" video watcher. "AI" YouTube creators can register with the service, which will then watch their videos, will generate click-throughs to the advertisers and interact with the advertiser's web pages. The service is financed by profit sharing.

This streamlines video watching, which humans are notoriously slow at. It could lead to efficiency gains in video and ad watching that are practically unlimited.

kube-system · 4h ago
I'm guessing that is a facetious response, but in case it isn't: this is just plain old fraud.
JohnFen · 1h ago
I don't think that's fraud unless it's done by the channel operator. Me as an end user auto-clicking ads is not even in the same ballpark as actual fraud.
kube-system · 1h ago
> I don't think that's fraud unless it's done by the channel operator.

That's exactly what the parent comment suggested.

lovich · 2h ago
Sounds like you just need to execute on it fast enough that the government can't respond before you're too big to fight. Standard strategy.
kube-system · 1h ago
Not really. Click fraud isn't anything new, it has existed for decades, and there are many ways that it can be (and currently is) mitigated privately. The most common way is to ban, shadowban, or demonetize the offender. And if that doesn't work you can always be held civilly liable.

Contracting with others to commit fraud and violate contracts is not a good business idea even if you stay off the government's radar.

gauku · 4h ago
Almost sure that's a tongue-in-cheek response. Right?
bgwalter · 4h ago
Yes! I'm pretty sure though that given the current hype someone can come up with an elaborate legal and moral justification for increasing video watching efficiency.
add-sub-mul-div · 42m ago
Be Facebook and call it pivot-to-video.
kunzhi · 4h ago
Reading this article I couldn’t help but remember the Key & Peele skit about joke theft - “high on potenuse.” All this AI training feels similar to me on some level. Yeah, it’s “just making a copy,” but on the other hand the person who originated the idea doesn’t get to participate in the success.

Life is hard, but at least on the other hand, it’s also unfair.

JohnFen · 1h ago
genAI videos are already making YouTube worse than it was, and that trend is only starting. Maybe that, plus Google using user videos in this way, will finally allow one of their competitors to gain more traction.
paxys · 4h ago
Well, no shit.

Remember when OpenAI's CTO was asked to confirm that they don't use YouTube to train Sora and she evaded the question...?

Everyone is training on everything they can get their hands on, period.

cavisne · 4h ago
Hilariously, there was a story about how Google could not train on YouTube data due to their TOS, so they changed it for new videos. Meanwhile everyone else was scraping YouTube as much as they liked and training on it.
superkuh · 4h ago
Additionally, no one blocks googlebot even though it's being used just as much for LLM/etc AI training as any web spider out there. Too big to block. Too big to not use.
krunck · 5h ago
... because YouTube videos meet some minimum level of content quality?
adzm · 4h ago
There is just so much of it, on so many different topics. Especially esoteric things that aren't popular "influencer" things that everyone is going to think of initially.
kube-system · 4h ago
Most of it is better than this: https://www.youtube.com/watch?v=XQr4Xklqzw8
leumon · 4h ago
Just like with LLMs, for the base model you need quantity, not quality. It just needs to learn how to correctly predict the next frame.
add-sub-mul-div · 5h ago
If low quality influencer garbage is what people are watching, they'll be happy to generate more of it and I don't think they'll lose sleep about the quality.
greatgib · 5h ago
It kind of does make sense, like a Library would use the books at its disposal.

But what is not normal is that they will readily block, ban and sue you if you try to do the same, as if the catalog of content belonged to them.

kube-system · 4h ago
> It kind of does make sense, like a Library would use the books at its disposal.

Libraries don't really "use" books to produce anything, except to support accessibility like translations or indexing. Their lending of books is under the first-sale doctrine, which wouldn't be applicable to YouTube videos streamed under license.

> But what is not normal is that they will readily block, ban and sue you if you try to do the same, as if the catalog of content belonged to them.

Because they do have rights to the content. All of the content on YouTube has been licensed to YouTube, and the licensor has assigned some rights to them.

echoangle · 4h ago
Does it not? Do you not give those rights to YouTube once you upload a video?
throwaway29843 · 3h ago
Not all YouTube videos are uploaded by their rights holders though. There's plenty of stuff reuploaded from other platforms, which Google is feeding into their AI indiscriminately.