LEAKED: A New List Reveals Top Websites Meta Is Scraping of Copyrighted Content to Train Its AI

cm0002@lemmy.world · 3 days ago

LEAKED: A New List Reveals Top Websites Meta Is Scraping of Copyrighted Content to Train Its AI

who@feddit.org · edit-2 3 days ago

Lemmy really hates piracy… in this specific context.

Specifically, Lemmy hates it when corporations profit by using people’s work without permission or payment, especially at a large scale.

I don’t think Lemmy would complain about a poor student scraping a web page in order to learn something.

mindbleach@sh.itjust.works · 3 days ago

Seeking distinctions is pretense. They’re just shuffling cards.

You can ask about models made from public-domain data, and most critics will not budge an inch. Mentioning copyright is working backwards from a gut feeling. The ones who say, sure, okay, it’d be different if– - maybe they have a consistent rationale. But even some of them haven’t examined how they’d feel about this technology, if all their complaints were addressed.