Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal

submitted by edited

www.wired.com/story/new-documents-unredacted-me…

Paywall bypass: https://12ft.io/proxy?q=https://www.wired.com/story/new-documents-unredacted-meta-copyright-ai-lawsuit

10
229

Log in to comment

10 Comments

The “piracy database” in question is: https://en.m.wikipedia.org/wiki/Library_Genesis

Screw scholarly journals and giant publishers for squeezing cash out of already-thin academics while being shitty gatekeepers, letting complete trash through while blocking others (among other things). They’re greedy, stagnant, exploitive middlemen.

Doesn’t make what Zuck\Facebook did OK, but I have *zero* sympathy for the monopolistic “victims” here, especially since it’s going to open weight models one can use for free.

were escalated to Meta CEO Mark Zuckerberg (referred to as "MZ" in the memo handed over during discovery) and that Meta's AI team was "approved to use" the pirated material.

Looks like ol Zucky got caught red handed lmao

I used to work in the valley. As a general rule, the higher up in the org the person is, the more relaxed they are about a little lawbreaking.

Its fine when they do it and we get fined when we do it

Off topic, but I am jealous of that handle.

Haha thanks, I was playing mass effect Andromeda at the time I though of it.

Not sure why it matters where the material came from at this point. Haven't courts already said LLM learning didn't constitute a violation of copyright? That conversation stopped very abruptly just a bit after ChatGPT became available to everyone.

Deleted by moderator

 reply
33

Comments from other communities

Wait everyone, calm down, piracy is ok if its a big corporation doing it.

"Too big to fail" = "We cannot give real concequences for bad actions, but we can take a cut after the fact in the form of a weak ass fee."

Earlier reports suggested they trained it on books from Bibliotik.

What changed?

Probably just both honestly.

In for a penny and for a pound.

The llama-1 paper acknowledged the use of the books dataset, libgen isn't mentioned in any of the papers so this is new info.

You heard it here first, folks! Zucc says piracy is a-ok!

What are you going to pirate from facebook though?

Resources!

Like wood and stone? Sweet

Whatever's not nailed down, unless you've got a hammer!

by
[deleted]

Deleted by moderator

 reply
8

True fact: Zuckerberg has wood for sheep.

Remember that video in his backyard, when he said he gets rockhard for the meats.

I hope they also pirated from disney and nintendo

Good. I'm sure despite us all knowing what a piece of shit meta is we will continue to use their products because such a small sacrifice is a bridge too far.

To be honest I don't think anyone's using meta AI. I didn't even know they had an AI.

It's called Llama and it's one of the more prominent open source LLMs. So it's got that going for it at least.

Meta ai is pretry much the basis of all open weight models. I fact i use a meta model as my main model, a dolphin fine tuning but still meta based.

I use it to release stress, I just hurl abuse at it randomly.

Article is paywalled but had no idea either.

I’ve just not used fb or insta for years but I asked my wife who uses both regularly and she hadn’t heard of it either.

IIRC Meta was one of the first companies to publicly release a language model that you could run on your computer, called llama. Hence the naming of projects like llama.cpp and ollama.

Oh they definitely are. AI aside if people want to see change in the way these companies operate, they need to abandon anything they make.

by
[deleted]

It's ok omid the Zuck does it. Like when he stole land from indigenous Hawaiians.

Deleted by moderator

 reply
-32

Sir, this is a Wendy's

SPICY CHICKEN SANDWICH REPRESENTATION

Deleted by moderator

 reply
-12

I was curious to see if Wendy's was on the BDS list. It's not, but I also found this post, which is quite a trip. https://old.reddit.com/r/wendys/comments/1b25k3b/boycott/