63 2 days ago

While I also disagree with the court order and OpenAI's implementation (along with pretty much everything else the company does), the conspiratorial thinking in the comments here is unfounded. The order is overly broad but in the context of the case, it's not totally unwaranted. This is not some conspiracy to collect data on people. I'm confident the records will be handled appropriately once the case concludes (and if they're not, we can be upset then, not now). Let's please reserve our outrage for the plethora of very real issues going on right now.

5
Aeolun 2 days ago

> and if they're not, we can be upset then, not now

Like we could be upset when that credit checking company dumped all those social security numbers on the net and had to pay the first 200k claimants a grand total of $21 for their trouble?

By that point it’s far too late.

pier25 2 days ago

Are we talking about the same company that needs data desperately and has used copyrighted material illegally without permission?

simonw 2 days ago

When did they use copyrighted material illegally?

I didn't think any of the ongoing "fair use" lawsuits had reached a conclusion on that.

jamessinghal 2 days ago

The Thomson Reuters case [1] is the most relevant in the court's finding that the copying of copyrighted material from Westlaw by Ross Intelligence was direct copyright infringement and was not fair use.

The purpose of training in many of the AI Labs being sued mostly matches the conditions that Ross Intelligence was found to have violated, and the question of copying is almost guaranteed if they trained on it.

[1] Thomson Reuters Enterprise Centre GmbH et al v. ROSS Intelligence Inc. https://www.ded.uscourts.gov/sites/ded/files/opinions/20-613...

simonw 2 days ago

Thanks, I hadn't seen that one.

pier25 2 days ago

ok then let's say they used the copyrighted material without permission

pier25 2 days ago

Sorry, I meant to write "monetized copyrighted material without permission".

We'll see if the courts deem it legal but it's, without a doubt, unehtical.

FeepingCreature 2 days ago

Eh, I have the opposite view but then again I'm a copyright minimalist.

pier25 1 day ago

So you think artists do not need to be able to make a living?

simonw 2 days ago

That's true, they did.

basilgohar 2 days ago

In what world do you live in where corporations have any right to a benefit of the doubt? When did that legitimately pan out?

vlovich123 2 days ago

> I'm confident the records will be handlded appropriately once the case concludes (and if they're not, we can be upset then, not now)

This makes no sense to me. Shouldn't we address the damage before it's done vs handwringing after the fact?

verdverm 2 days ago

the damage to which party?

Aeolun 2 days ago

Certainly not the copyright holders, which have lost any form of my sympathy over the past 25 years.

odo1242 2 days ago

Users, presumably

phkahler 2 days ago

You're pretty naive. OpenAI is still trying to figure out how to be profitable. Having a court order to retain a treasure trove of data they were already wanting to keep while offering not to, or claiming not to? Hahaha.

tomnipotent 2 days ago

Preserving data for a judicial hold does not give them leeway to use that data for other purposes, but don't let facts get in the way of your FUD.

phkahler 1 day ago

>> Preserving data for a judicial hold does not give them leeway to use that data for other purposes

Does not give them permission. What if LEO asks for the data? Should they hand it over just because they have it? Remember, this happens all the time with metadata from other companies (phone carriers for example). Having the data means it's possible to use it for other purposes as opposed to not possible. There is always pressure to do so both from within and outside a company.

tomnipotent 1 day ago

> Should they hand it over just because they have it?

Not unless LEO sues OpenAI while it's preserving data from the first discovery, otherwise they cannot be compelled to give up data. Nor are they allowed to violate their TOS and use the data outside of retention, despite the FUD you want to spread about it.

> Having the data means it's possible

No, it doesn't. That's not how any of this works.