this post was submitted on 22 Aug 2023
772 points (95.7% liked)

Technology

76361 readers
1531 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

you are viewing a single comment's thread
view the rest of the comments
[–] Tetsuo@jlai.lu 16 points 2 years ago* (last edited 2 years ago) (4 children)

If I'm not mistaken AI work was just recently considered as NOT copyrightable.

So I find interesting that an AI learning from copyrighted work is an issue even though what will be generated will NOT be copyrightable.

So even if you generated some copy of Harry Potter you would not be able to copyright it. So in no way could you really compete with the original art.

I'm not saying that it makes it ok to train AIs on copyrighted art but I think it's still an interesting aspect of this topic.

As others probably have stated, the AI may be creating content that is transformative and therefore under fair use. But even if that work is transformative it cannot be copyrighted because it wasn't created by a human.

[–] Even_Adder@lemmy.dbzer0.com 10 points 2 years ago* (last edited 2 years ago)

If you're talking about the ruling that came out this week, that whole thing was about trying to give an AI authorship of a work generated solely by a machine and having the copyright go to the owner of the machine through the work-for-hire doctrine. So an AI itself can’t be authors or hold a copyright, but humans using them can still be copyright holders of any qualifying works.

[–] habanhero@lemmy.ca 1 points 2 years ago (1 children)

How do you tell if a piece of work contains AI generated content or not?

It's not hard to generate a piece of AI content, put in some hours to round out AI's signatures / common mistakes, and pass it off as your own. So in practise it's still easy to benefit from AI systems by masking generate content as largely your own.

[–] Even_Adder@lemmy.dbzer0.com 5 points 2 years ago (1 children)

I mean, this is the exact way the U.S. Copyright Office's guidance says they think you should use it.

[–] habanhero@lemmy.ca 3 points 2 years ago (1 children)

Sure, but even under this guidance copyright owners of the training data are still shafted, based on how the data is scraped pretty much freely. Can an opportunist generate an unofficial sequel to Harry Potter, do the minimum to ensure they get copyright and reap the reward from it?

[–] Even_Adder@lemmy.dbzer0.com 3 points 2 years ago (2 children)

That's how copyright has always worked. Fair use is complex, but as long as you're not straight up copying someone's work you're fine. 50 Shades of Grey started out as Twilight fanfiction. So yeah, you could.

[–] habanhero@lemmy.ca 3 points 2 years ago* (last edited 2 years ago) (1 children)

Yes fair use has its stipulations but AI is breaking new grounds here. We are no longer dealing with the reaction videos but in an era where literally dozen of pages of content can be generated in a matter of minutes, with relatively little human involvement. Perhaps it's time to revisit if the law still holds in light of these new technology and tools.

[–] Even_Adder@lemmy.dbzer0.com 1 points 2 years ago (1 children)

You should read this article by Kit Walsh, who’s a senior staff attorney at the EFF.

[–] habanhero@lemmy.ca 1 points 2 years ago

Interesting read! Definitely a useful breakdown and I see the reasoning. Thanks for sharing.

[–] BURN@lemmy.world 1 points 2 years ago (1 children)

Fair use has never been seriously challenged. I’m betting it might happen soon though. We have to remember Fair Use isn’t a law, it’s a set of guidelines under the law that has never been clearly defined.

[–] Even_Adder@lemmy.dbzer0.com 1 points 2 years ago* (last edited 2 years ago)

First of all, fair use is not a set of guidelines, it's a legal doctrine that allows us limited use of copyrighted material without permission from the owner. It is a part of the U.S. Copyright Act, which is a law enacted by Congress.

Second, fair use has been seriously challenged plenty of times, just to name a few:

  • Campbell v. Acuff-Rose Music, Inc.

  • Authors Guild v. Google, Inc.

  • Lenz v. Universal Music Corp.

I recommend reading this article by Kit Walsh, who’s a senior staff attorney at the EFF, a digital rights group who recently won a historic case: border guards now need a warrant to search your phone.

Fair use protects creativity, innovation, and our freedom of expression, but You almost sound like you want it weakened.

[–] XEAL@lemm.ee 1 points 2 years ago (1 children)

How are they going to prove if something was written by an AI? Also, you can take the AI's output and then modify it.

[–] Tetsuo@jlai.lu 2 points 2 years ago

That's definitely an issue. At what point does copyright applies if you are just helped by an AI ?

I guess the courts will have to decide that...

[–] Lucidlethargy@sh.itjust.works -4 points 2 years ago

That's not how copyright works. I'm embarrassed for you, and all the people who blindly upvoted you. Like... Yikes. What's happening to this world?

You can't publish copyrighted work as your own just because you're legally not able to publish copyrighted work. That's a open and shut case of copyright infringement. Why do I have to say this? Am I on candid camera?