OpenAI sued over alleged copyright infringement, accused of using authors’ books to train ChatGPT

published by, Shaurya Sharma

Last Update: June 30, 2023, 09:55 AM IST

San Francisco, California, United States

The ChatGPT 4 performed better than the 3.5, especially when given the imaging options available.

The ChatGPT 4 performed better than the 3.5, especially when given the imaging options available.

Two US authors sued OpenAI in San Francisco federal court on Wednesday, claiming in a proposed class action that the company misused their works to “train” its popular generative artificial-intelligence system ChatGPT.

Two US authors sued OpenAI in San Francisco federal court on Wednesday, claiming in a proposed class action that the company misused their works to “train” its popular generative artificial-intelligence system ChatGPT.

Massachusetts-based authors Paul Tremblay and Mona Awad said ChatGPT mined data copied from thousands of books without permission, thereby infringing on the authors’ copyrights.

Matthew Buttrick, a lawyer for the authors, declined to comment. Representatives for OpenAI, a private company backed by Microsoft Corp, did not immediately respond to a request for comment.

Several legal challenges have been filed over the material used to train state-of-the-art AI systems. The plaintiffs include source-code owners against OpenAI and Microsoft’s GitHub, and visual artists against Stability AI, Midjourney and DeviantArt.

The targets of the lawsuit have argued that their systems constitute fair use of copyrighted works.

ChatGPT responds to users’ text prompts in a conversational manner. Earlier this year it became the fastest-growing consumer application in history, reaching 100 million active users in January, just two months after launching.

ChatGPT and other generative AI systems create content using vast amounts of data scraped from the Internet. Tremblay and Awad’s lawsuit states that the books are a “key component” because they represent the “best examples of high-quality long-form writing”.

The complaint estimates that OpenAI’s training data contains more than 300,000 books, including illegal “shadow libraries” that offer copyrighted books without permission.

Awad is known for novels such as “13 Ways to Look at a Fat Girl” and “Bunny”. Tremblay’s novels include “The Cabin at the End of the World”, which was adapted into the M. Night Shyamalan film “Knock at the Cabin”, released in February.

Tremblay and Awad said that ChatGPT can generate “very accurate” summaries of their books, indicating that they appear in its database.

The lawsuit seeks damages for an unspecified amount of money on behalf of a nationwide class of copyright owners whose works OpenAI allegedly misappropriated.

(This story has not been edited by News18 staff and is published from a syndicated news agency feed – reuters,