OpenAI's lawyer says there are too many files from Ilya Sutskever and other employees to share in copyright lawsuit

Lawyers for OpenAI said requests from the Authors Guild to produce files of Ilya Sutskever and seven others would comprise "over 886,000 documents."

Nov 14, 2024 - 08:00

0 1

OpenAI's lawyer says there are too many files from Ilya Sutskever and other employees to share in copyright lawsuit

OpenAI CEO Sam Altman. — OpenAI is facing several legal battles over claims of copyright infringement.
Andrew Caballero-Reynolds/AFP/Getty Images

OpenAI is trying to negotiate down the number of files it must produce in a copyright case.
Files belonging to ex-chief scientist Ilya Sutskever are among those under dispute.
The Authors Guild's case centers on claims that OpenAI trained AI models on books without permission.

A lawyer for OpenAI is seeking to negotiate down the number of documents the company must review and disclose in a high-profile copyright lawsuit, arguing that the latest requests involving its cofounder Ilya Sutskever and seven other current and former employees are too big and numerous.

In a letter to the judge filed on Wednesday in New York federal court, OpenAI lawyer Carolyn M. Homer said files demanded by the Authors Guild from eight additional people would total hundreds of gigabytes of data "comprising over 886,000 documents."

These eight "custodians" — people thought to have relevant evidence to produce in the pretrial discovery process — include former chief scientist and cofounder Sutskever and researcher Jan Leike, who left the company in May for rival firm Anthropic.

The lawsuit centers on claims that OpenAI's models were trained on books without the authors' permission.

Homer also named other disputed custodians, including OpenAI technical staff members Chelsea Voss, Shantanu Jani, and Jong Wook Kim, pretraining data lead Qiming Yuan, and former employees Andrew Mayne and Cullen O'Keefe.

OpenAI has already agreed to produce documents from 24 custodians but is pushing back against proposed search parameters and requests to produce files relevant to eight new custodians over concerns that their files would significantly increase the resources needed to go through them.

According to OpenAI's lawyer, the company's review of the existing 24 custodians, based on its own proposed search terms, would require it to examine "more than 460,000 documents" totaling 359 gigabytes. Homer said that using the Authors Guild's proposed terms, OpenAI would need to review over 1 million documents.

When factoring in OpenAI's proposed search terms for the eight disputed custodians, Homer said the size of the files would be over 375 gigabytes, exceeding the size of the files from the 24 custodians already agreed on by both parties.

The lawyer also said OpenAI estimated a 71% duplication rate based on proposed search terms between the eight disputed custodians and the 24 existing ones.

OpenAI's lawyer said that the "substantial volume of hits," as well as concerns over high duplication rates, meant it would continue to attempt to reach an agreement with the plaintiffs over Sutskever's files and the other disputed custodians.

The dispute marks the latest development in the ongoing class-action lawsuit brought by the Authors Guild — which provides support for writers — against OpenAI. Unsealed documents reviewed by BI this year showed that the ChatGPT maker deleted two datasets, "books1" and "books2," used to train an older AI model named GPT-3.

OpenAI is also facing several other cases over copyright infringement, including one brought against the company by The New York Times.

Authors Guild lawyers said in filings that the datasets may have included "more than 100,000 published books."

OpenAI and the Authors Guild did not immediately respond to Business Insider's request for comment.

Read the original article on Business Insider

Europe fines Meta almost $840 million for linking Marketplace to Facebook

What's Your Reaction?

Dislike

Love

Funny

Angry

Sad

Wow

admin

Welcome to Lakewood Newsbreak, a subsidiary of Lakewood Opinions, LLC. This website is designed o enhance your news delivery. All information belongs to the individual contributor and LNB take no responsibility for any content. We do not sell any information. LNB pulls from over 2,500 RSS news feeds from around the world to bring you the latest updates. Please enjoy.

Blazers heat up from long range, shoot down Timberwolves

admin Nov 13, 2024 0 0

Timothée Chalamet said an agent asked him to put on wei...

admin Nov 13, 2024 0 1

The DOJ wants Google to sell its Chrome browser. Here a...

admin Nov 21, 2024 0 0

Spirit Airlines to keep flying after filing for bankrup...

admin Nov 18, 2024 0 0

Lo & Sons makes some of the best travel bags we've ever...

admin Nov 13, 2024 0 1

Devin Booker, Suns hope to heat up vs. stingy Thunder

admin Nov 15, 2024 0 1

Comments

There are so many Social Media sites out there and they are hard to keep up with. That is why Lakewood Newsbreak has design a Social site design to discuss and post News and World related items of intrest. We are tring to promote feel good news posts to help the world in these harden times. Please be courteous with your comments. Thannk you and enjoy. Please read our Content Policy for any Questions

Enter Here

WHAT IS YOUR FAVORITE PODCAST

Talk Shows

Interviews

Videos

Business Reviews

City Issues

Please select an option!

You already voted this poll before.

WHAT IS YOUR FAVORITE PODCAST

Total Vote: 10

Talk Shows

20 %

Interviews

30 %

Videos

20 %

Business Reviews

20 %

City Issues

10 %

Bill Maher

9NEWS Parade of Lights returning for 50th year

Opinion: Bike parks are popping up across Col...

How Colorado is trying to make the High Line ...

Colorado county clerk spent $4,000 on get-out...

Colorado marijuana sales — and tax dollars — ...

Here's what 5 CEOs learned by becoming underc...

Elon Musk and Vivek Ramaswamy are starting a ...

Today's Mortgage Rates | Rates Still Haven't ...

CD, Checking, and Savings Rates Today: Superc...

McDonald's is bringing back cheap fast food w...

OpenAI's lawyer says there are too many files from Ilya Sutskever and other employees to share in copyright lawsuit

Lawyers for OpenAI said requests from the Authors Guild to produce files of Ilya Sutskever and seven others would comprise "over 886,000 documents."

Europe fines Meta almost $840 million for linking Marketplace to Facebook

The 25 best early Black Friday clothing and shoe deals to scoop up this week

What's Your Reaction?

Tiny Desk Concerts

Inside 25 Affordable Tiny Home Kits and Prefab Home for Sale at Amazon and Home Depot

Follow Us

Social Media

Advertisment ••

Recommended Posts

Where to get free Thanksgiving meals, volunteer in Denv...

We celebrate Thanksgiving a week early. It makes travel...

Target shares plummet 18% in premarket trade as it post...

Best streaming deals and bundles in November 2024

World News

10 Days in VIETNAM: Hanoi, Ha Long Bay, Hoi An,

Ho Chi Minh, Hue | Full Travel Vlog & Guide

Spoken Word

Advertisment ••••

Voting Poll

WHAT IS YOUR FAVORITE PODCAST

WHAT IS YOUR FAVORITE PODCAST

Most Viewed Posts

Marijuana-Friendly Workout Classes in Denver

Will home prices drop in 2025? Here's what experts say.

Golden, Colorado Has a New Old West-Style Tavern

OpenAI's lawyer says there are too many files from Ilya Sutskever and other employees to share in copyright lawsuit

Lawyers for OpenAI said requests from the Authors Guild to produce files of Ilya Sutskever and seven others would comprise "over 886,000 documents."

What's Your Reaction?

Related Posts

Headline News

Tiny Desk Concerts

Inside 25 Affordable Tiny Home Kits and Prefab Home for Sale at Amazon and Home Depot

Follow Us

Social Media

Advertisment ••

Recommended Posts

World News

10 Days in VIETNAM: Hanoi, Ha Long Bay, Hoi An,

Ho Chi Minh, Hue | Full Travel Vlog & Guide

Spoken Word

Advertisment ••••

Voting Poll

WHAT IS YOUR FAVORITE PODCAST

WHAT IS YOUR FAVORITE PODCAST