Bill Proposal: The Suchir Balaji Internet Data Protection Act
Purpose: To protect the commercial viability of individuals, businesses, and internet services by regulating the use of copyrighted digital data in artificial intelligence (A.I.) training processes, inspired by the concerns raised by Suchir Balaji in the October 23, 2024, New York Times
A BILL
To establish legal protections for copyrighted digital data used in A.I. development, ensuring fair compensation and consent for creators and content providers, and to promote a sustainable internet ecosystem.
To Be enacted by the Senate and House of Representatives of the United States of America in Congress assembled,
SECTION 1. SHORT TITLE
This Act may be cited as the "Suchir Balaji Internet Data Protection Act."
SECTION 2. FINDINGS
Congress finds the following, based on the concerns articulated by Suchir Balaji, a former OpenAI researcher, as reported in the New York Times on October 23, 2024:
A.I. companies, including OpenAI, have used vast amounts of internet data, including copyrighted material, to train models like ChatGPT without explicit consent or compensation to creators.
Such practices threaten the commercial viability of individuals, businesses, and internet services that produce digital content, as A.I. systems compete with original content providers.
The unchecked use of copyrighted data in A.I. training risks undermining the sustainability of the internet ecosystem, as highlighted by Mr. Balaji’s conclusion that these technologies cause more societal harm than benefit.
Legal clarity is needed to balance innovation with the rights of content creators, ensuring fair use principles are applied transparently and equitably.
SECTION 3. DEFINITIONS
For the purposes of this Act:
A.I. Training Data:
Copyrighted Data:
Content Creator:
A.I. Developer: Any entity engaged in the creation, training, or deployment of A.I. models using internet-derived data.
SECTION 4. PROVISIONS
(a) Consent and Compensation for Copyrighted Data
(b) Transparency Requirements
(c) Protection of Internet Ecosystem
The FTC shall establish guidelines to assess the impact of A.I. technologies on the internet ecosystem and recommend corrective actions.
(d) Enforcement
SECTION 5. IMPLEMENTATION
SECTION 6. JUSTIFICATION
This bill is substantiated by Suchir Balaji’s concerns, as reported in the New York Times, that A.I. technologies like ChatGPT, built on copyrighted data without permission, harm content creators and destabilize the internet ecosystem. Mr. Balaji’s resignation from OpenAI and his public stance highlight the urgency of addressing these immediate threats, beyond speculative future risks. The bill aligns with ongoing lawsuits, such as The New York Times v. OpenAI, and seeks to codify protections for creators while fostering responsible A.I. innovation.