Congress Wants Tech Companies to Pay Up for AI Training Data

By Carolina Stanton On Jan 10, 2024

Do AI firms must pay for the coaching knowledge that powers their generative AI techniques? The query is hotly contested in Silicon Valley and in a wave of lawsuits levied towards tech behemoths like Meta, Google, and OpenAI. In Washington, DC, although, there appears to be a rising consensus that the tech giants must cough up.

Today, at a Senate listening to on AI’s affect on journalism, lawmakers from each side of the aisle agreed that OpenAI and others ought to pay media shops for utilizing their work in AI initiatives. “It’s not only morally right,” stated Richard Blumenthal, the Democrat who chairs the Judiciary Subcommittee on Privacy, Technology, and the Law that held the listening to. “It’s legally required.”

Josh Hawley, a Republican working with Blumenthal on AI laws, agreed. “It shouldn’t be that just because the biggest companies in the world want to gobble up your data, they should be able to do it,” he stated.

Media business leaders on the listening to right now described how AI firms have been imperiling their business through the use of their work with out compensation. Curtis LeGeyt, CEO of the National Association of Broadcasters, Danielle Coffey, CEO of the News Media Alliance, and Roger Lynch, CEO of Condé Nast, all spoke in favor of obligatory licensing. (WIRED is owned by Condé Nast.)

Coffey claimed that AI firms “eviscerate the quality content they feed upon,” and Lynch characterised coaching knowledge scraped with out permission as “stolen goods.” Coffey and Lynch additionally each stated that they imagine AI firms are infringing on copyright beneath present regulation. They urged lawmakers to make clear that utilizing journalistic content material with out first brokering licensing agreements is just not protected by honest use, a authorized doctrine that allows copyright violations beneath sure situations.

Common Ground

Senate hearings may be adversarial, however the temper right now was largely congenial. The lawmakers and media business insiders typically applauded every others’ statements. “If Congress could clarify that the use of our content, or other publisher content, for the training and output of AI models is not fair use, then the free market will take care of the rest,” Lynch stated at one level. “That seems eminently reasonable to me,” Hawley replied.

Journalism professor Jeff Jarvis was the listening to’s solely discordant voice. He asserted that coaching on knowledge obtained with out cost is, certainly, honest use, and spoke towards obligatory licensing, arguing that it could injury the data ecosystem moderately than safeguard it. “I must say that I am offended to see publishers lobby for protectionist legislation, trading on the political capital earned through journalism,” he stated, jabbing at his fellow audio system. (Jarvis was additionally topic to the listening to’s solely actual contentious line of questioning, from Republican Marsha Blackburn, who needled Jarvis about whether or not AI is biased towards conservatives and recited an AI-generated poem praising President Biden as proof.)

Outside of the committee room, there may be much less settlement that obligatory licensing is important. OpenAI and different AI firms have argued that it’s not viable to license all coaching knowledge, and a few impartial AI specialists agree.