#ai-training-data

[ follow ]
Artificial intelligence
fromTechCrunch
3 days ago

Micro1, a competitor to Scale AI, raises funds at $500M valuation | TechCrunch

Micro1 raised $35 million Series A at a $500M valuation while rapidly growing ARR to $50M, positioning to supply human-labeled data for AI labs.
Artificial intelligence
fromCNET
4 days ago

Online Media Brands Hope a New Protocol Will Stop Unwanted AI Crawlers

Major online publishers are adopting RSL licensing to block unauthorized AI scraping and require payment when AI trains on their content.
Artificial intelligence
fromBusiness Matters
4 days ago

TikTok tops list of most scraped websites as AI training reshapes data priorities

TikTok became the world’s most scraped website in 2025 after a 321% surge, reflecting AI-driven demand for multimodal training data.
fromFortune
4 days ago

Google's AI is the 'worst' for stealing content, says People CEO | Fortune

When Google became the dominant search engine around 2004, not everyone was happy. Everyone from book publishers to music studios blasted the company for helping itself to copyrighted content without paying. The search giant eventually smoothed things over but now, twenty years later, Google has become the media industry's villain all over again-this time for gobbling that same content to train its AI tools.
Artificial intelligence
Artificial intelligence
fromThe Verge
5 days ago

The web has a new system for making AI companies pay up

Really Simple Licensing (RSL) lets web publishers specify licensing and royalty terms in robots.txt and other content to require payment for AI training-data scraping.
fromZDNET
5 days ago

Publishers are fighting back against AI with a new web protocol - is it too late?

The idea behind RSL is brutally simple. Instead of the old file -- which only said, "yes, you can crawl me," or "no, you can't," and which AI companies often ignore -- publishers can now add something new: machine-readable licensing terms. Want an attribution? You can demand it. Want payment every time an AI crawler ingests your work, or even every time it spits out an answer powered by your article?
Media industry
#copyright
fromArs Technica
6 days ago
Intellectual property law

Judge: Anthropic's $1.5B settlement is being shoved "down the throat of authors"

fromFortune
2 weeks ago
Artificial intelligence

Anthropic landmark copyright settlement with authors may set a precedent for the whole industry

fromArs Technica
6 days ago
Intellectual property law

Judge: Anthropic's $1.5B settlement is being shoved "down the throat of authors"

fromFortune
2 weeks ago
Artificial intelligence

Anthropic landmark copyright settlement with authors may set a precedent for the whole industry

#copyright-infringement
fromwww.theguardian.com
5 days ago
Intellectual property law

Tech companies are stealing our books, music and films for AI. It's brazen theft and must be stopped | Anna Funder and Julia Powles

fromwww.theguardian.com
5 days ago
Intellectual property law

Tech companies are stealing our books, music and films for AI. It's brazen theft and must be stopped | Anna Funder and Julia Powles

#anthropic
fromFortune
6 days ago
Intellectual property law

'We'll see if I can hold my nose and approve it': Judge hates $1.5b AI settlement with book authors so much he's taking 2 weeks to think it over | Fortune

Artificial intelligence
fromZDNET
2 weeks ago

Anthropic agrees to settle copyright infringement class action suit - what it means

Anthropic agreed to a proposed class settlement with three authors over alleged use of pirated works to train its Claude chatbot.
Artificial intelligence
fromArs Technica
2 weeks ago

Authors celebrate "historic" settlement coming soon in Anthropic class action

A settlement in principle was reached in a class-action over Anthropic's AI training data, potentially impacting millions of claimants and the AI industry.
fromFortune
6 days ago
Intellectual property law

'We'll see if I can hold my nose and approve it': Judge hates $1.5b AI settlement with book authors so much he's taking 2 weeks to think it over | Fortune

fromZDNET
2 weeks ago
Artificial intelligence

Anthropic agrees to settle copyright infringement class action suit - what it means

Books
fromDefector
1 week ago

If The Thieving AI Company Can Survive The Legal Settlement, Then It Is Not Big Enough | Defector

A book is the accumulation of a lifetime of experiences, not merely the time spent drafting and arranging words.
fromArs Technica
1 week ago

"First of its kind" AI settlement: Anthropic to pay authors $1.5 billion

believed to be the largest publicly reported recovery in the history of US copyright litigation.
Intellectual property law
Information security
fromDataBreaches.Net
1 week ago

Hackers Threaten to Submit Artists' Data to AI Models If Art Site Doesn't Pay Up - DataBreaches.Net

LunaLock stole Artists&Clients data, demanded $50,000, threatened public release and to submit stolen artwork to AI companies for inclusion in training datasets if unpaid.
Intellectual property law
fromPatently-O
2 weeks ago

Anthropic Settles the Authors' Class Action on Training Data: What It Means for Fair Use, Compensation, and Competition

Fair use can permit training on lawfully obtained books, but acquiring and centrally storing pirated works can still constitute copyright infringement.
Marketing tech
fromstupidDOPE | Est. 2008
2 weeks ago

SEO Agencies Are Quietly Using stupidDOPE Syndication to Outrank Competitors | stupidDOPE | Est. 2008

Syndication on platforms like stupidDOPE boosts long-term search visibility, fuels AI training data, and provides sustained SEO advantages over fleeting social posts.
Artificial intelligence
fromTechCrunch
2 weeks ago

Anthropic settles AI book-training lawsuit with authors | TechCrunch

Anthropic settled a class-action lawsuit over its use of books to train large language models despite a partial fair-use victory.
fromAdExchanger
3 weeks ago

The IAB Tech Lab Isn't Pulling Any Punches In The Fight Against AI Scraping | AdExchanger

Digital publishers are in a losing battle against Big Tech and AI for traffic and ad revenue. But the IAB Tech Lab has a plan to swing the momentum back in pubs' favor. On Tuesday, the Tech Lab announced a new publisher-focused working group that aims to ensure pubs are fairly compensated when AI scrapes their content for training. IAB Tech Lab CEO Anthony Katsur previewed the new initiative at AdMonsters' Sell Side Summit in Nashville, Tenn., on Monday.
Artificial intelligence
fromArs Technica
2 months ago

Anthropic destroyed millions of print books to build its AI models

The AI industry's quest for high-quality training data has led companies like Anthropic to explore controversial practices in acquiring books for their models.
Artificial intelligence
[ Load more ]