
Reddit blocks the Internet Archive from crawling its data – here’s why

by n70products
August 12, 2025
in Blockchain


Image credit: Andriy Onufriyenko/Getty Images

ZDNET’s key takeaways

  • The Internet Archive can now only crawl Reddit’s homepage.
  • Reddit’s goal is to block AI companies from scraping Reddit user data.
  • Publishers (and others) are suing AI companies for copyright infringement.

Reddit is protecting its privacy from AI companies that are taking roundabout approaches to scraping its content.

The social media platform, known as a resource where users can post anonymously and find information about virtually any subject, will block the Internet Archive’s Wayback Machine from indexing its online data, according to a Monday report from The Verge. The move is in response to the discovery that AI companies, unable to scrape data from Reddit directly because of the platform’s prohibitive policies, have instead been retrieving its data from indexed content on the Internet Archive and using it to train models.

The Wayback Machine will now only be able to scrape data from Reddit’s homepage, according to The Verge, while access to user profiles, comments, and post detail pages will be blocked.

Launched in 1996, the Internet Archive is a non-profit that operates a vast digital database of web content. The archive is maintained in part by the Wayback Machine, a piece of web-crawling software that gathers web pages and preserves them as they appeared when they were collected, like digital flies in amber. This serves as a resource for researchers studying the evolution of online culture and as digital forensic evidence for law enforcement, among other uses.

What Reddit’s move means

Reddit has previously flagged concerns related to the scraping of its content with the Internet Archive, according to The Verge. The non-profit was also reportedly notified before the web-crawling restrictions started going into effect yesterday.

The Internet Archive has yet to make an official statement about how it plans to respond to Reddit’s new restrictions, and at the time of writing, it has not responded to ZDNET’s request for comment. Wayback Machine director Mark Graham, however, has told several publications that the Internet Archive will “continue to have ongoing discussions about this matter” with Reddit.

Rising tension

Reddit’s reported decision to block the Wayback Machine from scraping the majority of its content arrives during a moment of mounting tension between AI companies and digital publishers, though Reddit is the first tech company to wade into the controversy. The company sued Anthropic in June after discovering that the AI company was illegally scraping its data, but it has also previously signed licensing deals with both Google and OpenAI.

(Disclosure: Ziff Davis, ZDNET’s parent company, filed an April 2025 lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.)

AI developers require access to gargantuan troves of data to train generative AI models, which are designed to identify and replicate subtle mathematical patterns gleaned from those training datasets.

Many of these companies have scraped training data from publicly available websites, including social media sites and news outlets, claiming legal immunity under a concept known in copyright law as fair use. (The courts are still untangling the legitimacy of that argument, and will likely be doing so for some time.)

Many of the organizations whose content has been copiously scraped, including a cohort of authors and other artists, have responded with lawsuits.

Others, meanwhile, have signed content licensing agreements with the likes of OpenAI, Anthropic, and Google, consenting to the use of their organizations’ data in exchange for increased visibility in the responses generated by chatbots, or other benefits.




