Tag: Cloudflare

  • Cloudflare Blocks Unpaid AI Web Scrapers from Accessing Data

    Cloudflare Blocks Unpaid AI Web Scrapers from Accessing Data

    Key Takeaways

    1. Cloudflare’s CEO Matthew Prince announced that all AI web crawler bots will be blocked by default to protect content creators.
    2. The online search environment is increasingly dominated by AI chatbots, making it harder for content creators to gain traffic and recognition for their work.
    3. AI crawlers are extracting data without compensating original content creators, leading to a sense of unfairness in the web ecosystem.
    4. Cloudflare plans to launch a marketplace to connect content creators with AI companies, focusing on content quality and knowledge enhancement.
    5. Recent disruptions caused by aggressive AI crawlers have led platforms like SourceHut to block major cloud service providers due to excessive traffic.


    Declaring “Content Independence Day,” Cloudflare’s CEO Matthew Prince shared significant updates to the company’s web service system. From now on, all AI web crawler bots will be blocked by default.

    In a blog entry, Prince explained how the current online search environment is dominated by AI chatbots, like Google’s Gemini and OpenAI’s ChatGPT. While these tools provide value, they also extract data from the internet without any consequences, neglecting to reward the original content creators.

    Challenges for Content Creators

    Prince pointed out that recent modifications in Google Search have made it ten times “more difficult for a content creator to get the same volume of traffic” as they did a decade ago.

    He stated, “Instead of being a fair trade, the web is being stripmined by AI crawlers, with content creators seeing almost no traffic and thus almost no value.”

    Prince expressed that the content being scraped serves as “the fuel that powers AI engines,” and it is only just that the original creators receive compensation for their work.

    New Marketplace Initiative

    Cloudflare also unveiled plans for a new marketplace designed to connect creators with AI companies. This marketplace will evaluate available content not just based on the traffic it brings in but also “on how much it furthers knowledge.” Prince is optimistic that this will help AI engines improve swiftly, potentially ushering in a new golden age of high-quality content creation.

    He acknowledged that he doesn’t have all the solutions right now, but the company is collaborating with “leading computer scientists and economists to find them.”

    Recent Issues with AI Crawlers

    Recently, SourceHut, a platform for hosting open-source Git repositories, reported disruptions caused by “aggressive LLM crawlers.” They have blocked multiple cloud service providers, including Google Cloud and Microsoft Azure, due to the overwhelming traffic coming from their networks.

    In January, DoubleVerify, a web analytics platform, noted an 86% rise in General Invalid Traffic (GIVT) from AI scrapers and other automated tools compared to 2024.

    Despite previous commitments, OpenAI’s GPTbot has also discovered methods to ignore or bypass a site’s robots.txt file entirely, leading to an enormous increase in traffic for domain owners and potentially high costs.

    Source:
    Link

  • Cloudflare, PrivadoVPN, and Others Removed from Indian App Stores

    Cloudflare, PrivadoVPN, and Others Removed from Indian App Stores

    The Indian government has taken steps to remove various VPN applications from its app stores. Notable services impacted by these actions include Cloudflare, PrivadoVPN, and Hide.me.

    Document Review and Requests

    TechCrunch looked into the documents released by the ministry and found a notice from Google to Lumen, a database from Harvard that keeps track of government removal requests. A developer shared with the publication a message from Apple, which referenced a request from the Indian Cyber Crime Coordination Center for the app’s removal based on a violation of local laws.

    New Regulations for VPN Providers

    In 2022, India introduced new regulations that mandated VPN companies to keep user data for a period of five years. This data includes IP addresses, names, email addresses, verified phone numbers, and physical addresses. Furthermore, these providers must supply information to the government when it is requested.

    Impact on VPN Companies

    As a result of these new rules, many VPN services like NordVPN, Surfshark, and ExpressVPN decided to close their local servers in India. Additionally, a group of cybersecurity experts urged for public discourse regarding the new regulations through an open letter to India’s Computer Emergency Response Team.

    Previous App Bans

    In 2020, India also prohibited 59 Chinese applications. Among those still banned are TikTok, UC Browser, WeChat, and ES File Explorer, which remain inaccessible in the country.

    Source: Link