TLDR: Cloudflare has launched its new Data Platform, a managed solution for analytical data that eliminates egress fees. This platform integrates Cloudflare Pipelines, R2 Data Catalog, and R2 SQL, offering an affordable, usage-based pricing model and leveraging open standards like Apache Iceberg to challenge traditional cloud providers.
Cloudflare has officially announced the open beta launch of its new Data Platform, a comprehensive managed solution designed for ingesting, storing, and querying analytical data tables. This innovative platform distinguishes itself by adopting open standards such as Apache Iceberg and, most notably, by eliminating egress fees, a move poised to significantly alter the economics of data warehousing.
The Cloudflare Data Platform is a culmination of several existing Cloudflare services: Cloudflare Pipelines, R2 Data Catalog, and R2 SQL. Cloudflare Pipelines are responsible for collecting events sent through Workers or HTTP, processing them using SQL, and then storing them either in Iceberg tables or as files on R2. The R2 Data Catalog, previously in public beta, tracks Iceberg metadata and now manages routine maintenance tasks like compaction to enhance query performance. R2 SQL serves as a distributed serverless query engine, capable of handling petabyte-scale datasets stored in R2.
According to Micah Wylde, principal engineer at Cloudflare, Alex Graham, senior systems engineer, and Jérôme Schneider, staff software engineer, “Analytical data is critical for modern companies. It allows you to understand your users’ behavior, your company’s performance, and alerts you to issues. But traditional data infrastructure is expensive and hard to operate, requiring fixed cloud infrastructure and in-house expertise. We built the Cloudflare Data Platform to be easy enough for anyone to use with affordable, usage-based pricing.”
Jamie Lord, a solution architect at CDS UK, emphasized the transformative impact of zero egress fees, stating, “Zero egress fees fundamentally changes the economics of data warehousing. Cloudflare’s new Data Platform leverages this advantage to challenge AWS and Google’s stranglehold on analytical workloads.” Lord further highlighted that companies often “bleed money on data transfer costs,” with petabyte-scale operations potentially spending millions annually just to move data between regions for analysis. Cloudflare’s platform aims to eliminate these costs entirely.
The elimination of egress fees is a strategic move by Cloudflare, building on its earlier initiatives like the R2 object storage service, which also boasts zero egress fees. This approach directly challenges the pricing models of major cloud providers like AWS, Google Cloud, and Microsoft Azure, which traditionally charge significant fees for data egress, often seen as a vendor lock-in tactic.
The announcement also sheds light on Cloudflare’s acquisition of Arroyo six months prior. Micah Wylde, co-founder and CEO of Arroyo, noted the initial confusion surrounding Cloudflare’s interest in a stream processing engine. The Data Platform now clarifies this, demonstrating how Arroyo’s capabilities are integrated into Cloudflare Pipelines for SQL transformations, enabling use cases such as schematizing, normalizing data, or redacting sensitive information before storage. While Pipelines currently supports stateless transformations, it lays the groundwork for more advanced data processing.
Also Read:
- The Synergy of Python SDKs and AI Agents: A New Era for Data Pipeline Automation
- MongoDB’s AI-Centric Strategy Fuels Strong Growth and Market Confidence
The broader implications of scrapping egress fees extend to fostering innovation, particularly in the realm of Artificial Intelligence. As highlighted in a Cloudflare article, “Scrapping egress fees would unleash AI’s full potential.” AI applications require vast amounts of training data and significant computing power, necessitating the movement of large datasets between platforms. Egress fees act as a barrier to this, hindering the full realization of AI’s transformative potential. By removing these fees, Cloudflare aims to empower organizations to harness AI without financial constraints related to data transfer, promoting multi-cloud strategies and greater reliability.


