Yellowbrick Data does that cloud warehousing thing

Yellowbrick Data has introduced a public cloud version of its data warehouse, so customers no longer have to buy its custom scale-out analytics appliance.

The Cloud Data Warehouse is a SaaS offering that connects customers to a single Yellowbrick Cloud Data Warehouse via multiple public and private clouds. The service supports data warehouse volumes ranging from 10TB to several petabytes.

The Yellowbrick Cloud Data Warehouse has operated in enterprise production environments since early 2019. It now has the ability to replicate between the on-premises Yellowbrick Data Warehouse Appliance and the cloud, to support hybrid deployments on-premises with users in the cloud, and to move data to and from the cloud at will.

Cloud DR provides an up-to-date replica in the cloud of an on-premises or cloud data warehouse at a substantially lower cost than purchasing additional instances, according to Yellowbrick.

The Yellowbrick Cloud Data Warehouse is generally available with support for AWS, Azure and Google Cloud Platform. Yellowbrick Cloud Disaster Recovery will be generally available later this year.

Blocks & Files has been told by Yellowbrick that the Cloud Data Warehouse runs on Yellowbrick hardware in a co-location site with very fast direct connections to the public cloud. This means that Yellowbrick’s custom high-speed hardware is running the software and not the public cloud provider’s hardware. That may happen in the future.

About Yellowbrick

Yellowbrick Data was founded in 2014 by Chief Revenue Officer Jim Dawson, a previous chief sales officer at all-flash pioneer Fusion-io, and CEO Neil Carson. Carson is also an Fusion-io alumnus, having been its CTO in what now seems like the distant past. 

CEO Neil Carson (left) and CRO Jim Dawson (right)

Its first data warehouse product, an integrated and scale-out analytics system, hit the market in September 2017. This all-flash system with custom hardware and software streamed data directly from the SSDs into the CPU’s cache.

The system used NVMe and bypassed main memory to get extra performance. This involved Yellowbrick writing its own operating system, schedulers, memory managers, device drivers and file systems. The claim was that users could get comparable performance to an in-memory database like  SAP HANA, but with less server infrastructure.

Yellowbrick has bagged $173m in three funding rounds between August 2018 and June 2019 This is dwarfed by rival Snowflake’s funding total – a massive $923m including $713.5m raised in two rounds in 2018.

Investors in both companies are no doubt attracted by the revenue potential for cloud data warehousing- projected by Allied Market Research projects to reach $34.7bn by 2025.