Previous Articles on Snowflake
Integrating Snowflake with Trino
Previous Articles on CockroachDB CDC
Using CockroachDB CDC with Apache Pulsar
Using CockroachDB CDC with Azure Event Hubs
SaaS Galore: Integrating CockroachDB with Confluent Kafka, FiveTran, and Snowflake
Using CockroachDB CDC with Confluent Cloud Kafka and Schema Registry
CockroachDB CDC using Minio as cloud storage sink
CockroachDB CDC using Hadoop Ozone S3 Gateway as cloud storage sink
Motivation
I work with financial services clients, and a common requirement is streaming changes from an operational data store into a data warehouse or data lake. A former colleague recently reached out for advice on the fastest and most efficient way to load trade data into Snowflake. I’ve come up with at least three methods, which I will explore in a follow-up series of articles. First, however, I want to explore Redpanda Connect, a solution that has recently caught my attention. This is by no means a conclusive guide on how changefeed data must be loaded into Snowflake; here we’re merely exploring the possibilities, and I will discuss the pros and cons in later articles.
CockroachDB changefeeds are an enterprise feature and require a license. In this tutorial, I’m using a free-to-start version of CockroachDB Serverless, which has enterprise changefeeds enabled.