Blog
More about Estuary and related technologies, straight from the team.
Our blog breaks down basic concepts and takes you into the minds of our engineers. We also dig into the business principles that guide our company and allow us to build great solutions for yours.


What Is Streaming Analytics? Azure, AWS, & Kafka Explained
Discover what streaming analytics is and how you can use Azure, AWS, and Kafka to analyze and visualize data for better insights & decisions.


4 Methods to Transfer Data from SQL Server to Snowflake
Is connecting SQL Server to Snowflake essential for the success of your data stack? Choose your favorite from these 4 step-by-step methods.


How To Enable SQL Server Change Data Capture In 5 Steps
Learn how to enable Change Data Capture in SQL Server with our detailed guide. We’ll show you how to set up CDC and use it to track changes.


How To Move Data From MongoDB To BigQuery: 2 Methods
Learn two no-fuss methods to move your MongoDB data into BigQuery — step-by-step instructions included.


PostgreSQL vs MongoDB: Which Database Should You Choose?
Choosing the right database can have a HUGE effect on your application. Let’s break down the pros and cons of PostgreSQL and MongoDB.


What is Data Streaming Technology? Best Practices & Examples
Everything you need to know about data streaming technology, from best practices to real-life examples of how it’s used.


DataOps for Streaming: How to Manage Your Real-Time Data
Streaming data infrastructure is notoriously hard to manage. DataOps gives you the framework to do so successfully.


PostgreSQL Data Types Explained With Examples
PostgreSQL has all the standard RBDMS data types, and then some. Use this handy reference for Postgres data type syntax, examples, and more.


The Complete Guide to Change Data Capture in SQL Server
Microsoft SQL Server provides a native CDC capability, but that’s far from the whole story. Learn what’s possible and how to take advantage.


Real-Time Data Streaming Architecture: Benefits, Challenges, and Impact
Real-time data streaming architecture is a potent tool for teams in a range of industries… if you have the right skills and infrastructure.


What Is Real Time Data Streaming? [Benefits & Use Cases]
Real-time data streaming technology drives modern businesses across industries. And its benefits go much deeper than the obvious.


How to Stream Data from Postgres to Snowflake: Full Guide
Streaming data from Postgres to Snowflake is a great way to support transactional and analytical workloads. And it’s easier than it seems.


3 Ways to Stream Data from Postgres to ElasticSearch
A Postgres to ElasticSearch pipeline is a great way to store and analyze tons of relational data- as long as you keep the two systems in sync.


Firestore vs. Realtime Database: Which Performs Better?
Picking the right NoSQL database in the Firebase suite can dramatically affect your app’s performance. Here’s how to make the right choice.


How to Analyze Google Firestore Data: 4 Options By Use Case
How to find your least-painful path to Firestore analytics at scale, whether that’s inside or outside Firebase.


What Is Real Time Data Warehouse? Benefits & Best Practices
Typical data warehouses are designed to help you effectively manage and analyze data. A real-time data warehouse takes this up a notch.


How To Replicate Data From MySQL To Parquet: Full Guide
Syncing data from MySQL to Parquet lets you power both transactional and Big Data workloads. But setting it up effectively isn’t easy.


Capturing data from Salesforce: Historical and Real-Time
Getting a reliable real-time data stream out of Salesforce alongside historical data is vital for day-to-day operations and analytics alike, but it can be a challenge. Here’s how we did it.


7 Eye-Opening Examples Of Data Pipelines (Guide 2023)
Data pipelines are everywhere, including some surprising places. Here are some often-overlooked examples that might apply to your industry.


What Is Real-Time Processing (In-depth Guide For Beginners)
Real-time data is a foundation of modern life that can sometimes hide in plain sight. Here’s how it works and how you can benefit.


The Real-time Data Landscape
There’s been major innovation throughout the entire real-time data landscape over the last few years. These are the major players, and how they fit together.


How To Move Data From PostgreSQL To BigQuery In Minutes
Syncing your relational database and data warehouse (ie, Postgres and BigQuery) reliably is critical. But it doesn’t have to be hard.


9+ Best Data Pipeline Tools For 2023 (Review & Comparison)
Not all data pipelines platforms are a good fit for every business. In this post, we’ll look at the strengths and weaknesses of top tools.


What Is Real Time Data? Benefits, Examples, And Use Cases
Want to take advantage of real-time data in your business? This guide will provide in-depth information about real-time data and its uses.


Connecting Kafka To PostgreSQL: A Comprehensive Guide 2023
Building a pipeline between Kafka and PostgreSQL has innumerable benefits for your stack. It shouldn’t have to be hard.


What Are Data Connectors? Importance, Types, And Examples
Data connectors are critical for bridging the gap between data systems. They come in many forms, and carry many benefits.


How To Connect Google Sheets To PostgreSQL [Full Guide]
Your team can reap the benefits of both Google Sheets and Postgres without sacrificing data integrity across your stack.


How to sync Firestore data to Snowflake for data analytics
Until recently, connecting Google Cloud Firestore to Snowflake hasn’t been easy. Here’s why (and how) to integrate this powerful pair.


Confluent Kafka vs Apache Kafka vs Estuary: 2023 Comparison
You can’t go wrong with Kafka for real-time data, but other platforms give it a run for its money. Here’s how they stack up.


How To Connect MySQL To SnowFlake In Minutes: Full Guide
MySQL and Snowflake are both powerful foundations for your data stack – but must be unified around the same data. Here’s how.


How To Connect MySQL To BigQuery: 2 Straightforward Ways
Syncing data between MySQL and BigQuery is critical to your operational and analytical data needs – but it’s not always easy to get right.


How to Sync Data from MySQL To Elasticsearch in Real Time
MySQL and Elasticsearch are a powerful duo to power your app – as long as their data stays in sync. We teach you how to set that up.


How To Connect MySQL To Google Sheets Without Writing Code
Connecting MySQL to Google Sheets is a straightforward process you can complete without writing code. Here are three methods.


How To Easily Migrate A Database: MySQL To PostgreSQL
Your step-by-step guide to using a real-time data pipeline to migrate from MySQL to PostgreSQL.


What Is An Automated Data Pipeline – Examples & Use Cases
All about data pipelines, what makes them automated, and real-world examples.


7 Best Airbyte Alternatives [2023 Comparison & Review]
If you’re looking for Airbyte alternatives, here are our top 7 data integration tools of 2023 to help you choose the best one.


Graphing GitHub CI build times with remote transformations and Flow
Create a real-time pipeline with a Cloudflare Workers transformation to monitor build times from the GitHub API.


Meltano vs Airbyte vs Estuary – A 2022 Decision Guide
We compare a wide range of features and pricing plan for Airbyte, Meltano, and Estuary to help you find the right solution for your business.


Airbyte vs Fivetran vs Estuary – In-depth Comparison 2022
We compare a wide range of features and pricing plan for Airbyte, Fivetran, and Estuaryto help you find the right solution for your business.


How to connect Kafka to Hasura GraphQL
In this tutorial, you’ll learn how to integrate Apache Kafka with a real-time GraphQL API using Hasura and Estuary Flow.


7 Best Fivetran Alternatives [2022 Competitor Comparison]
Despite its popularity and widespread use, Fivetran is far from perfect and there are many alternatives in the market with better offerings.


Data Pipeline Architecture: Process & Considerations
Well-thought-out architecture is what differentiates slow, disorganized, failure-prone data pipelines from efficient, scalable, reliable pipelines that deliver the exact results you want.


How To Build Data Pipelines: Full Process & Best Practices
To be successful, a company’s data pipelines must be scalable, organized, usable to the correct stakeholders, and above all, crafted to align with business goals.


Why TypeScript for real-time data transformation?
Using TypeScript for transformations allows us to prevent many of the common types of pipeline failures that might otherwise slip through the cracks.


Streaming Data Pipelines: Must-Have Features & Complexities
Streaming data pipelines are different from most data pipelines because they handle data continuously — in near-real-time. But they still have the fundamental pieces of a data pipeline.


What Is An ETL Pipeline? A Complete Guide For Beginners
ETL stands for extract, transform, and load. Together, this set of processes moves data from a source to a destination system.


What Is A Data Pipeline? Definition, Process, And Examples
A data pipeline is a system that takes data from its various sources and funnels it to its destination. It’s one component of an organization’s data infrastructure.


The problem with credit-based pricing for data platforms
Your data warehouse and data pipelines handle a ton of data every day, and they can quickly make or break your budget. Evaluating exactly how and why you’ll be charged is a crucial planning step.


How to create a real-time materialized view in PostgreSQL
In this article, we’ll discuss an example workflow that uses Estuary Flow to create a real-time materialized view in Postgres.


Everything you actually need to know to use Estuary Flow
There are only a few core concepts you need to know to start using Estuary Flow. We explain them with illustrations.


Four software engineering best practices to improve your data pipelines
Software and data engineering are very different, but you can apply the same best practices to both – if you’re smart about the context.


How to prevent your data warehouse from becoming a data swamp
Your data storage system might be a data swamp masquerading as a data warehouse. Here’s how to tell, and what to do.


Why ELT won’t fix your data problems
If we’re not careful, the modern data stack and ELT can cause new incarnations of problems that have been plaguing us for years.


Announcing Estuary’s partnership with Rockset
As we expand our beta program, Estuary is thrilled to announce our partnership with Rockset.


DataOps for business: A comprehensive introduction
DataOps is a holistic approach that recognizes the far-reaching impacts data has for business, and addresses common problems on several fronts.


Why you don’t actually need a reverse ETL platform
If you set up your architecture right, you just need one, robust data pipeline system.


Data for all: Why data democratization matters at every scale
It’s vital to avoid data-related power imbalances among individuals, teams, and entire businesses. But data democracy takes work.


Understanding the modern data stack, and why open-source matters
The “modern data stack” is a framework used to conceptualize how different data tools work together. But there’s more to it than that.


The complete change data capture guide for MySQL
A comprehensive guide to change data capture in MySQL for cross-functional data teams.


The complete change data capture guide for PostgreSQL
A comprehensive guide to change data capture in PostgreSQL for cross-functional data teams.


3 reasons to rethink your approach to change data capture
As we navigate this rapidly evolving space, we should be familiar with the challenges of change data capture as it currently exists. But we should not expect them to remain the same for long.


A comprehensive introduction to change data capture (CDC)
Change data capture, or CDC, is the process of recognizing a change in a source data system so that a downstream system can act on that change, usually by updating a target system to reflect new information.


How new pipeline tools are changing data engineering in the 2020s
For the past several years, the workforce has been chronically short on data engineers. But what will happen as vendors build services that automate much of their daily work?


Try it yourself: Continuous materialized views in PostgreSQL
Estuary Flow lets you synchronize data across all the systems you care about, with millisecond latency. What does that actually look like in practice? Here’s a simple — but powerful — example.


This article won’t tell you how to build a data mesh
Or, how to take an organization-first approach to modern data architecture


The power and implications of data materialization
The simple concept of materialization can tell us a lot about how (and why) data is stored and represented. In this post: logs and tables; data loading and queries; exports, and more.


Connector stories: Snowflake and BigQuery
Data warehouses like Snowflake and BigQuery are essential to your data stack. Today we’re talking with the two software engineers who built Estuary’s Snowflake and Bigquery connectors to get an inside look at their process.


The Estuary story and guiding principles
Our mission and guiding principles are defined not just by the common challenges of our field, but also by our personal experiences. Here’s how we got to where we are, the beliefs we hold as a result, and the vision we work towards.


Database vs data warehouse vs data lake: Key differences and usage
We hear these terms used a lot, and to the uninitiated, they can sometimes seem interchangeable. So, what’s the difference between these types of data storage systems?


Three data scaling pitfalls and how to avoid them
Being prepared for data scalability challenges and staying aware of best practices can help you avoid common issues.


Connector stories: Apache Kakfa
Apache Kafka is an extremely popular open-source event streaming platform. We talk to Estuary developer Alex about his process and the insights he gained building the Kafka connector.


Introducing Estuary’s open-source connector repository
We believe that an ecosystem of open-source connectors will be critical to the future of data integration. That’s why Estuary is excited to announce our open-source connector repository.


The costs of data integration explained, and how to minimize them
There is a cost associated with putting your data to work, and the benefits you gain depend on the systems you put into place. To maximize net value, you need to strike a balance between minimizing costs and maximizing gain.


Re-evaluating Kafka: issues and alternatives for real-time
Kafka’s challenges have exhausted many an engineer on the path to successful data streaming. What if there was an easier way?


5 example use-cases for real-time data processing
The applications for real-time data processing are diverse and far-reaching. This article highlights just a few.


Real-time vs batch data pipelines: a comprehensive introduction
Real-time and batch are two broad categories of data processing. Though they handle data differently, both are vital to the systems that make our businesses and society run.


Real-time MapReduce: Why MapReduce is making a comeback
What if we could adapt MapReduce to real-time data processing? Spoiler: it’s awesome and we’re building a next-gen data platform based on it!


ETL vs ELT: Breaking down the split paradigm
At first glance, ETL and ELT seem rigidly defined and mutually exclusive. But to understand our data integration options, we need to look closer.


What’s a data pipeline? The business essentials.
The term “data pipeline” evokes a mental image that speaks for itself. But their significance for your data goes far beyond that.


Putting an end to Unreliable Analytics
When building a product or service, it’s imperative to know that input data will be as expected. If that information comes…


Lead With Vision, Not Metrics: How OKR’s Can Be a Dangerous Tool
Data, if used properly, is key to success in any organization. It’s impossible to improve without metrics which help us understand…


Data Democracy Unlocks Value for Organizations. Here’s How to Start
Sometimes the most relaxing place to travel is nowhere at all.


Keep Privacy and Governance in Mind When Developing or Updating Your Systems
Design a fitness plan that’s simple and fun and you’ll never miss a workout.


How Kubernetes will Enable a new Genre of Vendor to Enterprise
What do you do when geography gets in the way of friendship or love?


Safely Sharing Data Between Companies
You don’t have to be a professional designer to appreciate visual balance and beauty.


How to take advantage of the data revolution in your business
Data is exploding and it takes planning to get in front of that trend.


A Unified Data Foundation for Real-time and Batch
Batch and streaming usually require different architectures and building infrastructure, but shouldn’t.