Amazon Aurora MySQL to Parquet in minutes
Amazon Aurora MySQL is a fully managed, MySQL-compatible, relational database engine that combines the speed and reliability of high-end commercial databases with the simplicity and cost-effectiveness of open-source databases. Aurora MySQL is a drop-in replacement for MySQL and makes it simple and cost-effective to set up, operate, and scale your new and existing MySQL deployments, thus freeing you to focus on your business and applications. Amazon RDS provides administration for Aurora by handling routine database tasks such as provisioning, patching, backup, recovery, failure detection, and repair. Amazon RDS also provides push-button migration tools to convert your existing Amazon RDS for MySQL applications to Aurora MySQ
Apache Parquet is an open-source, column-oriented data storage format of the Hadoop ecosystem designed to provide fast querying on large datasets. Parquet is routinely used for creating very highly scaled data lakes that can still be queried. Parquet is similar to other column-storage file formats that are available in Hadoop.
Estuary helps move data from
Amazon Aurora MySQL to Parquet in minutes with millisecond latency.
Data can then be directed to Parquet using materializations that are also open-source. Connectors have the ability to push data as quicikly as a destination will handle. Parquet likes files that are around 1 GB each. So, if you have high data volumes, Flow can keep your data lake up-to-date in near real-time.