diff --git a/README.md b/README.md index 84cdb4fc..1ff4f556 100644 --- a/README.md +++ b/README.md @@ -4,7 +4,7 @@ -**MyDuck Server** unlocks serious power for your (MySQL|Postgres) analytics. Imagine the simplicity of (MySQL|Postgres)’s familiar interface fused with the raw analytical speed of [DuckDB](https://duckdb.org/). Now you can supercharge your analytical queries with DuckDB’s lightning-fast OLAP engine, all while using the tools and dialect you know. +**MyDuck Server** unlocks serious power for your MySQL & Postgres analytics. Imagine the simplicity of (MySQL|Postgres)’s familiar interface fused with the raw analytical speed of [DuckDB](https://duckdb.org/). Now you can supercharge your analytical queries with DuckDB’s lightning-fast OLAP engine, all while using the tools and dialect you know. ## ❓ Why MyDuck ❓ @@ -12,7 +12,7 @@ While MySQL and Postgres are the most popular open-source databases for OLTP, th With MyDuck Server, you can: -- **Accelerate MySQL & Postgres analytics** by running analytical queries on your MySQL & Postgres data at speeds several orders of magnitude faster 🚀 +- **Accelerate analytics** by running queries on your MySQL & Postgres data at speeds several orders of magnitude faster 🚀 - **Keep familiar tools**—there's no need to change your existing (MySQL|Postgres)-based data analysis toolchains 🛠️ - **Go beyond MySQL & Postgres syntax** through DuckDB's full power to expand your analytics potential 💥 - **Run DuckDB in server mode** to share a DuckDB instance with your team or among your applications 🌩️ @@ -22,31 +22,32 @@ With MyDuck Server, you can: MyDuck Server isn't here to replace MySQL & Postgres — it's here to help MySQL & Postgres users do more with their data. This open-source project gives you a convenient way to integrate high-speed analytics into your workflow, all while embracing the flexibility and efficiency of DuckDB. ## ✨ Key Features +

- duck under dolphin + duck under dolphin

- **Blazing Fast OLAP with DuckDB**: MyDuck stores data in DuckDB, an OLAP-optimized database known for lightning-fast analytical queries. With DuckDB, MyDuck executes queries up to 1000x faster than traditional MySQL & Postgres setups, enabling complex analytics that were impractical with MySQL or Postgres alone. -- **MySQL-Compatible Interface**: MyDuck speaks MySQL wire protocol and understands MySQL syntax, so you can connect to it with any MySQL client and run MySQL-style SQL. MyDuck translates your queries on the fly and executes them in DuckDB. Connect your favorite data visualization tools and BI platforms to MyDuck without any changes, and enjoy the speed boost. +- **MySQL-Compatible Interface**: MyDuck speaks MySQL wire protocol and understands MySQL syntax, so you can connect to it with any MySQL client and run MySQL-style SQL. MyDuck translates your queries on the fly and executes them in DuckDB. - **Postgres-Compatible Interface**: MyDuck speaks Postgres wire protocol as well, allowing you to send DuckDB SQL directly with any Postgres client. DuckDB's SQL dialect [closely resembles PostgreSQL](https://duckdb.org/docs/sql/dialect/postgresql_compatibility.html), enabling you to speed up existing Postgres queries with minimal changes. - **Raw DuckDB Power**: MyDuck's support for raw DuckDB SQL opens up DuckDB’s full analytical capabilities, including [friendly SQL syntax](https://duckdb.org/docs/sql/dialect/friendly_sql.html), [advanced aggregates](https://duckdb.org/docs/sql/functions/aggregates), [accessing remote data sources](https://duckdb.org/docs/data/data_sources), and more. -- **Zero-ETL**: Just start replication and go! MyDuck can act as a MySQL replica or Postgres standby that replicates data from your primary server in real-time, so you can start querying immediately. There’s no need to set up complex ETL pipelines. +- **Zero-ETL**: Just start replication and go! MyDuck can act as a MySQL replica or a Postgres standby that replicates data from your primary server in real-time, so you can start querying immediately. There’s no need to set up complex ETL pipelines. - **Consistent and Efficient Replication**: Thanks to DuckDB's [solid ACID support](https://duckdb.org/2024/09/25/changing-data-with-confidence-and-acid.html), we've carefully managed transaction boundaries in the replication stream to ensure a **consistent data view** — you'll never see dirty data mid-transaction. Plus, MyDuck's **transaction batching** collects updates from multiple transactions and applies them to DuckDB in batches, significantly reducing write overhead (since DuckDB isn’t designed for high-frequency OLTP writes). -- **DuckDB in Server Mode**: If you aren't interested in MySQL & Postgres but just want to share a DuckDB instance with your team or among your applications, MyDuck is also a great solution. You can deploy MyDuck to a server, connect to it with the Postgres client library in your favorite programming language, and start running DuckDB SQL queries directly. +- **HTAP Architecture Support**: MyDuck works well with database proxy tools to enable hybrid transactional/analytical processing setups. You can route DML operations to (MySQL|Postgres) and analytical queries to MyDuck, creating a powerful HTAP architecture that combines the best of both worlds. -- **Seamless Integration with Dump & Copy Utilities**: MyDuck plays perfectly with modern MySQL & Postgres data migration tools, especially the [MySQL Shell](https://dev.mysql.com/doc/mysql-shell/en/) and [pg_dump](https://www.postgresql.org/docs/current/app-pgdump.html). For MySQL, you can load data into MyDuck in parallel from a MySQL Shell dump, or leverage the Shell’s `copy-instance` utility to copy a consistent snapshot of your running MySQL server to MyDuck. For Postgres, MyDuck can load data from a `pg_dump` archive. +- **Seamless Integration with Dump & Copy Utilities**: MyDuck plays well with modern MySQL & Postgres data migration tools, especially the [MySQL Shell](https://dev.mysql.com/doc/mysql-shell/en/) and [pg_dump](https://www.postgresql.org/docs/current/app-pgdump.html). For MySQL, you can load data into MyDuck in parallel from a MySQL Shell dump, or leverage the Shell’s `copy-instance` utility to copy a consistent snapshot of your running MySQL server to MyDuck. For Postgres, MyDuck can load data from a `pg_dump` archive. -- **Bulk Data Loading**: MyDuck supports fast bulk data loading from the client side with the standard MySQL `LOAD DATA LOCAL INFILE` command or the PostgreSQL `COPY FROM STDIN` command. +- **Bulk Upload & Download**: MyDuck supports fast bulk data loading from the client side with the standard MySQL `LOAD DATA LOCAL INFILE` command or the PostgreSQL `COPY FROM STDIN` command. You can also extract data from MyDuck using the PostgreSQL `COPY TO STDOUT` command. -- **Standalone Mode**: MyDuck can run in standalone mode without replication. In this mode, it is a drop-in replacement for MySQL, but with a DuckDB heart. You can `CREATE TABLE`, transactionally `INSERT`, `UPDATE`, and `DELETE` data, and run blazingly fast `SELECT` queries. +- **Standalone Mode**: MyDuck can run in standalone mode without replication. In this mode, it is a drop-in replacement for (MySQL|Postgres), but with a DuckDB heart. You can `CREATE TABLE`, transactionally `INSERT`, `UPDATE`, and `DELETE` data, and run blazingly fast `SELECT` queries. -- **HTAP Architecture Support**: MyDuck works well with database proxy tools to enable hybrid transactional/analytical processing setups. You can route DML operations to (MySQL|Postgres) and analytical queries to MyDuck, creating a powerful HTAP architecture that combines the best of both worlds. +- **DuckDB in Server Mode**: If you aren't interested in MySQL & Postgres but just want to share a DuckDB instance with your team or among your applications, MyDuck is also a great solution. You can deploy MyDuck to a server, connect to it with the Postgres client library in your favorite programming language, and start running DuckDB SQL queries directly. ## 📊 Performance @@ -56,7 +57,7 @@ Typical OLAP queries can run **up to 1000x faster** with MyDuck Server compared We have big plans for MyDuck Server! Here are some of the features we’re working on: -- [x] Be compatible with MySQL proxy tools like [ProxySQL](https://proxysql.com/) and [MariaDB MaxScale](https://mariadb.com/kb/en/maxscale/). +- [x] Be compatible with MySQL proxy tools like [ProxySQL](https://proxysql.com/). - [x] Replicate data from PostgreSQL. - [ ] Authentication. - [ ] ...and more! We’re always looking for ways to make MyDuck Server better. If you have a feature request, please let us know by [opening an issue](https://github.com/apecloud/myduckserver/issues/new). @@ -67,7 +68,7 @@ We have big plans for MyDuck Server! Here are some of the features we’re worki ### Prerequisites - **Docker** (recommended) for setting up MyDuck Server quickly. -- MySQL or PostgreSQL clients for connecting and testing your setup. +- MySQL or PostgreSQL CLI clients for connecting and testing your setup. ### Installation @@ -93,14 +94,14 @@ mysql -h127.0.0.1 -P13306 -uroot ``` > [!NOTE] -> MySQL CLI clients version 9.0 and above are not yet supported. If you are using macOS, consider `brew install mysql-client@8.4`. +> MySQL CLI clients version 9.0 and above are not yet supported on macOS. Consider `brew install mysql-client@8.4`. #### Connecting via PostgreSQL client -For full analytical power, connect using the Postgres port and write DuckDB SQL directly: +For full analytical power, connect to the Postgres port and run DuckDB SQL queries directly: ```bash -psql -h 127.0.0.1 -p 15432 -U mysql +psql -h 127.0.0.1 -p 15432 -U postgres ``` ### Replicating Data @@ -136,8 +137,8 @@ MyDuck Server supports setting up replicas from common cloud-based MySQL & Postg ### HTAP Setup With MyDuck's powerful analytics capabilities, you can create an hybrid transactional/analytical processing system where high-frequency data writes are directed to a standard MySQL or Postgres instance, while analytical queries are handled by a MyDuck Server instance. Follow our HTAP setup instructions to easily set up an HTAP demonstration: -* For MySQL HTAP, we can build the cluster based on [ProxySQL](docs/tutorial/mysql-htap-proxysql-setup.md) and [MariaDB MaxScale](docs/tutorial/mysql-htap-maxscale-setup.md). -* For PostgreSQL HTAP, we build it based on [PGPool-II](docs/tutorial/pg-htap-pgpool-setup.md) +* Provisioning a MySQL HTAP cluster based on [ProxySQL](docs/tutorial/mysql-htap-proxysql-setup.md) or [MariaDB MaxScale](docs/tutorial/mysql-htap-maxscale-setup.md). +* Provisioning a PostgreSQL HTAP cluster based on [PGPool-II](docs/tutorial/pg-htap-pgpool-setup.md) ### Query & Load Parquet Files @@ -149,7 +150,7 @@ Already have a DuckDB file? You can seamlessly bootstrap MyDuck Server with it. ## 💡 Contributing -Let’s make (MySQL|Postgres) analytics fast and powerful—together! +Let’s make (MySQL|Postgres) analytics fast and powerful — together! MyDuck Server is open-source, and we’d love your help to keep it growing! Check out our [CONTRIBUTING.md](CONTRIBUTING.md) for ways to get involved. From bug reports to feature requests, all contributions are welcome! diff --git a/logo/MyDuck.svg b/logo/MyDuck.svg new file mode 100644 index 00000000..2f893e42 --- /dev/null +++ b/logo/MyDuck.svg @@ -0,0 +1 @@ +MyDuckServerMySQL ProtocolPostgreSQLProtocolObject StorageLocal DiskDuckDBformatParquetZero-ETL Data SyncTransactionalReadReadWriteWrite(Planned)Embedded DuckDBQuery EngineDelta LakeDatabaseProxyDashboard,BI,DataApps,LLMs,ETLTools,PythonLibraries,DataframeAPIs, \ No newline at end of file diff --git a/logo/myduck-arch.png b/logo/myduck-arch.png deleted file mode 100644 index a069422d..00000000 Binary files a/logo/myduck-arch.png and /dev/null differ diff --git a/logo/myduck-logo.png b/logo/myduck-logo.png deleted file mode 100644 index 4202b333..00000000 Binary files a/logo/myduck-logo.png and /dev/null differ