From afc2e39af6e9fab207c66632e6a05e9f527712f6 Mon Sep 17 00:00:00 2001 From: Yongqiang YANG Date: Mon, 23 Dec 2024 17:59:18 +0800 Subject: [PATCH] [ccr] add performance (#1563) ## Versions - [x] dev - [ ] 3.0 - [ ] 2.1 - [ ] 2.0 ## Languages - [x] Chinese - [x] English ## Docs Checklist - [ ] Checked by AI - [ ] Test Cases Built --------- Co-authored-by: Yongqiang YANG --- .../data-admin/ccr/performance.md | 90 +++++++++++++++++++ .../data-admin/ccr/performance.md | 90 +++++++++++++++++++ sidebars.json | 3 +- 3 files changed, 182 insertions(+), 1 deletion(-) create mode 100644 docs/admin-manual/data-admin/ccr/performance.md create mode 100644 i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/data-admin/ccr/performance.md diff --git a/docs/admin-manual/data-admin/ccr/performance.md b/docs/admin-manual/data-admin/ccr/performance.md new file mode 100644 index 0000000000000..35dee06e03e8b --- /dev/null +++ b/docs/admin-manual/data-admin/ccr/performance.md @@ -0,0 +1,90 @@ +--- +{ + "title": "Performance", + "language": "en_US" +} +--- + + + +The performance data in this document is based on the default configuration. If you are facing high network latency or high throughput write scenarios, you can refer to the operation manual (../manual.md) for optimization. + +## Test Dataset +- **Dataset**: TPC-H 1T + +## Test Cluster Configuration + +| Configuration Item | Upstream Configuration | Downstream Configuration | +|--------------------|-------------------------------------------|---------------------------------------------| +| FE | 2 cores 16 GB | 2 cores 16 GB | +| BE | 3 nodes, 16 cores 64 GB, 3*500 GB per node | 3 nodes, 16 cores 64 GB, 3*500 GB per node | + +--- + +## Incremental Synchronization Performance Test + +### Test Steps +1. Create the library table information for TPC-H 1T in the upstream cluster. +2. Create a synchronization task for the TPC-H 1T database. +3. Wait for the TPC-H 1T data import to complete and record the completion time `a`. + - **Time Record**: `2024-12-05 20:09:48` +4. Wait for the downstream data synchronization to complete and record the completion time `b`. + - **Time Record**: `2024-12-05 20:10:21.678` + - **Log Information**: `INFO handle upsert binlog, sub sync state: IngestBinlog, prevCommitSeq: 3271, commitSeq: 3282 job=per_ccr line=ccr/job.` + +### Test Conclusion +Incremental synchronization time difference: `b - a = 33 seconds` + +--- + +## Full Synchronization Performance Test + +### Test Steps +1. Create the library table information for TPC-H 1T in the upstream cluster and complete the data import, recording the completion time `a`. + - **Time Record**: `2024-12-05 17:56:56` +2. Create a synchronization task for the TPC-H 1T database. +3. Wait for the downstream data synchronization to complete and record the completion time `b`. + - **Time Record**: `2024-12-05 18:02:57` + +### Test Conclusion +Full synchronization time difference: `b - a = 6 minutes 1 second` + +--- + +## Flink Synchronization Performance Test + +### Test Steps +1. Use Flink to import 100,000,000 records of data in the upstream. +2. Create a synchronization task for the library table. +3. Observe the difference between the downstream synchronization completion time and the upstream import completion time at each stage (e.g., 1,000,000 records, 2,000,000 records, etc.). +4. The last import completion time in the upstream is `2024-12-06 12:53:00`. + - **Log Information**: `2024-12-06 12:53:00,455 INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Source: Custom Source (1/1) (17be78de01ee5175d801c08c1b443771) switched from RUNNING to FINISHED.` +5. The downstream synchronization completion time is `2024-12-06 12:53:01.433`. + - **Log Information**: `2024-12-06 12:53:01.433 TRACE update job progress done, state: DBIncrementalSync, subState: Done, commitSeq: 12246911, prevCommitSeq: 12246911 job=flink_per_ccr line=ccr/job_progress.go:389` + +### Test Conclusion +The **lag time** at each stage is maintained within `5 seconds`. + +--- + +## Summary +- **Incremental Synchronization**: Time difference of approximately 33 seconds. +- **Full Synchronization**: Completion time of approximately 6 minutes 1 second. +- **Flink Synchronization**: Each stage's synchronization delay is within 5 seconds. diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/data-admin/ccr/performance.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/data-admin/ccr/performance.md new file mode 100644 index 0000000000000..1c14757acfb4b --- /dev/null +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/admin-manual/data-admin/ccr/performance.md @@ -0,0 +1,90 @@ +--- +{ + "title": "性能", + "language": "zh_CN" +} +--- + + + +本文档中的性能数据基于默认配置,如果您面临高网络延迟或高吞吐量写入场景,可以参考操作手册 (../manual.md) 进行优化。 + +## 测试数据集 +- **数据集**:TPC-H 1T + +## 测试集群配置 + +| 配置项 | 上游配置 | 下游配置 | +|----------|--------------------------------------|--------------------------------------| +| FE | 2 核 16 GB | 2 核 16 GB | +| BE | 3 个节点,16 核 64 GB,每节点 3*500 GB | 3 个节点,16 核 64 GB,每节点 3*500 GB | + +--- + +## 增量同步性能测试 + +### 测试步骤 +1. 在上游集群创建 TPC-H 1T 的库表信息。 +2. 创建 TPC-H 1T 数据库的同步任务。 +3. 等待 TPC-H 1T 数据导入完成,记录完成时间 `a`。 + - **时间记录**:`2024-12-05 20:09:48` +4. 等待下游数据同步完成,记录完成时间 `b`。 + - **时间记录**:`2024-12-05 20:10:21.678` + - **日志信息**:`INFO handle upsert binlog, sub sync state: IngestBinlog, prevCommitSeq: 3271, commitSeq: 3282 job=per_ccr line=ccr/job.` + +### 测试结论 +增量同步时间差:`b - a = 33 秒` + +--- + +## 全量同步性能测试 + +### 测试步骤 +1. 在上游集群创建 TPC-H 1T 的库表信息并完成数据导入,记录完成时间 `a`。 + - **时间记录**:`2024-12-05 17:56:56` +2. 创建 TPC-H 1T 数据库的同步任务。 +3. 等待下游数据同步完成,记录完成时间 `b`。 + - **时间记录**:`2024-12-05 18:02:57` + +### 测试结论 +全量同步时间差:`b - a = 6 分 1 秒` + +--- + +## Flink 同步性能测试 + +### 测试步骤 +1. 上游使用 Flink 导入方式导入 100,000,000 条数据。 +2. 创建库表的同步任务。 +3. 在每个阶段观察下游同步完成时间与上游导入完成时间的差异(例如:1,000,000 条、2,000,000 条等)。 +4. 上游最后一次导入完成时间为 `2024-12-06 12:53:00`。 + - **日志信息**:`2024-12-06 12:53:00,455 INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Source: Custom Source (1/1) (17be78de01ee5175d801c08c1b443771) switched from RUNNING to FINISHED.` +5. 下游同步完成时间为 `2024-12-06 12:53:01.433`。 + - **日志信息**:`2024-12-06 12:53:01.433 TRACE update job progress done, state: DBIncrementalSync, subState: Done, commitSeq: 12246911, prevCommitSeq: 12246911 job=flink_per_ccr line=ccr/job_progress.go:389` + +### 测试结论 +每个阶段的 **lag 时间** 均保持在 `5 秒内`。 + +--- + +## 总结 +- **增量同步**:时间差约为 33 秒。 +- **全量同步**:完成时间约为 6 分 1 秒。 +- **Flink 同步**:各阶段同步延迟均在 5 秒内。 diff --git a/sidebars.json b/sidebars.json index cb00972b7fbbf..135c83ac1248c 100644 --- a/sidebars.json +++ b/sidebars.json @@ -409,7 +409,8 @@ "admin-manual/data-admin/ccr/overview", "admin-manual/data-admin/ccr/quickstart", "admin-manual/data-admin/ccr/manual", - "admin-manual/data-admin/ccr/feature" + "admin-manual/data-admin/ccr/feature", + "admin-manual/data-admin/ccr/performance" ] }, "admin-manual/data-admin/recyclebin",