-
Notifications
You must be signed in to change notification settings - Fork 247
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
## Versions - [x] dev - [ ] 3.0 - [ ] 2.1 - [ ] 2.0 ## Languages - [x] Chinese - [x] English ## Docs Checklist - [ ] Checked by AI - [ ] Test Cases Built --------- Co-authored-by: Yongqiang YANG <[email protected]>
- Loading branch information
1 parent
41b6620
commit afc2e39
Showing
3 changed files
with
182 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,90 @@ | ||
--- | ||
{ | ||
"title": "Performance", | ||
"language": "en_US" | ||
} | ||
--- | ||
|
||
<!-- | ||
Licensed to the Apache Software Foundation (ASF) under one | ||
or more contributor license agreements. See the NOTICE file | ||
distributed with this work for additional information | ||
regarding copyright ownership. The ASF licenses this file | ||
to you under the Apache License, Version 2.0 (the | ||
"License"); you may not use this file except in compliance | ||
with the License. You may obtain a copy of the License at | ||
http://www.apache.org/licenses/LICENSE-2.0 | ||
Unless required by applicable law or agreed to in writing, | ||
software distributed under the License is distributed on an | ||
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY | ||
KIND, either express or implied. See the License for the | ||
specific language governing permissions and limitations | ||
under the License. | ||
--> | ||
|
||
The performance data in this document is based on the default configuration. If you are facing high network latency or high throughput write scenarios, you can refer to the operation manual (../manual.md) for optimization. | ||
|
||
## Test Dataset | ||
- **Dataset**: TPC-H 1T | ||
|
||
## Test Cluster Configuration | ||
|
||
| Configuration Item | Upstream Configuration | Downstream Configuration | | ||
|--------------------|-------------------------------------------|---------------------------------------------| | ||
| FE | 2 cores 16 GB | 2 cores 16 GB | | ||
| BE | 3 nodes, 16 cores 64 GB, 3*500 GB per node | 3 nodes, 16 cores 64 GB, 3*500 GB per node | | ||
|
||
--- | ||
|
||
## Incremental Synchronization Performance Test | ||
|
||
### Test Steps | ||
1. Create the library table information for TPC-H 1T in the upstream cluster. | ||
2. Create a synchronization task for the TPC-H 1T database. | ||
3. Wait for the TPC-H 1T data import to complete and record the completion time `a`. | ||
- **Time Record**: `2024-12-05 20:09:48` | ||
4. Wait for the downstream data synchronization to complete and record the completion time `b`. | ||
- **Time Record**: `2024-12-05 20:10:21.678` | ||
- **Log Information**: `INFO handle upsert binlog, sub sync state: IngestBinlog, prevCommitSeq: 3271, commitSeq: 3282 job=per_ccr line=ccr/job.` | ||
|
||
### Test Conclusion | ||
Incremental synchronization time difference: `b - a = 33 seconds` | ||
|
||
--- | ||
|
||
## Full Synchronization Performance Test | ||
|
||
### Test Steps | ||
1. Create the library table information for TPC-H 1T in the upstream cluster and complete the data import, recording the completion time `a`. | ||
- **Time Record**: `2024-12-05 17:56:56` | ||
2. Create a synchronization task for the TPC-H 1T database. | ||
3. Wait for the downstream data synchronization to complete and record the completion time `b`. | ||
- **Time Record**: `2024-12-05 18:02:57` | ||
|
||
### Test Conclusion | ||
Full synchronization time difference: `b - a = 6 minutes 1 second` | ||
|
||
--- | ||
|
||
## Flink Synchronization Performance Test | ||
|
||
### Test Steps | ||
1. Use Flink to import 100,000,000 records of data in the upstream. | ||
2. Create a synchronization task for the library table. | ||
3. Observe the difference between the downstream synchronization completion time and the upstream import completion time at each stage (e.g., 1,000,000 records, 2,000,000 records, etc.). | ||
4. The last import completion time in the upstream is `2024-12-06 12:53:00`. | ||
- **Log Information**: `2024-12-06 12:53:00,455 INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Source: Custom Source (1/1) (17be78de01ee5175d801c08c1b443771) switched from RUNNING to FINISHED.` | ||
5. The downstream synchronization completion time is `2024-12-06 12:53:01.433`. | ||
- **Log Information**: `2024-12-06 12:53:01.433 TRACE update job progress done, state: DBIncrementalSync, subState: Done, commitSeq: 12246911, prevCommitSeq: 12246911 job=flink_per_ccr line=ccr/job_progress.go:389` | ||
|
||
### Test Conclusion | ||
The **lag time** at each stage is maintained within `5 seconds`. | ||
|
||
--- | ||
|
||
## Summary | ||
- **Incremental Synchronization**: Time difference of approximately 33 seconds. | ||
- **Full Synchronization**: Completion time of approximately 6 minutes 1 second. | ||
- **Flink Synchronization**: Each stage's synchronization delay is within 5 seconds. |
90 changes: 90 additions & 0 deletions
90
...cusaurus-plugin-content-docs/current/admin-manual/data-admin/ccr/performance.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,90 @@ | ||
--- | ||
{ | ||
"title": "性能", | ||
"language": "zh_CN" | ||
} | ||
--- | ||
|
||
<!-- | ||
Licensed to the Apache Software Foundation (ASF) under one | ||
or more contributor license agreements. See the NOTICE file | ||
distributed with this work for additional information | ||
regarding copyright ownership. The ASF licenses this file | ||
to you under the Apache License, Version 2.0 (the | ||
"License"); you may not use this file except in compliance | ||
with the License. You may obtain a copy of the License at | ||
http://www.apache.org/licenses/LICENSE-2.0 | ||
Unless required by applicable law or agreed to in writing, | ||
software distributed under the License is distributed on an | ||
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY | ||
KIND, either express or implied. See the License for the | ||
specific language governing permissions and limitations | ||
under the License. | ||
--> | ||
|
||
本文档中的性能数据基于默认配置,如果您面临高网络延迟或高吞吐量写入场景,可以参考操作手册 (../manual.md) 进行优化。 | ||
|
||
## 测试数据集 | ||
- **数据集**:TPC-H 1T | ||
|
||
## 测试集群配置 | ||
|
||
| 配置项 | 上游配置 | 下游配置 | | ||
|----------|--------------------------------------|--------------------------------------| | ||
| FE | 2 核 16 GB | 2 核 16 GB | | ||
| BE | 3 个节点,16 核 64 GB,每节点 3*500 GB | 3 个节点,16 核 64 GB,每节点 3*500 GB | | ||
|
||
--- | ||
|
||
## 增量同步性能测试 | ||
|
||
### 测试步骤 | ||
1. 在上游集群创建 TPC-H 1T 的库表信息。 | ||
2. 创建 TPC-H 1T 数据库的同步任务。 | ||
3. 等待 TPC-H 1T 数据导入完成,记录完成时间 `a`。 | ||
- **时间记录**:`2024-12-05 20:09:48` | ||
4. 等待下游数据同步完成,记录完成时间 `b`。 | ||
- **时间记录**:`2024-12-05 20:10:21.678` | ||
- **日志信息**:`INFO handle upsert binlog, sub sync state: IngestBinlog, prevCommitSeq: 3271, commitSeq: 3282 job=per_ccr line=ccr/job.` | ||
|
||
### 测试结论 | ||
增量同步时间差:`b - a = 33 秒` | ||
|
||
--- | ||
|
||
## 全量同步性能测试 | ||
|
||
### 测试步骤 | ||
1. 在上游集群创建 TPC-H 1T 的库表信息并完成数据导入,记录完成时间 `a`。 | ||
- **时间记录**:`2024-12-05 17:56:56` | ||
2. 创建 TPC-H 1T 数据库的同步任务。 | ||
3. 等待下游数据同步完成,记录完成时间 `b`。 | ||
- **时间记录**:`2024-12-05 18:02:57` | ||
|
||
### 测试结论 | ||
全量同步时间差:`b - a = 6 分 1 秒` | ||
|
||
--- | ||
|
||
## Flink 同步性能测试 | ||
|
||
### 测试步骤 | ||
1. 上游使用 Flink 导入方式导入 100,000,000 条数据。 | ||
2. 创建库表的同步任务。 | ||
3. 在每个阶段观察下游同步完成时间与上游导入完成时间的差异(例如:1,000,000 条、2,000,000 条等)。 | ||
4. 上游最后一次导入完成时间为 `2024-12-06 12:53:00`。 | ||
- **日志信息**:`2024-12-06 12:53:00,455 INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Source: Custom Source (1/1) (17be78de01ee5175d801c08c1b443771) switched from RUNNING to FINISHED.` | ||
5. 下游同步完成时间为 `2024-12-06 12:53:01.433`。 | ||
- **日志信息**:`2024-12-06 12:53:01.433 TRACE update job progress done, state: DBIncrementalSync, subState: Done, commitSeq: 12246911, prevCommitSeq: 12246911 job=flink_per_ccr line=ccr/job_progress.go:389` | ||
|
||
### 测试结论 | ||
每个阶段的 **lag 时间** 均保持在 `5 秒内`。 | ||
|
||
--- | ||
|
||
## 总结 | ||
- **增量同步**:时间差约为 33 秒。 | ||
- **全量同步**:完成时间约为 6 分 1 秒。 | ||
- **Flink 同步**:各阶段同步延迟均在 5 秒内。 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters