perf(file-validator): improve the performance of file_validator #1078
base: main
Conversation
Pull Request Overview
This PR significantly improves the performance of the file validator in the SJMCL Minecraft launcher by implementing concurrent processing with limited parallelism. The changes replace unbounded concurrent execution (join_all) with controlled concurrency using stream processing, reducing execution time from 14 seconds to under 3 seconds in testing.
Key changes:
- Introduced concurrency control with a `CONCURRENT_HASH_CHECKS` constant (16 concurrent operations)
- Refactored file validation logic into reusable helper functions
- Used `spawn_blocking` for CPU-intensive hash validation tasks
- Applied concurrency limits to native library extraction
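The bounded-concurrency idea behind these changes can be sketched without the async machinery. The PR itself uses stream processing; the following std-only analogue (all names illustrative, not the PR's actual code) caps in-flight work with a fixed pool of worker threads draining a shared queue:

```rust
use std::sync::{mpsc, Arc, Mutex};
use std::thread;

// Run `f` over `items` with at most `limit` tasks executing at once.
// Workers pull from a shared channel, so finishing one item immediately
// frees a slot for the next -- the same effect the PR gets from
// limited-parallelism stream processing instead of an unbounded join_all.
fn run_bounded<T, R, F>(items: Vec<T>, limit: usize, f: F) -> Vec<R>
where
    T: Send + 'static,
    R: Send + 'static,
    F: Fn(T) -> R + Send + Sync + 'static,
{
    let (tx, rx) = mpsc::channel::<T>();
    for item in items {
        tx.send(item).unwrap();
    }
    drop(tx); // close the queue so workers exit once it drains

    let rx = Arc::new(Mutex::new(rx));
    let f = Arc::new(f);
    let mut handles = Vec::new();
    for _ in 0..limit {
        let (rx, f) = (Arc::clone(&rx), Arc::clone(&f));
        handles.push(thread::spawn(move || {
            let mut out = Vec::new();
            loop {
                // hold the lock only while receiving, not while working
                let item = rx.lock().unwrap().recv();
                match item {
                    Ok(v) => out.push(f(v)),
                    Err(_) => break, // queue closed and empty
                }
            }
            out
        }));
    }
    handles
        .into_iter()
        .flat_map(|h| h.join().unwrap())
        .collect()
}

fn main() {
    let results = run_bounded((1..=100).collect(), 16, |x: u64| x * 2);
    assert_eq!(results.len(), 100);
    assert_eq!(results.iter().sum::<u64>(), 2 * (100 * 101 / 2));
}
```

In the async version, the same cap comes from processing the validation futures as a stream with limited parallelism rather than awaiting them all at once.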
Comments suppressed due to low confidence (1)
src-tauri/src/launch/helpers/file_validator.rs:1
- The error handling loop can be simplified using iterator methods. Consider using `results.into_iter().collect::<Result<Vec<_>, _>>()?;` before the loop to handle all errors at once.
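As a quick illustration of the suggested pattern: collecting an iterator of `Result`s into `Result<Vec<_>, _>` gathers all values on success and short-circuits on the first error.

```rust
fn main() {
    // All Ok: the values are gathered in order.
    let ok: Vec<Result<i32, String>> = vec![Ok(1), Ok(2), Ok(3)];
    let collected: Result<Vec<i32>, String> = ok.into_iter().collect();
    assert_eq!(collected, Ok(vec![1, 2, 3]));

    // Any Err: collection stops and the first error is returned.
    let mixed: Vec<Result<i32, String>> = vec![Ok(1), Err("bad hash".into()), Ok(3)];
    let collected: Result<Vec<i32>, String> = mixed.into_iter().collect();
    assert_eq!(collected, Err("bad hash".to_string()));
}
```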
```rust
use crate::error::SJMCLResult;

let native_str = if let Some(native_fn) = Some(&get_natives_string) {
  library.natives.as_ref().and_then(native_fn)
} else {
  None
};
```
Copilot AI commented on Oct 17, 2025
The conditional assignment is unnecessarily complex. Since `native_fn` is always `Some(&get_natives_string)`, this can be simplified to directly call `library.natives.as_ref().and_then(&get_natives_string)`.
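To see why the two forms are equivalent, here is a minimal self-contained reproduction (the `Library` struct and `get_natives_string` below are stand-ins, not the project's real definitions):

```rust
struct Library {
    natives: Option<String>,
}

// Stand-in for the real natives-string lookup.
fn get_natives_string(n: &String) -> Option<String> {
    Some(format!("natives-{n}"))
}

fn main() {
    let library = Library { natives: Some("linux".to_string()) };

    // Form from the diff: the outer if-let always takes the Some branch,
    // because Some(&get_natives_string) can never be None.
    let verbose = if let Some(native_fn) = Some(&get_natives_string) {
        library.natives.as_ref().and_then(native_fn)
    } else {
        None
    };

    // Simplified form from the review comment.
    let simple = library.natives.as_ref().and_then(get_natives_string);

    assert_eq!(verbose, simple);
    assert_eq!(simple, Some("natives-linux".to_string()));
}
```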
UNIkeEN left a comment
A few functions weren't changed but had their positions moved around — is there a reason for that?
If there's no particular reason, could you move them back? The diff is really long, and scrolling back and forth is painful (xD)
```rust
use url::Url;
use zip::ZipArchive;

const CONCURRENT_HASH_CHECKS: usize = 16;
```
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Isn't this a bit of a magic number? Wouldn't an adaptive, dynamically computed concurrency count be better? cc @ToolmanP
It could be computed dynamically from the CPU core count.)
hhhh, moving them back isn't really feasible — the original function order was too messy. That said, the current order doesn't feel great either; I'd like to restructure it more boldly.)

Current function order: data structure definitions
UNIkeEN left a comment
btw, please resolve the conflicts (meow)
```rust
    let cpu_count = sys.cpus().len();
    (cpu_count * 3).max(8).min(32)
  })
}
```
I still find this function odd — why not use `std::thread::available_parallelism().unwrap().into()` to get the concurrency count, instead of a custom rule?
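For reference, the std call the reviewer suggests returns a `NonZeroUsize` estimate of usable cores and can fail on exotic platforms. A hedged sketch combining it with the PR's clamp rule (this is an illustration, not the PR's actual helper):

```rust
use std::thread;

fn main() {
    // available_parallelism() is std's estimate of usable cores; it
    // returns a Result, so fall back to the PR's lower bound of 8
    // if the platform cannot report a value.
    let cores = thread::available_parallelism()
        .map(|n| n.get())
        .unwrap_or(8);

    // Applying the PR's custom rule (x3, clamped to 8..=32) on top:
    let limit = (cores * 3).clamp(8, 32);

    assert!(cores >= 1);
    assert!((8..=32).contains(&limit));
}
```

Unlike the `sysinfo`-based version in the diff, this needs no CPU-usage refresh or sleep, since it only asks for the core count.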
For IO-bound tasks, isn't this approach better(?
> For IO-bound tasks, isn't this approach better(?

If there is a rationale for this, please factor it out into utils.
```rust
  Ok(None)
}

async fn validate_files_concurrently<T, F, Fut>(
```
So `validate_file_with_hash` isn't inlined here but has to live in the processor. I feel that once the concurrency-count logic switches to std, this function isn't really worth reusing (at the least, it then looks more like a generic concurrency utils helper).

I still think we should use std rather than a custom rule; if the custom rule is adopted and ends up well-founded, it should be turned into a utils function. cc @ToolmanP to review and decide. Also, this PR fixes some earlier code-style issues (e.g., using `get` for array indexing instead of a length-check if-else), 👍
```rust
std::thread::sleep(sysinfo::MINIMUM_CPU_UPDATE_INTERVAL);
sys.refresh_cpu_usage();
let cpu_count = sys.cpus().len();
(cpu_count * 3).max(8).min(32)
```
Is there any rationale behind this?
It's simply a reasonably good value for this kind of scenario — partially IO-blocked work plus CPU-bound hash computation (which is exactly the actual situation here). If you insist on a hard rationale, there honestly isn't one.)
This is already pretty far from a magic number); if we really wanted a principled basis we'd probably have to use rayon (lol), and that's a refactor I can't take on.
ToolmanP left a comment
The logic is fine, but the magic numbers need an explanatory comment.
```rust
  F: Fn(T, bool) -> Fut + Send + Sync + Clone + 'static,
  Fut: std::future::Future<Output = SJMCLResult<Option<PTaskParam>>> + Send,
{
  let max_concurrent = get_concurrent_limit(3.0);
```
What do 3.0 and 1.5 mean?
Checklist
This PR is a ..
Related Issues
Description
Performance improvement

Measured in the same environment (1.21.1 NeoForge), validation time dropped from 14s to under 3s.

The original code used `join_all` to start every file validation task at once, producing a flood of IO operations and blocked threads. Limiting the number of concurrent reads yields a large performance gain. In addition, `spawn_blocking` moves the work to a separate thread pool, which is a better place for this kind of CPU-intensive task.

Extraction of the natives libraries is now also bounded by a concurrency limit.

Elsewhere, some duplicated logic was extracted into helpers, along with a few minor code changes.
Additional Context