Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Limit memory consumed by dedup hash index #224

Open
tasket opened this issue Nov 19, 2024 · 0 comments
Open

Limit memory consumed by dedup hash index #224

tasket opened this issue Nov 19, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@tasket
Copy link
Owner

tasket commented Nov 19, 2024

Wyng currently only curbs the growth of its (ephemeral) hash index when parts of the hash tree get full... Each tree branch references up to 65536 hashes. Even when that limit is reached on certain branches, others can continue to fill up.

Putting a limit on the index size in proportion to system RAM could reduce the risk of bogging down the system or overrunning /tmp. Some programs use a circular index (like circular buffer) to enforce limits, however Wyng has the advantage of chronologically arranged hash data in the manifests. The manifests could be read in reverse order when building the index and simply stop when the determined limit is reached.

The components of the index should be simple and deterministic enough to meter index growth with a cumulative variable instead of continually polling system memory stats.

@tasket tasket added the enhancement New feature or request label Nov 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant