-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What happens if we embed the probability tables? #399
Comments
Quick test with https://github.com/Yoric/binjs-ref/tree/entropy-0.4-embed and dictionary depth = 4 This is untested code.
|
I'm tracking a bug that increase a lot the amount of data we write to *.prelude. |
Latest version
We still embed much data that I'm pretty sure we don't need, but we're now within 1% of brotli. Pending roundtrip. |
Latest version, depth 1, trying to use as much as possible the same protocol as binast/binjs-fbssdc#2. Facebook sample set$ cargo run --release --example sample_directory -- --in tests/data/facebook/single/ --sampling 0.2 --depth 1 --follow-symlinks false --min-size 0 --dictionary-threshold 0 binjs/brotli: 1.05 Real js samples$ cargo run --release --example sample_directory -- --in ~/Downloads/scrap/ --sampling 0.2 --depth 1 --follow-symlinks false --min-size 0 --dictionary-threshold 0 binjs/brotli: 1.03 |
In #293, we have a mechanism to embed probability tables in the prelude. It actually seems to indicate that probability tables don't take that much space.
Could we possibly improve our compression results by giving up on the idea of shared probability tables and rather embedding the probability tables in the file?
The text was updated successfully, but these errors were encountered: