feat: Allow loading and serializing with `tensorizer` #2

sangstar · 2024-12-19T20:49:31Z

This draft PR aims to unintrusively integrate tensorizer in to exllamav2's model loading machinery, along with adding tests for correctness and avoiding regressions.

Currently in a draft PR stage. The tests have been most recently updated, but the core logic needs to be made less intrusive and less smelly.

Still to add:

Add comments to test file for better clarity
Make way tensorizer is exposed in config machinery less intrusive
Decide whether to use io_handler for all Tensorizer I/O stuff or retire it altogether
Consider rethinking the way packaging tensorizer configurable stuff is done, whether it needs a dedicated arguments class or if just packing them in their config class is appropriate
Decide if .state_dict should be a public attribute
Allow passing TensorDeserializer args to calls to TensorDeserializer with some wrapper

The proper commit history for the old fork is not lost; it can be found at: #1

sangstar added 7 commits December 16, 2024 14:52

fix: Squash changes from old branch on to synced fork

aa2ad39

The proper commit history for the old fork is not lost; it can be found at: #1

fix: Fix identation issue

8d9427e

docs: Add text on motivation for local_config_path

957fa1e

tests: Fix tests

deb76d0

docs: Explain test module with docstring

5b195f3

chore: Revert unnecessary proposed docstring change

39c8c88

feat: (WIP) Add initial subclass-based integration strategy

5ff390a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Allow loading and serializing with `tensorizer` #2

feat: Allow loading and serializing with `tensorizer` #2

sangstar commented Dec 19, 2024 •

edited

Loading

feat: Allow loading and serializing with tensorizer #2

Are you sure you want to change the base?

feat: Allow loading and serializing with tensorizer #2

Conversation

sangstar commented Dec 19, 2024 • edited Loading

feat: Allow loading and serializing with `tensorizer` #2

feat: Allow loading and serializing with `tensorizer` #2

sangstar commented Dec 19, 2024 •

edited

Loading