Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support importing safetensors format #626

Open
wdoppenberg opened this issue Aug 10, 2023 · 7 comments
Open

Support importing safetensors format #626

wdoppenberg opened this issue Aug 10, 2023 · 7 comments
Labels
feature The feature request

Comments

@wdoppenberg
Copy link
Contributor

wdoppenberg commented Aug 10, 2023

Safetensors (de)serialization

Feature description

Load, at either tensor- or module-level, weights from a .safetensors file.

Feature motivation

To be able to tap into the huggingface ecosystem it would be useful to have the ability to (de)serialize for safetensors. This would obviously have the limitation of these files merely containing a key-value JSON for all weights, and no description of the model's topology/architecture just like ONNX does. Ideally this would help grow the Model Zoo.

This would then mostly be a convenience feature, both for sharing and using existing models.

(Optional) Suggest a Solution

Possibly we could start by deriving the TryFrom trait for their TensorView and our Tensor and take it from there. Huggingface's candle (Rust) framework already has the capability to (de)serialize from .safetensors.

@antimora
Copy link
Collaborator

I think natively we should stick to the file format we have currently, primarily because it's difficult to maintain many formats. We already support JSON (for visual debugging), Bincode (for embedding data), and MessagePack (for long-term and cross-language support). However, we should support file conversion from other formats, such as safetensors, numpy, and pickle. The conversion code should be placed in burn-import under a feature flag included in the default feature set.

I also would like for us to build a tool to do basic structure conversion. What I mean by that is we need some ability to remap field names and also change the depth/level of field nodes.

@antimora antimora added the feature The feature request label Aug 10, 2023
@antimora antimora changed the title Safetensors (de)serialization Support importing safetensors format Mar 28, 2024
@jin-eld
Copy link
Contributor

jin-eld commented Apr 28, 2024

Hi, is there any timeline for this feature? Given the large number of .safetensor models the lack of a conversion tool does hurt...

@nathanielsimard
Copy link
Member

Not yet, but we may implement this sonner rather than later

@antimora
Copy link
Collaborator

antimora commented Apr 29, 2024

@nathanielsimard it would be easy to do now that PyTorchFileRecorder works nicely. We just need to do the same things but using safetensors reader which is available from Candle.

@wandbrandon
Copy link

wandbrandon commented Jan 20, 2025

Is there a way to translate candle's implementation of safetsensors into burn format in memory? Just until this feature is completed

@antimora
Copy link
Collaborator

antimora commented Jan 20, 2025

Is there a way to translate candle's implementation of safetsensors into burn format in memory? Just until this feature is completed

We are already using Candle's PyTorch pt reader so it would be very similar and easy adoption for safetensors format. I personally did not have use cases to do for safetensors format so it was left unfinished.

PRs are welcome. We will prioritize the review if you submit one.

@wandbrandon
Copy link

Is there a way to translate candle's implementation of safetsensors into burn format in memory? Just until this feature is completed

We are already using Candle's PyTorch pt reader so it would be very similar and easy adoption for safetensors format. I personally did not have use cases to do for safetensors format so it was left unfinished.

PRs are welcome. We will prioritize the review if you submit one.

#2721

Hopefully, this is sufficient! tested and worked well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature The feature request
Projects
None yet
Development

No branches or pull requests

5 participants