support for ORT genai library #2356

tranlm · 2024-09-18T19:32:54Z

Hi there,

I've been evaluating your library as an approach for compressing models for serving. I see that you also rely on ONNX Runtime for loading and running models in batches. Does your library also support converting the compressed models so that I can load them with onnxruntime-genai? It's the inference engine I currently use for my app.

Thanks!

Replies: 0 comments
