
[Feature] Handling onnxrt execution provider config for various models #1098

Closed
manickavela29 opened this issue Jul 9, 2024 · 3 comments

@manickavela29
Contributor

manickavela29 commented Jul 9, 2024

Hi @csukuangfj,

With #992, configs for backends are handled as arguments, and that work is done.

But there is an additional issue with arguments and models: the suggested configs are not specific to each model.
In the case of zipformer,

  • configs required for the encoder model are not necessarily required for the decoder and joiner
  • TensorRT has max_workspace_size, which limits the GPU memory usage for a session
  • the encoder model can take more space, which is not necessary for the decoder; but with our current arguments, all of them are allocated the same memory

Solution:
for the zipformer case,
adding encoder_config, decoder_config, and joiner_config

std::string encoder;
std::string decoder;
std::string joiner;

and a new overloaded function at the providers and session.
All the configs will be hardcoded for the specific model rather than passed as arguments when starting sherpa.
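To make the idea concrete, here is a minimal sketch of such an overload. The names and sizes below (SessOpts, GetSessionOptions, the 2 GB / 256 MB budgets) are illustrative assumptions, not the actual sherpa-onnx API; the real TensorRT EP option in ONNX Runtime is trt_max_workspace_size:

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical stand-in for the session options built for one model.
struct SessOpts {
  int64_t trt_max_workspace_size = 0;  // TensorRT EP scratch-memory cap, bytes
};

// Existing behavior: one shared config for every model.
SessOpts GetSessionOptions() {
  return SessOpts{1LL << 30};  // shared default, e.g. 1 GB
}

// Proposed overload: hardcode per-model provider settings instead of
// exposing three new command-line arguments.
SessOpts GetSessionOptions(const std::string &model_type) {
  SessOpts opts = GetSessionOptions();
  if (model_type == "encoder") {
    opts.trt_max_workspace_size = 2LL << 30;    // encoder needs more space
  } else {
    opts.trt_max_workspace_size = 256LL << 20;  // decoder / joiner are small
  }
  return opts;
}
```

The no-argument version keeps today's shared behavior, so existing call sites stay untouched while the encoder, decoder, and joiner sessions each get their own budget.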

But if you have any better suggestions let me know

@manickavela29 manickavela29 changed the title [Feature] Handling onnxrt execution provider config for several models flexibly [Feature] Handling onnxrt execution provider config for various models Jul 9, 2024
@csukuangfj
Collaborator

Adding encoder_config, decoder_config and joiner_config

That would introduce too many command-line arguments.


How about creating a separate sess_opts_ for the decoder and the joiner, and hard-coding the config values when TensorRT is used?

@manickavela29
Contributor Author

Yes, I meant something along similar lines.

By adding separate configs for the encoder, decoder, and joiner,
I actually meant they would be hardcoded and not exposed as arguments.

And as you suggested, each will have a separate sess_opts_ built from its hardcoded custom config.

@manickavela29
Contributor Author

manickavela29 commented Jul 15, 2024

Actually, given the model size of the decoder and joiner,
we might as well just run them with the CUDA EP itself,
since the encoder is the only heavy lifter here.
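That follow-up simplifies the per-model logic to a single EP choice. A sketch, with an illustrative helper name (ProviderFor is hypothetical, not sherpa-onnx code): only the heavy encoder goes through the TensorRT EP, while the small decoder and joiner use the plain CUDA EP and need no TensorRT workspace tuning at all.

```cpp
#include <cassert>
#include <string>

// Hypothetical per-model execution-provider choice: TensorRT for the
// compute-heavy encoder, plain CUDA for the lightweight decoder/joiner.
std::string ProviderFor(const std::string &model_type) {
  return model_type == "encoder" ? "tensorrt" : "cuda";
}
```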
