ONNXRuntime-Extensions

What's ONNXRuntime-Extensions

Introduction: ONNXRuntime-Extensions is a C/C++ library that extends the capability of the ONNX models and inference with ONNX Runtime, via ONNX Runtime Custom Operator ABIs. It includes a set of ONNX Runtime Custom Operator to support the common pre- and post-processing operators for vision, text, and nlp models. And it supports multiple languages and platforms, like Python on Windows/Linux/macOS, some mobile platforms like Android and iOS, and Web-Assembly etc. The basic workflow is to enhance a ONNX model firstly and then do the model inference with ONNX Runtime and ONNXRuntime-Extensions package.

Quickstart

The library can be utilized as either a C/C++ library or other advance language packages like Python, Java, C#, etc. To build it as a shared library, you can use the build.bat or build.sh scripts located in the root folder. The CMake build definition is available in the CMakeLists.txt file and can be modified by appending options to build.bat or build.sh, such as build.bat -DOCOS_BUILD_SHARED_LIB=OFF. For more details, please refer to the C API documentation.

Python installation

pip install onnxruntime-extensions

The nightly build is also available for the latest features, please refer to nightly build

Usage

1. Generation of Pre-/Post-Processing ONNX Model

The onnxruntime-extensions Python package provides a convenient way to generate the ONNX processing graph. This can be achieved by converting the Huggingface transformer data processing classes into the desired format. For more detailed information, please refer to the API below:

help(onnxruntime_extensions.gen_processing_models)

NOTE:

The generation of model processing requires the ONNX package to be installed. The data processing models generated in this manner can be merged with other models using the onnx.compose if needed.

2. Using Extensions for ONNX Runtime inference

Python

There are individual packages for the following languages, please install it for the build.

import onnxruntime as _ort
from onnxruntime_extensions import get_library_path as _lib_path

so = _ort.SessionOptions()
so.register_custom_ops_library(_lib_path())

# Run the ONNXRuntime Session, as ONNXRuntime docs suggested.
# sess = _ort.InferenceSession(model, so)
# sess.run (...)

C++

  // The line loads the customop library into ONNXRuntime engine to load the ONNX model with the custom op
  Ort::ThrowOnError(Ort::GetApi().RegisterCustomOpsLibrary((OrtSessionOptions*)session_options, custom_op_library_filename, &handle));

  // The regular ONNXRuntime invoking to run the model.
  Ort::Session session(env, model_uri, session_options);
  RunSession(session, inputs, outputs);

Java

var env = OrtEnvironment.getEnvironment();
var sess_opt = new OrtSession.SessionOptions();

/* Register the custom ops from onnxruntime-extensions */
sess_opt.registerCustomOpLibrary(OrtxPackage.getLibraryPath());

C#

SessionOptions options = new SessionOptions()
options.RegisterOrtExtensions()
session = new InferenceSession(model, options)

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

License

MIT License

Name	Name	Last commit message	Last commit date
Latest commit wenbingl fix the json_pointer compiler warnings within its latest release (#904 ) Mar 5, 2025 4c3ae1b · Mar 5, 2025 History 636 Commits
.config	.config	Update tsaoptions.json (#309 )	Oct 25, 2022
.github	.github	Add stale issue handler (#875 )	Jan 18, 2025
.pipelines	.pipelines	upgrade the emsdk compiler to 4.x in CI pipeline (#897 )	Feb 27, 2025
.pyproject	.pyproject	Unify the image operations in extensions library (#831 )	Oct 30, 2024
base	base	Fix compilation error in GenAI (#864 )	Dec 23, 2024
cmake	cmake	fix the json_pointer compiler warnings within its latest release (#904 )	Mar 5, 2025
docs	docs	Improve Documentation: Add Hugging Face Compatibility Docs and Refine…	Sep 30, 2024
include	include	Update op_def_struct.h to fix memory leak (#888 )	Feb 9, 2025
java	java	Fix the windows API missing issue and Linux shared library size issue…	Jul 29, 2024
nuget	nuget	Update macosx framework packaging to follow apple guidelines (#776 )	Aug 13, 2024
onnxruntime_extensions	onnxruntime_extensions	Add initial Python API decoder support (#869 )	Jan 14, 2025
operators	operators	fix the json_pointer compiler warnings within its latest release (#904 )	Mar 5, 2025
prebuild	prebuild	Enable using system certs on Android. (#543 )	Aug 24, 2023
pyop	pyop	Add regex loading from tokenizer.json and code refinement (#863 )	Jan 7, 2025
shared	shared	Support audio attention mask for multiple audio file preprocessing fo…	Mar 4, 2025
test	test	Support audio attention mask for multiple audio file preprocessing fo…	Mar 4, 2025
tools	tools	enable OS native codecs by default for all pipelines (#855 )	Dec 20, 2024
tutorials	tutorials	fix the build for mobile packaging (#843 )	Nov 18, 2024
.clang-format	.clang-format	Add a generic image processor and its C API (#745 )	Jun 20, 2024
.clang-tidy	.clang-tidy	Refactor the header file directory and integrate the eager tensor imp…	Apr 17, 2024
.flake8	.flake8	initial checkins	Oct 12, 2020
.gitignore	.gitignore	Enhancing CUDA Support in Python Package Build and Testing (#608 )	Nov 27, 2023
.sscignore	.sscignore	Fix Secure Supply Chain Analysis Warning in PR pipeline (#414 )	May 4, 2023
.swift-format	.swift-format	Update OrtExtensionsUsage to also use the ORT Objective-C API. (#483 )	Sep 25, 2023
CMakeLists.txt	CMakeLists.txt	add the missing header file string_view in ortx_tokenizer.h (#880 )	Feb 3, 2025
CODEOWNERS	CODEOWNERS	Update CODEOWNERS	Aug 23, 2023
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Initial CODE_OF_CONDUCT.md commit	Oct 5, 2020
LICENSE	LICENSE	Updating LICENSE to template content	Oct 5, 2020
MANIFEST.in	MANIFEST.in	Remove OpenCV dependency from C_API mode (#800 )	Sep 4, 2024
README.md	README.md	Improve Documentation: Add Hugging Face Compatibility Docs and Refine…	Sep 30, 2024
SECURITY.md	SECURITY.md	Initial SECURITY.md commit	Oct 5, 2020
ThirdPartyNotices.txt	ThirdPartyNotices.txt	fix the json_pointer compiler warnings within its latest release (#904 )	Mar 5, 2025
build.android	build.android	add the decoder_prompt_id for whisper tokenizer (#775 )	Jul 29, 2024
build.bat	build.bat	Fix the Unicode code discrepency on CLIP model (#814 )	Sep 23, 2024
build.ios_xcframework	build.ios_xcframework	Add iOS packaging pipeline. (#327 )	Dec 23, 2022
build.sh	build.sh	Remove OpenCV dependency from C_API mode (#800 )	Sep 4, 2024
build_lib.bat	build_lib.bat	Add build.py to make it easier for developers to build different vari…	Jan 2, 2023
build_lib.sh	build_lib.sh	Enable C++ unit tests on iOS (#560 )	Sep 18, 2023
cgmanifest.json	cgmanifest.json	fix the json_pointer compiler warnings within its latest release (#904 )	Mar 5, 2025
pyproject.toml	pyproject.toml	Remove numpy dependency from its Python binary build (#657 )	Feb 21, 2024
requirements-dev.txt	requirements-dev.txt	Fix the pipeline breaks dues to the MSVC 19.40 and numpy 2.0 release (#…	Jun 17, 2024
setup.cfg	setup.cfg	Minor changes to test CI PR trigger (#634 )	Jan 16, 2024
setup.py	setup.py	make onnx package to be optional. (#653 )	Feb 15, 2024
version.txt	version.txt	Bump version from 0.14.0 to 0.15.0 (#893 )	Feb 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ONNXRuntime-Extensions

What's ONNXRuntime-Extensions

Quickstart

Python installation

Usage

1. Generation of Pre-/Post-Processing ONNX Model

NOTE:

2. Using Extensions for ONNX Runtime inference

Python

C++

Java

C#

Contributing

License

About

Releases 14

Packages

Contributors 56

Languages

License

microsoft/onnxruntime-extensions

Folders and files

Latest commit

History

Repository files navigation

ONNXRuntime-Extensions

What's ONNXRuntime-Extensions

Quickstart

Python installation

Usage

1. Generation of Pre-/Post-Processing ONNX Model

NOTE:

2. Using Extensions for ONNX Runtime inference

Python

C++

Java

C#

Contributing

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases 14

Packages 0

Contributors 56

Languages

Packages