Infero lets you easily download, convert, and host your models using the ONNX Runtime. It provides a simple CLI to run and maintain them.
- Automatic downloads.
- Automatic ONNX conversions (see the export sketch after this list).
- Automatic server setup.
- 8-bit quantization support.
- GPU support.
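Infero handles conversion for you, but if you are curious what an ONNX export involves under the hood, here is a minimal sketch using PyTorch's built-in exporter. The model and file name are illustrative, not Infero internals:

```python
# Minimal sketch of a PyTorch -> ONNX export, roughly the kind of step
# an automatic conversion pipeline performs. Names are illustrative.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(16, 8), nn.ReLU(), nn.Linear(8, 2))
model.eval()

dummy_input = torch.randn(1, 16)  # example input used to trace the graph
torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    dynamic_axes={"input": {0: "batch"}},  # allow a variable batch size
)
```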
To install Infero, run the following command:

```bash
pip install infero
```
Here is a simple example of how to use Infero. To pull a model from the Hugging Face Hub:

```bash
infero pull [hf_model_name]
```
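For example, pulling a publicly available Hugging Face model (the model name below is just an illustration):

```bash
infero pull distilbert-base-uncased-finetuned-sst-2-english
```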
To run a model:

```bash
infero run [hf_model_name]
```
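Under the hood, hosting an ONNX model comes down to an ONNX Runtime InferenceSession. The following is a minimal sketch of that inference step, not Infero's actual code; the file path, tensor shape, and provider choice are illustrative:

```python
# Minimal sketch: load a converted model and run it with ONNX Runtime.
import numpy as np
import onnxruntime as ort

# Swap in "CUDAExecutionProvider" to run on a GPU instead.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])

input_name = session.get_inputs()[0].name
batch = np.random.randn(1, 16).astype(np.float32)  # dummy input batch

outputs = session.run(None, {input_name: batch})  # None -> fetch all outputs
print(outputs[0])
```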
With 8-bit quantization:

```bash
infero run [hf_model_name] --quantize
```
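For reference, 8-bit quantization of an ONNX model can be done with ONNX Runtime's quantization tooling. This is a minimal sketch of dynamic quantization; the file names are illustrative, and the --quantize flag may use different settings internally:

```python
# Minimal sketch: dynamic 8-bit quantization with ONNX Runtime.
from onnxruntime.quantization import QuantType, quantize_dynamic

quantize_dynamic(
    model_input="model.onnx",        # full-precision input model
    model_output="model.int8.onnx",  # quantized output model
    weight_type=QuantType.QInt8,     # store weights as signed 8-bit ints
)
```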
To list all available models:

```bash
infero list
```
To remove a model:

```bash
infero remove [hf_model_name]
```
Infero is licensed under the MIT License. See the LICENSE file for more details.
For any questions or feedback, please contact us at [email protected].