Issues: triton-inference-server/onnxruntime_backend
Built-in support for (custom?) decryption of model weights
#279 opened Oct 28, 2024 by vadimkantorov
Deploy TTS model with Triton and onnx backend, failed: Protobuf parsing failed
#272 opened Sep 25, 2024 by AnasAlmana
Triton ONNX runtime backend slower than onnxruntime python client on CPU
#265 opened Aug 19, 2024 by Mitix-EPI
Failed to allocate memory for requested buffer of size X
#249 opened Mar 21, 2024 by aaditya-srivathsan
CPU Throttling when Deploying Triton with ONNX Backend on Kubernetes
#245 opened Mar 1, 2024 by langong347
Enable "trt_build_heuristics_enable" optimization for onnxruntime-TensorRT
#241 opened Feb 23, 2024 by tobaiMS
Error while Loading YOLOv8 Model with EfficientNMS_TRT Plugin in TRITON
#210 opened Aug 30, 2023 by whitewalker11
Onnxruntime backend error when workload is high since Triton uses CUDA 12 (label: bug)
#203 opened Jul 8, 2023 by zeruniverse
Add enable_dynamic_shapes To Model Config To Resolve CNN Memory Leaks With OpenVino EP
#194 opened Jun 2, 2023 by narolski
InvalidArgumentError: The tensor Input (Input) of Slice op is not initialized.
#191 opened May 25, 2023 by qiu-pinggaizi