
Conversation

@onuralpszr onuralpszr commented Dec 31, 2025

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Adds Docker build artifacts and a Rust-based inference HTTP server (with Swagger UI) plus a containerized CLI for running Ultralytics inference.

📊 Key Changes

  • Added a new .dockerignore to reduce Docker build context and exclude artifacts, models, tests, and docs.
  • Introduced docker/Dockerfile-cli to build and ship a rootless runtime image for the ultralytics-inference CLI (downloads a default yolo11n.onnx).
  • Introduced docker/Dockerfile-server and a new Rust crate under docker/server/ to run an Axum-based inference server.
  • Implemented server endpoints: / (root), /health, /info, and POST /predict (multipart upload with conf and max_det query params), plus Swagger UI at /swagger-ui. An example request is sketched below.
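
For quick manual checks once the server container is running, a request against the predict endpoint could look like the sketch below; host, port, file name, and parameter values are placeholders, while the image form field and the conf/max_det query parameters follow the endpoint description above.

# Placeholder host/port and image file; adjust to the actual container mapping
curl -X POST "http://localhost:8080/predict?conf=0.25&max_det=100" \
  -F "image=@bus.jpg"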

🎯 Purpose & Impact

  • Makes it easier to deploy inference as a containerized service or run a containerized CLI 📦
  • Provides an OpenAPI/Swagger UI for quick API exploration and integration 🧭
  • Improves security posture by running containers as a non-root user 🔐
  • Enables consistent, reproducible builds for users who want a turnkey inference server 🚀

@UltralyticsAssistant UltralyticsAssistant added the dependencies (Dependency-related topics) and enhancement (New feature or request) labels on Dec 31, 2025
@UltralyticsAssistant

👋 Hello @onuralpszr, thank you for submitting an ultralytics/inference 🚀 PR! This is an automated message to help with review; an Ultralytics engineer will assist soon. A few quick checks to ensure smooth integration ✨

  • Define a Purpose: Clearly explain the purpose of your fix or feature in your PR description, and link to any relevant issues. Ensure your commit messages are clear, concise, and adhere to the project's conventions.
  • Synchronize with Source: Confirm your PR is synchronized with the ultralytics/inference main branch. If it's behind, update it by clicking the 'Update branch' button or by running git pull and git merge main locally.
  • Ensure CI Checks Pass: Verify all Ultralytics Continuous Integration (CI) checks are passing. If any checks fail, please address the issues.
  • Update Documentation: Update the relevant documentation for any new or modified features.
  • Add Tests: If applicable, include or update tests to cover your changes, and confirm that all tests are passing.
  • Sign the CLA: Please ensure you have signed our Contributor License Agreement if this is your first Ultralytics PR by writing "I have read the CLA Document and I sign the CLA" in a new message.
  • Minimize Changes: Limit your changes to the minimum necessary for your bug fix or feature addition. "It is not daily increase but daily decrease, hack away the unessential. The closer to the source, the less wastage there is." — Bruce Lee

For more guidance, please refer to our Contributing Guide. Don't hesitate to leave a comment if you have any questions. Thank you for contributing to Ultralytics! 🚀


@UltralyticsAssistant UltralyticsAssistant left a comment


🔍 PR Review

Made with ❤️ by Ultralytics Actions

Solid addition overall: Docker artifacts are clear, the server routes + Swagger UI integration look coherent, and non-root runtime is a good security baseline. Main issues to address are server robustness (multipart error handling and avoiding a potential panic on empty results) and Docker build reliability (ensure CA certs in the server builder stage and avoid fragile wildcard copies for ONNX Runtime shared libs).

💬 Posted 6 inline comments

}
};

let result = &results[0];


CRITICAL: let result = &results[0]; will panic if the model returns an empty results vec (e.g., unexpected input, internal model error, or library behavior changes). This should be handled gracefully (return 500 with an ErrorResponse) to avoid crashing the server process.
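
A minimal sketch of the suggested guard, using axum's StatusCode and Json; the ErrorResponse shape (a message field) and the handler's (StatusCode, Json<ErrorResponse>) error variant are assumptions to adjust to the actual types:

// Sketch: avoid indexing into a possibly empty results vec.
// ErrorResponse { message } and the error tuple shape are assumptions.
let Some(result) = results.first() else {
    return Err((
        StatusCode::INTERNAL_SERVER_ERROR,
        Json(ErrorResponse {
            message: "Model returned no results".to_string(),
        }),
    ));
};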

COPY --from=builder /usr/src/app/target/release/ultralytics-inference /usr/local/bin/ultralytics-inference

# Copy ONNX Runtime shared libraries
COPY --from=builder /usr/src/app/target/release/libonnxruntime*.so* /usr/lib/


⚠️ HIGH: COPY --from=builder .../target/release/libonnxruntime*.so* /usr/lib/ assumes ONNX Runtime shared libs are present under target/release/. If the build produces no matching files (feature changes, different linking strategy, etc.), the Docker build will fail at this step. Consider making the presence of these libs explicit (e.g., copying from a known output dir/artifact) or ensuring the build always emits them.
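
One possible shape, sketched with assumed paths: stage the libraries into a fixed directory in the builder stage so a missing .so fails the build there with a clear error, and the runtime COPY uses a deterministic source path.

# Builder stage: collect the ONNX Runtime libs into a known location (paths are assumptions)
RUN mkdir -p /opt/onnxruntime-libs \
    && cp /usr/src/app/target/release/libonnxruntime*.so* /opt/onnxruntime-libs/

# Runtime stage: deterministic copy
COPY --from=builder /opt/onnxruntime-libs/ /usr/lib/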

FROM ubuntu:questing-20251029 AS builder

# Install build dependencies + Rust (image-only)
RUN apt-get update && apt-get install -y \


⚠️ HIGH: Builder stage installs curl but not ca-certificates. rustup download uses TLS and may fail in minimal images without CA roots. Add ca-certificates to the builder stage deps to make the build more robust.
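
A minimal sketch of the fix; the package list here is trimmed to the relevant entries, with the rest of the original build dependencies unchanged:

# Builder deps: ca-certificates alongside curl so the rustup TLS download has CA roots
RUN apt-get update && apt-get install -y \
    curl \
    ca-certificates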

COPY --from=builder /usr/src/app/docker/server/target/release/ultralytics-inference-server /usr/local/bin/ultralytics-inference-server

# Copy ONNX Runtime shared libraries
COPY --from=builder /usr/src/app/docker/server/target/release/libonnxruntime*.so* /usr/lib/


⚠️ HIGH: Same concern as the CLI image: copying libonnxruntime*.so* from docker/server/target/release/ will fail the Docker build if no files match. It would be safer to copy from a deterministic location or ensure the server build step always produces these shared libs.

let max_det = params.max_det.unwrap_or(300);

// Extract image from multipart form
while let Ok(Some(field)) = multipart.next_field().await {


⚠️ HIGH: while let Ok(Some(field)) = multipart.next_field().await silently treats multipart parsing errors as end-of-stream, which can misreport a malformed request as "Missing 'image' field" (400) instead of a 400/500 with the actual error. Consider explicitly handling Err(e) from next_field() and returning an appropriate error response.
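
A sketch of the explicit handling, assuming the same ErrorResponse/StatusCode error shape used elsewhere in the handler; the existing per-field logic moves inside the loop unchanged:

// Sketch: distinguish genuine end-of-stream from a multipart parse error.
loop {
    let field = match multipart.next_field().await {
        Ok(Some(field)) => field,
        Ok(None) => break, // real end of the multipart body
        Err(e) => {
            return Err((
                StatusCode::BAD_REQUEST,
                Json(ErrorResponse {
                    message: format!("Invalid multipart request: {e}"),
                }),
            ));
        }
    };
    // ... existing handling of the "image" field ...
}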

};

// Run inference
let mut model = state.model.lock().await;


💡 MEDIUM: The model mutex is held across the entire inference + response construction path. This serializes all requests and also increases tail latency under load. If YOLOModel supports concurrent inference, consider narrowing the critical section to only the model call (cloning or copying any data you need from result before unlocking), or using a pool of models (or sharded models) if multi-request throughput is intended.
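
A sketch of the narrowed critical section; the predict call and its arguments are illustrative names rather than the crate's actual API, and the point is only that the guard is dropped before the response is built:

// Sketch: scope the lock to the inference call only (names are illustrative).
let results = {
    let mut model = state.model.lock().await;
    model.predict(&image_bytes, conf, max_det)
}; // MutexGuard dropped here, before response construction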


codecov bot commented Dec 31, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

@onuralpszr onuralpszr marked this pull request as draft January 3, 2026 02:45
