# Add nav2 perception pipeline (RGB → depth → point cloud) #1
**README.md**
# navigation2.ai

Nav2 Integrations, Contact Points, and Demos using AI

## nav2_depth_estimation_ai

A ROS 2 composable perception pipeline that integrates a camera driver, image preprocessing, AI-based depth estimation using DepthAnything V3, and point cloud projection into a modular component container for use in navigation.

More info: [nav2_depth_estimation_ai README](nav2_depth_estimation_ai/README.md)
**nav2_depth_estimation_ai/CMakeLists.txt**
```cmake
cmake_minimum_required(VERSION 3.8)
project(nav2_depth_estimation_ai)

find_package(ament_cmake REQUIRED)

install(DIRECTORY
  config
  launch
  params
  DESTINATION share/${PROJECT_NAME}
)

ament_package()
```
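Note that this CMakeLists installs only data directories (`config`, `launch`, `params`): the pipeline itself is assembled from composable nodes provided by other packages, as the README below describes.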
**nav2_depth_estimation_ai/README.md**
# nav2_depth_estimation_ai

This package provides a **perception pipeline** that performs AI-based depth estimation on RGB images using DepthAnything V3, for use in navigation and mobility tasks.

The pipeline is designed to be **modular and configurable**, allowing users to swap components such as image sources and depth estimation models via a YAML configuration file.

All components run as **ROS 2 composable nodes** inside a single container for efficient intra-process communication.
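For reference, a minimal sketch of such a container (assuming the `usb_cam` source used in the examples below; this is illustrative, not the package's actual `perception_pipeline.launch.py`):

```python
# Minimal composable-node container with intra-process communication enabled.
from launch import LaunchDescription
from launch_ros.actions import ComposableNodeContainer
from launch_ros.descriptions import ComposableNode

def generate_launch_description():
    container = ComposableNodeContainer(
        name="perception_container",
        namespace="",
        package="rclcpp_components",
        executable="component_container",
        composable_node_descriptions=[
            ComposableNode(
                package="usb_cam",
                plugin="usb_cam::UsbCamNode",
                name="image_source",
                # Avoids serializing images between components in one process.
                extra_arguments=[{"use_intra_process_comms": True}],
            ),
        ],
        output="screen",
    )
    return LaunchDescription([container])
```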
## Pipeline Architecture

```mermaid
graph LR
  Source[Image Source] -->|Image Topic| Preprocess[Image Preprocessing]
  Preprocess -->|Processed Image| Depth[Depth Estimator]
  Depth -->|Depth Image| Projection[PointCloud Projection]
  Projection -->|PointCloud| Nav2Costmap((Nav2Costmap))
```
|
|
||
| The pipeline performs the following transformations: | ||
|
|
||
| 1. RGB images are captured from a image source. | ||
| 2. Optional preprocessing (crop/resize/decimation) is applied. | ||
| 3. A depth estimation model generates a depth map. | ||
| 4. The depth map is converted into a **3D point cloud**. | ||
|
|
||
| The resulting point cloud can be used by **Nav2 perception pipelines, mapping systems, or obstacle detection modules**. | ||
|
|
||
## Demo

Perception pipeline that generates depth maps and point clouds from RGB input.

https://github.com/user-attachments/assets/12ce2808-099f-4718-b8c1-1de120bb601a
## Dependencies

### Core Dependencies

The following packages are required for the basic pipeline:

- `image_proc` – used for image preprocessing operations.
- `depth_image_proc` – used to project depth images into point clouds.

### Example Dependencies

The pipeline can be configured with different nodes. A typical setup may include:

- `usb_cam` as the **RGB image source**
- [depth_anything_v3](https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt) as the **depth estimation model**
To build `depth_anything_v3` from source, follow these instructions:

```bash
cd ~/ros2_ws/src
git clone https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt.git
cd ..
rosdep install --from-paths src --ignore-src -r -y

# From your ROS 2 workspace
colcon build --packages-select depth_anything_v3 --cmake-args -DCMAKE_BUILD_TYPE=Release
source install/setup.bash
```
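After sourcing the workspace, `ros2 pkg prefix depth_anything_v3` should print the package's install prefix if the build succeeded.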
## Model Preparation

1. Obtain the ONNX model (two options):
   - **A.** Download the ONNX file from [Huggingface](https://huggingface.co/TillBeemelmanns/Depth-Anything-V3-ONNX)
   - **B.** Generate the ONNX file following the instructions [here](https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt/blob/main/onnx/README.md)
2. Place the model file: put the ONNX/engine file in the `models/` directory.
3. Update the configuration: modify `config/nav2_depth_ai_params.yaml` with the correct model path.
4. (Optional) Generate a TensorRT engine for optimized inference (if using the TensorRT backend):

```bash
./src/ros2-depth-anything-v3-trt/generate_engines.sh
```
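For step 3, the model path presumably corresponds to the depth estimator's `onnx_path` parameter shown in the Depth Estimator section below; an illustrative excerpt (the exact layout of `config/nav2_depth_ai_params.yaml` is an assumption):

```yaml
# Hypothetical excerpt: point onnx_path at the model file placed in step 2.
depth_anything_v3:
  ros__parameters:
    onnx_path: "~/ros2_ws/install/depth_anything_v3/share/depth_anything_v3/models/DA3METRIC-LARGE.onnx"
```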
For the upstream build instructions, see:

https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt/tree/main#building

---
### Image Source

Defines the node responsible for providing the **input image stream** to the perception pipeline.

Example configuration:

```yaml
image_source:
  type: rgb
  package: usb_cam
  plugin: usb_cam::UsbCamNode
  parameters:
    video_device: /dev/video0
    image_width: 640
    image_height: 480
    pixel_format: mjpeg2rgb
    frame_rate: 30.0
  topics:
    output_topic: /image_raw
    camera_info_topic: /camera_info
```
| Parameter | Description |
| -------------------------- | --------------------------------------------------------------------------- |
| `type` | Input image type used by the pipeline. Supported types: `rgb` or `depth`. |
| `package` | ROS 2 package that provides the image source node. |
| `plugin` | Fully qualified composable node plugin used to start the node. |
| `parameters` | Configuration parameters passed to the image source node. |
| `topics.output_topic` | Topic where the node publishes the image stream. |
| `topics.camera_info_topic` | Topic where the node publishes camera calibration information. |
---
### Image Preprocessing

Image preprocessing can be enabled to crop, decimate, or resize the image before depth estimation.

```yaml
usb_cam:
  ros__parameters:
    video_device: /dev/video0
    image_width: 640
    image_height: 480
    pixel_format: mjpeg2rgb
    frame_rate: 30.0

crop_decimate:
  ros__parameters:
    x_offset: 0
    y_offset: 0
    width: 640
    height: 480
    decimation_x: 1
    decimation_y: 1

resize:
  ros__parameters:
    width: 504
    height: 280
```
Preprocessing nodes used:

* `image_proc::CropDecimateNode`
* `image_proc::ResizeNode`
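A hedged sketch of how these two components could be described for the container (parameter values mirror the YAML above; topic remappings are omitted since they depend on `image_proc`'s exact subscribed/published names):

```python
# Composable node descriptions for the crop/decimate and resize stages.
from launch_ros.descriptions import ComposableNode

preprocessing_nodes = [
    ComposableNode(
        package="image_proc",
        plugin="image_proc::CropDecimateNode",
        name="crop_decimate",
        parameters=[{"x_offset": 0, "y_offset": 0,
                     "width": 640, "height": 480,
                     "decimation_x": 1, "decimation_y": 1}],
    ),
    ComposableNode(
        package="image_proc",
        plugin="image_proc::ResizeNode",
        name="resize",
        parameters=[{"width": 504, "height": 280}],
    ),
]
```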
---
### Depth Estimator

If the input type is **RGB**, a depth estimation model is used to generate a depth image from the incoming RGB frames.

Example configuration:

```yaml
depth_anything_v3:
  ros__parameters:
    # Model configuration
    onnx_path: "~/ros2_ws/install/depth_anything_v3/share/depth_anything_v3/models/DA3METRIC-LARGE.onnx"
    precision: "fp16"  # fp16 or fp32

    # Debug configuration
    enable_debug: true
    debug_colormap: "JET"  # JET, HOT, COOL, SPRING, SUMMER, AUTUMN, WINTER, BONE, GRAY, HSV, PARULA, PLASMA, INFERNO, VIRIDIS, MAGMA, CIVIDIS
    debug_filepath: "/tmp/depth_anything_v3_debug/"
    write_colormap: false
    debug_colormap_min_depth: 0.0  # Minimum depth value for colormap visualization
    debug_colormap_max_depth: 50.0  # Maximum depth value for colormap visualization
    sky_threshold: 0.3  # Threshold for sky classification (lower = more sky)
    sky_depth_cap: 200.0  # Maximum depth value to fill sky regions

    # Point cloud downsampling (1 = no downsampling, 10 = every 10th point)
    point_cloud_downsample_factor: 2

    # Point cloud colorization with RGB from the input image
    colorize_point_cloud: true  # Set to true to publish an RGB point cloud instead of XYZ only
```
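As a rough illustration of what `point_cloud_downsample_factor` means (assumed stride semantics, per the comment above; this is not the package's implementation):

```python
# Keep every Nth point of a flat point array; factor 1 keeps everything,
# factor 10 keeps every 10th point.
import numpy as np

points = np.random.rand(10000, 3).astype(np.float32)  # stand-in XYZ cloud
factor = 2
downsampled = points[::factor]
print(points.shape, "->", downsampled.shape)  # (10000, 3) -> (5000, 3)
```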
---
## Running the Pipeline

Launch the pipeline:

```bash
ros2 launch nav2_depth_estimation_ai perception_pipeline.launch.py
```

All nodes run inside a **ComposableNodeContainer**.
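Once the container is up, loaded components can be verified with `ros2 component list`, which should show each pipeline node inside the container.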
---
## Output Topics

| Topic | Description |
| ------------------------------ | ---------------------------------- |
| `/pipeline/image_raw` | Raw image from the camera |
| `/pipeline/image_preprocessed` | Preprocessed image |
| `/pipeline/depth` | Depth image generated by the model |
| `/pipeline/points` | Generated 3D point cloud |

---
## Troubleshooting

### 1. Depth estimator dependency mismatch

If you are using `depth_anything_v3`, ensure that the dependency versions match those required by the package.

Refer to the official repository for tested dependencies:

https://github.com/ika-rwth-aachen/ros2-depth-anything-v3-trt#dependencies

Version mismatches (e.g., TensorRT, CUDA) may prevent the depth estimator from loading or running correctly.

---
### 2. Point cloud not visualizing

If the pipeline is running and point cloud messages are being published but no data appears in RViz or other visualization tools, check the `camera_info` topic.

You can verify it using:

```bash
ros2 topic echo /pipeline/camera_info
```

If the intrinsic camera parameters are all zeros, `depth_image_proc` will not be able to correctly project the depth image into a point cloud.
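For intuition, `depth_image_proc` back-projects pixels with the standard pinhole camera model, so zeroed intrinsics make the math degenerate. A minimal sketch (not the library's actual code):

```python
# Pinhole back-projection of a pixel (u, v) with depth z using camera
# intrinsics (fx, fy, cx, cy) from camera_info. With fx == fy == 0 the
# division is undefined, so no valid point cloud can be produced.
def backproject(u: float, v: float, z: float,
                fx: float, fy: float, cx: float, cy: float):
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return (x, y, z)
```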
Ensure that:

- The camera is properly calibrated
- A valid `camera_info` message is being published

---
## Note to Future Contributors

If any changes are made to the pipeline architecture, configuration structure, or node interfaces, please update the README and documentation accordingly to keep them consistent with the implementation.