DeepStream Pipeline
Hardware-accelerated processing via GStreamer/DeepStream. Runs TensorRT YOLO engines directly on GPU, passing frames through appsink for low-copy Python processing.
The open-source vision framework for edge devices. Runs DeepStream pipelines, YOLO detection, and world models at 60 FPS on NVIDIA Jetson, Intel NPU, and Hailo.
Hardware-accelerated processing via GStreamer/DeepStream. Runs TensorRT YOLO engines directly on GPU, passing frames through appsink for low-copy Python processing.
Simultaneous FaceMesh, hand gesture recognition, and body pose estimation running alongside primary object detection.
Publishes telemetry across dedicated topics such as detections, depth, and pose with a multithreaded executor strategy.
Predictive planning with LeWM and V-JEPA style world models for temporal awareness, safety checks, and forward simulation.
Tested on NVIDIA Jetson Orin Nano (8GB) in MAXN mode.
| Configuration | Models Active | Frame Rate | Latency |
|---|---|---|---|
| Detection Only (INT8) | YOLO TensorRT | 60 FPS | 16ms |
| Minimal Pipeline | Detection + Depth + Tracking | 35-40 FPS | 28ms |
| Full Pipeline | Detection + Face + Gesture + Pose | 25-30 FPS | 38ms |
| World Model Planning | LeWM (Inference) | 200 Hz | 5ms |