Release Notes: Deep Learning Streamer (DL Streamer) Pipeline Framework Release 2026.0

Version 2026.0

Key highlights:

New elements: gvafpsthrottle, g3dradarprocess, g3dlidarparse
New model support: YOLOv26, YOLO-E, RT-DETR, HuggingFace ViT
Streamlined integration with Ultralytics and HuggingFace model hubs
GstAnalytics metadata support: DL Streamer supports GstAnalytics metadata for object detection, classification, and tracking, and adds a custom GstAnalytics extension for keypoints
gvawatermark overhaul: object blurring, text backgrounds, label filtering, extra fonts, thickness/color options, FPS overlay
Inference enhancements: batch timeout, OpenCV tensor compression for all devices
Windows platform: GPU inference via D3D11, gvapython support, CI integration, build/setup improvements
New Python samples: VLM Alerts, Smart NVR, ONVIF Discovery, face detection/age classification, open-vocabulary detection, RealSense, DL Streamer + DeepStream
Optimizer: multi-stream optimization, cross-stream batching, device selection, refactored with tests
Component updates: OpenVINO 2026.0.0, NPU driver 1.30, RealSense SDK 2.57.5
Library consolidation: merged gvawatermark3d, gvadeskew, gvamotiondetect, gvagenai into gstvideoanalytics
CI: Zizmor security scanning, Windows CI, Docker image size checks

Deep Learning Streamer (DL Streamer) Pipeline Framework is a streaming media analytics framework, based on the GStreamer* multimedia framework, for creating complex media analytics pipelines. It ensures pipeline interoperability and provides optimized media and inference operations using the Intel® Distribution of OpenVINO™ Toolkit Inference Engine backend across Intel® architecture: CPU, discrete GPU, integrated GPU, and NPU. The complete solution leverages:

Open source GStreamer* framework for pipeline management
GStreamer* plugins for input and output such as media files and real-time streaming from camera or network
Video decode and encode plugins, either CPU optimized plugins or GPU-accelerated plugins based on VAAPI
Deep Learning models converted from training frameworks such as TensorFlow* and Caffe*

The Pipeline Framework repository provides the following elements:

Element	Description
gvaattachroi	Adds user-defined regions of interest to perform inference on, instead of full frame.
gvaaudiodetect	Performs audio event detection using AclNet model.
gvaaudiotranscribe	Performs audio transcription using the OpenVINO™ GenAI Whisper model.
gvaclassify	Performs object classification. Accepts the ROI as an input and outputs classification results with the ROI metadata.
gvadetect	Performs object detection on a full-frame or region of interest (ROI) using object detection models such as YOLOv4-v11, MobileNet SSD, Faster-RCNN etc. Outputs the ROI for detected objects.
gvafpscounter	Measures frames per second across multiple streams in a single process.
gvafpsthrottle	Throttles the frame rate of a pipeline to a specified FPS value.
gvagenai	Performs inference with Vision Language Models using OpenVINO™ GenAI, accepts video and text prompt as an input, and outputs text description. It can be used to generate text summarization from video.
gvainference	Runs deep learning inference on a full-frame or ROI using any model with an RGB or BGR input.
gvametaaggregate	Aggregates inference results from multiple pipeline branches.
gvametaconvert	Converts the metadata structure to the JSON format.
gvametapublish	Publishes the JSON metadata to MQTT or Kafka message brokers or files.
gvamotiondetect	Performs lightweight motion detection on NV12 video frames and emits motion regions of interest (ROIs) as analytics metadata.
gvapython	Provides a callback to execute user-defined Python functions on every frame. Can be used for metadata conversion, inference post-processing, and other tasks.
gvarealsense	Provides integration with Intel RealSense cameras, enabling video and depth stream capture for use in GStreamer pipelines.
gvatrack	Performs object tracking using zero-term or imageless tracking algorithms. Assigns unique object IDs to the tracked objects.
gvawatermark	Overlays the metadata on the video frame to visualize the inference results.
g3dradarprocess	Processes 3D radar data for use in analytics pipelines.
g3dlidarparse	Parses 3D lidar data for use in analytics pipelines.
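
As an illustration of how the elements above combine, the following minimal Python sketch assembles a gst-launch style pipeline description that chains detection, tracking, overlay, metadata conversion, and publishing. The helper name build_pipeline and the file paths are hypothetical placeholders, and the property values shown (for example tracking-type=zero-term-imageless and method=file) should be checked against the element reference for the installed version.

```python
def build_pipeline(video_path, model_path, device="CPU", output_json="results.jsonl"):
    """Return a gst-launch style description of a detect -> track -> publish pipeline.

    All arguments are placeholders supplied by the caller; nothing here
    touches GStreamer, the function only formats a string.
    """
    elements = [
        f"filesrc location={video_path}",          # read a media file
        "decodebin3",                              # let GStreamer pick a decoder
        f"gvadetect model={model_path} device={device}",
        "queue",
        "gvatrack tracking-type=zero-term-imageless",
        "gvawatermark",                            # draw results on the frame
        "gvametaconvert format=json",              # metadata -> JSON
        f"gvametapublish method=file file-format=json-lines file-path={output_json}",
        "gvafpscounter",
        "fakesink sync=false",
    ]
    return " ! ".join(elements)


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    print(build_pipeline("input.mp4", "model.xml", device="GPU"))
```

In an environment with DL Streamer installed, the resulting string could be passed to gst-launch-1.0 on the command line or to Gst.parse_launch from the GStreamer Python bindings.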

For details on supported platforms, refer to System Requirements. To install Pipeline Framework from prebuilt binaries or Docker*, or to build the binaries from source, refer to the Intel® DL Streamer Pipeline Framework installation guide.

New in this Release

Title	High-level description
3D elements (g3dradarprocess, g3dlidarparse)	New 3D plugin support with g3dradarprocess element for radar data processing and g3dlidarparse element for lidar data parsing, enabling 3D analytics pipelines.
FPS throttle element (gvafpsthrottle)	New element to throttle the frame rate of a pipeline to a specified FPS value.
YOLOv26 model support	Added converters and post-processing for YOLOv26 models, including oriented bounding box (OBB) support and INT8 GPU inference. Added YOLOv26 to supported models in samples.
RT-DETR model support	Added RT-DETR support implementation for real-time detection transformer models.
HuggingFace ViT classifier support	Added HuggingFace Vision Transformer (ViT) classifier config parser for inference.
Batch timeout for inference elements	Added batch-timeout parameter to inference elements, allowing control over batching wait time.
VLM Alerts sample	New Python sample for VLM-based alerts that displays results on the output video.
Smart NVR sample	New Python sample for Smart NVR with added custom analytics logic (gvaAnalytics) and custom storage (gvaRecorder) elements.
ONVIF Camera Discovery sample	New Python sample demonstrating ONVIF camera discovery and DL Streamer pipeline launcher.
Face detection & age classification sample	New Python sample for face detection and age classification using HuggingFace models.
Open-vocabulary object detection sample	New Python sample with open-vocabulary prompt for object detection.
DL Streamer + DeepStream coexistence sample	New sample demonstrating DL Streamer and DeepStream working in one system.
Motion detect sample (Windows)	New sample demonstrating DL Streamer gvamotiondetect functionality.
RealSense element usage sample	New sample demonstrating gvarealsense element usage.
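
The batch-timeout entry above can be pictured with a small simulation. These notes do not specify the parameter's units or exact semantics, so the sketch below only illustrates the general idea of timeout-bounded batching and is not the element's implementation: a batch is dispatched once it is full, or once a new frame arrives more than the timeout after the batch's first frame. The function name and arguments are hypothetical.

```python
def batch_with_timeout(arrivals, batch_size, timeout):
    """Group frame arrival times (seconds, ascending) into batches.

    A batch is closed when it reaches `batch_size` frames, or when the next
    frame arrives more than `timeout` seconds after the batch's first frame.
    A real element would use a timer instead of waiting for the next frame;
    this simplification keeps the sketch deterministic.
    """
    batches, current = [], []
    for t in arrivals:
        # Timeout check: the pending batch has waited too long, dispatch it partially filled.
        if current and t - current[0] > timeout:
            batches.append(current)
            current = []
        current.append(t)
        # Size check: the batch is full, dispatch it immediately.
        if len(current) == batch_size:
            batches.append(current)
            current = []
    if current:
        batches.append(current)  # flush whatever is left at end of stream
    return batches


if __name__ == "__main__":
    arrivals = [0.00, 0.01, 0.02, 0.50, 0.51]
    print(batch_with_timeout(arrivals, batch_size=4, timeout=0.1))
```

Without a timeout, the three early frames in this example would wait for a fourth frame before inference could start; with it, they are dispatched as a partial batch.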

Updated in this Release

Title	High-level description
gvawatermark enhancements	Major enhancements to the gvawatermark element: display configuration options (thickness, color index), text background support, inclusive/exclusive label filtering, additional font support, average FPS info overlay, and visual documentation.
DLS Optimizer enhancements	Optimizer refactored with multi-stream optimization, cross-stream batching, improved FPS reporting, and device selection improvements.
gvametaconvert enhancements	Added extraction of the reference NTP timestamp from RTCP sender meta to the gvametaconvert element.
ROI object construction enhancement	For existing GstAnalyticsODMtd only, creates GstVideoRegionOfInterestMeta until full GstAnalytics migration.
Latency tracer multi-source/sink support	Extended latency_tracer to support multiple sources and multiple sinks.
Detection anomaly converter	Refactored and enhanced anomaly logic in DetectionAnomalyConverter.
FP32 precision in BoxesLabelsConverter	Added FP32 precision support in BoxesLabelsConverter label parsing.
Bounding box validation	Added extra validation of bounding boxes to improve robustness.
OpenCV tensor compression for all devices	Use OpenCV tensor compression for all inference devices, yielding best performance across CPU/GPU/NPU.
Model API refactoring	Moved Model API parser to separate files; added conversion from Ultralytics and HuggingFace metadata to Model API.
Python samples overview	Added overview section for Python samples; updated READMEs.
Tests	Expanded coverage of functional and unit tests.
Windows: GPU inference with D3D11	Added support for GPU inference on Windows using D3D11.
Windows: gvapython support	Added Windows support for gvapython element and gstgva Python bindings.
Windows: enhanced build & setup	Enhanced Windows build/setup scripts, added remove script, Visual C++ runtime handling, and JSON output for Windows samples.
Windows: CI integration	Enabled Windows tests in the GitHub Actions workflow and model downloads on Windows.
Library consolidation	Merged gvawatermark3d, gvadeskew, gvamotiondetect, and gvagenai into the gstvideoanalytics library.
OpenVINO update	Update to OpenVINO 2026.0.0.
NPU driver update	Update to NPU driver version 1.30.
RealSense update	Update to Intel RealSense SDK 2.57.5.
Model download script improvements	Simplified YOLO model download script, enhanced INT8 quantization, refactored YOLOv8+ export/quantize, added model validation.
CI: Zizmor security scanning	Added Zizmor GitHub Actions security scanner.
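
For readers new to gvapython, which this release also enables on Windows, a callback module might look like the minimal sketch below. It assumes the frame object exposes regions() and that each region exposes label() and confidence(), as in the gstgva Python bindings; the helper summarize_regions and the 0.5 threshold are hypothetical, and the metadata API may change as the GstAnalytics migration proceeds.

```python
from collections import Counter


def summarize_regions(frame, min_confidence=0.5):
    """Count detected labels on one frame, ignoring low-confidence regions.

    `frame` is expected to behave like a gstgva VideoFrame: regions() yields
    objects with label() and confidence() methods (an assumption of this sketch).
    """
    counts = Counter()
    for roi in frame.regions():
        if roi.confidence() >= min_confidence:
            counts[roi.label()] += 1
    return dict(counts)


def process_frame(frame):
    """Entry point called once per frame when the module is loaded by gvapython."""
    summary = summarize_regions(frame)
    if summary:
        print(summary)
    # Returning True is intended to keep the frame flowing downstream.
    return True
```

Such a module would typically be referenced from a pipeline with something like gvapython module=<path to this file> function=process_frame, placed after an inference element.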

Fixed

#	Issue Description
1	Fixed YOLO26 model inference on GPU FP16/FP32.
2	Fixed threshold parameter in gvadetect not working with PDD model.
3	Fixed yolov8-seg inference results differing from OpenVINO.
4	Fixed gvapython failing to read yolo-pose keypoint metadata.
5	Fixed NV12 frame data in Python by removing padding correctly.
6	Fixed watermark default text background behaviour.
7	Fixed check for pad_value in model XML file.
8	Fixed yolo_v10.cpp compile error on Windows.
9	Fixed DLL output paths on Windows.
10	Fixed compilation warnings on Windows.
11	Fixed timestamp on VS 2026.
12	Fixed GStreamer downloader by adding UserAgent.
13	Fixed libva path setup in setup_dls_env.ps1.
14	Removed libva dependency for monolithic elements on Windows.
15	Fixed latency tracer for smart intersection pipelines.
16	Fixed environment variable paths in Ubuntu install guide.
17	Fixed directory already exists error during build.
18	Removed duplicate gvametapublish element register.
19	Reverted RTP timestamp feature due to issues.
20	Fixed versions of NumPy, ONNX, and Seaborn in the public models download script.
21	Fixed missing context in Build Docker instruction.
22	Fixed formatting in installation guide and developer guide documentation.
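
Fix 5 concerns row padding in NV12 frames exposed to Python. These notes do not describe the actual change, so the sketch below only illustrates what removing padding means in general: each row in memory occupies stride bytes, of which only the first width bytes carry pixels. It assumes the interleaved UV plane directly follows the Y plane and shares its stride; real buffers may report per-plane offsets and strides that should be used instead. The function name is hypothetical.

```python
def strip_nv12_padding(data, width, height, stride):
    """Return tightly packed NV12 bytes from a buffer with padded rows.

    NV12 stores `height` rows of Y samples followed by `height // 2` rows of
    interleaved U/V samples; every row carries `width` meaningful bytes.
    Assumes both planes are contiguous and use the same `stride`.
    """
    if width % 2 or height % 2:
        raise ValueError("NV12 requires even width and height")
    if stride < width:
        raise ValueError("stride must be at least width")
    rows = height + height // 2  # Y rows plus UV rows
    if len(data) < rows * stride:
        raise ValueError("buffer is smaller than the described frame")

    packed = bytearray()
    for row in range(rows):
        start = row * stride
        packed += data[start:start + width]  # keep pixels, drop the padding tail
    return bytes(packed)


if __name__ == "__main__":
    # 2x2 frame, stride 4: two padded Y rows followed by one padded UV row.
    padded = bytes([1, 2, 0, 0, 3, 4, 0, 0, 5, 6, 0, 0])
    print(list(strip_nv12_padding(padded, width=2, height=2, stride=4)))
```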

Known Issues

Issue	Issue Description
Preview Architecture 2.0 Samples	Preview Arch 2.0 samples have known issues with inference results.

Legacy Features and Deprecation Timeline

List of the features and components to be deprecated in the future.

Feature	End of Support Date	Replacement strategy
OMZ models	end of Q3’2026	Use HuggingFace, Ultralytics, TIMM
Model-proc-file	end of Q3’2026	ModelAPI (common with Geti)
GstVideoRegionOfInterest meta + Intel extensions	end of Q4’2026	usage of GstAnalyticsMtd
Architecture 2.0 elements	end of Q4’2026	Preserve essential components and ideas
FFmpeg integration samples	end of Q2’2026	no replacement

Legal Information

GStreamer is an open source framework licensed under LGPL. See https://gstreamer.freedesktop.org/documentation/frequently-asked-questions/licensing.html. You are solely responsible for determining if your use of GStreamer requires any additional licenses. Intel is not responsible for obtaining any such licenses, nor liable for any licensing fees due, in connection with your use of GStreamer.
FFmpeg is an open source project licensed under LGPL and GPL. See https://www.ffmpeg.org/legal.html. You are solely responsible for determining if your use of FFmpeg requires any additional licenses. Intel is not responsible for obtaining any such licenses, nor liable for any licensing fees due, in connection with your use of FFmpeg.
