Release Notes: Deep Learning Streamer (DL Streamer) Pipeline Framework Release 2026.0

Version 2026.0

Key highlights:

  • New elements: gvafpsthrottle, g3dradarprocess, g3dlidarparse

  • New model support: YOLOv26, YOLO-E, RT-DETR, HuggingFace ViT

  • Streamlined integration with Ultralytics and HuggingFace model hubs

  • GstAnalytics metadata support: DL Streamer supports GstAnalytics metadata for object detection, classification, and tracking, and adds a custom GstAnalytics extension for keypoints

  • gvawatermark overhaul: object blurring, text backgrounds, label filtering, extra fonts, thickness/color options, FPS overlay

  • Inference enhancements: batch timeout, OpenCV tensor compression for all devices

  • Windows platform: GPU inference via D3D11, gvapython support, CI integration, build/setup improvements

  • New Python samples: VLM Alerts, Smart NVR, ONVIF Discovery, face detection/age classification, open-vocabulary detection, RealSense, DL Streamer + DeepStream

  • Optimizer: multi-stream optimization, cross-stream batching, device selection, refactored with tests

  • Component updates: OpenVINO 2026.0.0, NPU driver 1.30, RealSense SDK 2.57.5

  • Library consolidation: merged gvawatermark3d, gvadeskew, gvamotiondetect, gvagenai into gstvideoanalytics

  • CI: Zizmor security scanning, Windows CI, Docker image size checks

Deep Learning Streamer (DL Streamer) Pipeline Framework is a streaming media analytics framework, based on the GStreamer* multimedia framework, for creating complex media analytics pipelines. It ensures pipeline interoperability and provides optimized media and inference operations using the Intel® Distribution of OpenVINO™ Toolkit Inference Engine backend across Intel® architecture: CPU, discrete GPU, integrated GPU, and NPU. The complete solution leverages:

  • Open source GStreamer* framework for pipeline management

  • GStreamer* plugins for input and output such as media files and real-time streaming from camera or network

  • Video decode and encode plugins, either CPU-optimized or GPU-accelerated (based on VAAPI)

  • Deep learning models converted from training frameworks such as TensorFlow* and Caffe*

  • The following elements in the Pipeline Framework repository:

    | Element | Description |
    | --- | --- |
    | gvaattachroi | Adds user-defined regions of interest to perform inference on, instead of the full frame. |
    | gvaaudiodetect | Performs audio event detection using the AclNet model. |
    | gvaaudiotranscribe | Performs audio transcription using the OpenVINO™ GenAI Whisper model. |
    | gvaclassify | Performs object classification. Accepts the ROI as an input and outputs classification results with the ROI metadata. |
    | gvadetect | Performs object detection on a full frame or region of interest (ROI) using object detection models such as YOLOv4-v11, MobileNet SSD, Faster-RCNN, etc. Outputs the ROI for detected objects. |
    | gvafpscounter | Measures frames per second across multiple streams in a single process. |
    | gvafpsthrottle | Throttles the frame rate of a pipeline to a specified FPS value. |
    | gvagenai | Performs inference with Vision Language Models using OpenVINO™ GenAI. Accepts video and a text prompt as input and outputs a text description. It can be used to generate text summaries of video. |
    | gvainference | Runs deep learning inference on a full frame or ROI using any model with an RGB or BGR input. |
    | gvametaaggregate | Aggregates inference results from multiple pipeline branches. |
    | gvametaconvert | Converts the metadata structure to the JSON format. |
    | gvametapublish | Publishes the JSON metadata to MQTT or Kafka message brokers or to files. |
    | gvamotiondetect | Performs lightweight motion detection on NV12 video frames and emits motion regions of interest (ROIs) as analytics metadata. |
    | gvapython | Provides a callback to execute user-defined Python functions on every frame. Can be used for metadata conversion, inference post-processing, and other tasks. |
    | gvarealsense | Provides integration with Intel RealSense cameras, enabling video and depth stream capture for use in GStreamer pipelines. |
    | gvatrack | Performs object tracking using zero-term or imageless tracking algorithms. Assigns unique object IDs to the tracked objects. |
    | gvawatermark | Overlays the metadata on the video frame to visualize the inference results. |
    | g3dradarprocess | Processes 3D radar data for use in analytics pipelines. |
    | g3dlidarparse | Parses 3D lidar data for use in analytics pipelines. |
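
To show how the elements listed above typically chain together, here is a minimal Python sketch that assembles a GStreamer pipeline description string (detect, classify, track, overlay, convert to JSON, publish to a file). The helper name `build_pipeline`, the file names, and the chosen property values are illustrative assumptions and do not come from these release notes; confirm each element's properties with `gst-inspect-1.0` before relying on them.

```python
def build_pipeline(video, detect_model, classify_model, out_json, device="CPU"):
    """Return a gst-launch style pipeline description (hypothetical helper).

    ASSUMPTION: the property names used here (model, device, tracking-type,
    format, method, file-path) should be verified with gst-inspect-1.0
    for the installed DL Streamer version.
    """
    stages = [
        f"filesrc location={video}",
        "decodebin3",
        # Full-frame object detection; emits ROIs for detected objects.
        f"gvadetect model={detect_model} device={device}",
        # Classification on the ROIs produced by gvadetect.
        f"gvaclassify model={classify_model} device={device}",
        # Assign persistent object IDs without re-running detection.
        "gvatrack tracking-type=zero-term-imageless",
        # Draw boxes and labels on the frame.
        "gvawatermark",
        # Serialize metadata to JSON and write it to a file.
        "gvametaconvert format=json",
        f"gvametapublish method=file file-path={out_json}",
        "fakesink sync=false",
    ]
    return " ! ".join(stages)


if __name__ == "__main__":
    print(build_pipeline("input.mp4", "detector.xml", "classifier.xml", "results.json"))
```

The resulting string can be passed to `gst-launch-1.0` on the command line or to `Gst.parse_launch` from the GStreamer Python bindings.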

For details on supported platforms, refer to System Requirements. To install Pipeline Framework from prebuilt binaries or Docker* images, or to build it from source, refer to the Intel® DL Streamer Pipeline Framework installation guide.

New in this Release

| Title | High-level description |
| --- | --- |
| 3D elements (g3dradarprocess, g3dlidarparse) | New 3D plugin support with the g3dradarprocess element for radar data processing and the g3dlidarparse element for lidar data parsing, enabling 3D analytics pipelines. |
| FPS throttle element (gvafpsthrottle) | New element to throttle the frame rate of a pipeline to a specified FPS value. |
| YOLOv26 model support | Added converters and post-processing for YOLOv26 models, including oriented bounding box (OBB) support and INT8 GPU inference. Added YOLOv26 to supported models in samples. |
| RT-DETR model support | Added support for RT-DETR (real-time detection transformer) models. |
| HuggingFace ViT classifier support | Added HuggingFace Vision Transformer (ViT) classifier config parser for inference. |
| Batch timeout for inference elements | Added batch-timeout parameter to inference elements, allowing control over batching wait time. |
| VLM Alerts sample | New Python sample for VLM-based alerts that displays results on the output video. |
| Smart NVR sample | New Python sample for Smart NVR with added custom analytics logic (gvaAnalytics) and custom storage (gvaRecorder) elements. |
| ONVIF Camera Discovery sample | New Python sample demonstrating ONVIF camera discovery and a DL Streamer pipeline launcher. |
| Face detection & age classification sample | New Python sample for face detection and age classification using HuggingFace models. |
| Open-vocabulary object detection sample | New Python sample for object detection using an open-vocabulary prompt. |
| DL Streamer + DeepStream coexistence sample | New sample demonstrating DL Streamer and DeepStream working in one system. |
| Motion detect sample (Windows) | New sample demonstrating DL Streamer gvamotiondetect functionality. |
| RealSense element usage sample | New sample demonstrating gvarealsense element usage. |
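
The gvafpsthrottle and batch-timeout entries above are the two additions that most directly change how a pipeline is written. The following Python sketch shows where they might sit in a pipeline description. It rests on several assumptions: the throttle property name `target-fps` is a guess that is not confirmed by these notes, the unit of `batch-timeout` is not stated here, and the helper name and file paths are hypothetical. Run `gst-inspect-1.0 gvafpsthrottle` and `gst-inspect-1.0 gvadetect` to check the real property names and units.

```python
# ASSUMPTION: the throttle property name is not given in the release notes.
THROTTLE_PROPERTY = "target-fps"
# Named in the release notes; its unit is NOT stated there.
BATCH_TIMEOUT_PROPERTY = "batch-timeout"


def build_throttled_pipeline(uri, model, fps, batch_size, batch_timeout, device="GPU"):
    """Return a pipeline description that throttles input and batches inference."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    stages = [
        f"uridecodebin3 uri={uri}",
        # Limit the frame rate before inference to cap device load.
        f"gvafpsthrottle {THROTTLE_PROPERTY}={fps}",
        # Batch frames; the timeout bounds how long a partial batch waits.
        f"gvadetect model={model} device={device} "
        f"batch-size={batch_size} {BATCH_TIMEOUT_PROPERTY}={batch_timeout}",
        "gvafpscounter",
        "fakesink sync=false",
    ]
    return " ! ".join(stages)


if __name__ == "__main__":
    print(build_throttled_pipeline("file:///path/to/video.mp4", "yolo.xml", 15, 8, 100))
```

Throttling upstream of a batched inference element means a batch fills more slowly, which is the situation where a bounded wait for a partial batch matters most.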

Updated in this Release

| Title | High-level description |
| --- | --- |
| gvawatermark enhancements | Major enhancements to the gvawatermark element: display configuration options (thickness, color index), text background support, inclusive/exclusive label filtering, additional font support, average FPS info overlay, and visual documentation. |
| DLS Optimizer enhancements | Optimizer refactored with multi-stream optimization, cross-stream batching, improved FPS reporting, and device selection improvements. |
| gvametaconvert enhancements | Added extraction of the reference NTP timestamp from RTCP sender meta to the gvametaconvert element. |
| ROI object construction enhancement | For an existing GstAnalyticsODMtd only, creates GstVideoRegionOfInterestMeta until the full GstAnalytics migration. |
| Latency tracer multi-source/sink support | Extended latency_tracer to support multiple sources and multiple sinks. |
| Detection anomaly converter | Refactored and enhanced anomaly logic in DetectionAnomalyConverter. |
| FP32 precision in BoxesLabelsConverter | Added FP32 precision support in BoxesLabelsConverter label parsing. |
| Bounding box validation | Added extra validation of bounding boxes to improve robustness. |
| OpenCV tensor compression for all devices | Uses OpenCV tensor compression for all inference devices, yielding best performance across CPU/GPU/NPU. |
| Model API refactoring | Moved Model API parser to separate files; added conversion from Ultralytics and HuggingFace metadata to Model API. |
| Python samples overview | Added overview section for Python samples; updated READMEs. |
| Tests | Expanded coverage of functional and unit tests. |
| Windows: GPU inference with D3D11 | Added support for GPU inference on Windows using D3D11. |
| Windows: gvapython support | Added Windows support for the gvapython element and gstgva Python bindings. |
| Windows: enhanced build & setup | Enhanced Windows build/setup scripts, added remove script, Visual C++ runtime handling, and JSON output for Windows samples. |
| Windows: CI integration | Enabled Windows tests in the GitHub Actions workflow and model downloads on Windows. |
| Library consolidation | Merged gvawatermark3d, gvadeskew, gvamotiondetect, and gvagenai into the gstvideoanalytics library. |
| OpenVINO update | Updated to OpenVINO 2026.0.0. |
| NPU driver update | Updated to NPU driver version 1.30. |
| RealSense update | Updated to Intel RealSense SDK 2.57.5. |
| Model download script improvements | Simplified YOLO model download script, enhanced INT8 quantization, refactored YOLOv8+ export/quantize, added model validation. |
| CI: Zizmor security scanning | Added Zizmor GitHub Actions security scanner. |
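
Since gvapython and the gstgva bindings are now also available on Windows, a short reminder of what a gvapython callback looks like may be useful. The sketch below counts detections per label on each frame. It assumes the frame object exposes `regions()`, and that each region exposes `label()` and `confidence()`, as in the gstgva `VideoFrame` and `RegionOfInterest` classes; the threshold value, the helper name `count_labels`, and the module file name in the usage note are hypothetical.

```python
from collections import Counter

MIN_CONFIDENCE = 0.5  # hypothetical threshold chosen for this sketch


def count_labels(frame, min_confidence=MIN_CONFIDENCE):
    """Count detected objects per label, skipping low-confidence regions.

    ASSUMPTION: `frame` behaves like gstgva.VideoFrame, i.e. regions()
    yields objects with label() and confidence() methods.
    """
    counts = Counter()
    for region in frame.regions():
        if region.confidence() >= min_confidence:
            counts[region.label()] += 1
    return counts


def process_frame(frame):
    """Entry point invoked by gvapython for every frame."""
    counts = count_labels(frame)
    if counts:
        print(", ".join(f"{label}: {n}" for label, n in sorted(counts.items())))
    return True  # True passes the frame on to the next element
```

Saved as, for example, `count_labels.py`, the callback could be attached after a detection element with `gvapython module=count_labels.py function=process_frame`.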

Fixed

| # | Issue Description |
| --- | --- |
| 1 | Fixed YOLOv26 model inference on GPU FP16/FP32. |
| 2 | Fixed threshold parameter in gvadetect not working with PDD model. |
| 3 | Fixed yolov8-seg inference results differing from OpenVINO. |
| 4 | Fixed gvapython failing to read yolo-pose keypoint metadata. |
| 5 | Fixed NV12 frame data in Python by removing padding correctly. |
| 6 | Fixed watermark default text background behaviour. |
| 7 | Fixed check for pad_value in model XML file. |
| 8 | Fixed yolo_v10.cpp compile error on Windows. |
| 9 | Fixed DLL output paths on Windows. |
| 10 | Fixed compilation warnings on Windows. |
| 11 | Fixed timestamp on VS 2026. |
| 12 | Fixed GStreamer downloader by adding UserAgent. |
| 13 | Fixed libva path setup in setup_dls_env.ps1. |
| 14 | Removed libva dependency for monolithic elements on Windows. |
| 15 | Fixed latency tracer for smart intersection pipelines. |
| 16 | Fixed environment variable paths in Ubuntu install guide. |
| 17 | Fixed "directory already exists" error during build. |
| 18 | Removed duplicate gvametapublish element registration. |
| 19 | Reverted RTP timestamp feature due to issues. |
| 20 | Fixed NumPy, ONNX, and Seaborn versions in the public models download script. |
| 21 | Fixed missing context in the Build Docker instruction. |
| 22 | Fixed formatting in installation guide and developer guide documentation. |
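
Fix 5 concerns row padding in NV12 frames. As background, the sketch below illustrates the general idea of stripping stride padding from an NV12 buffer in pure Python. It is not the code used in DL Streamer. It assumes the chroma plane starts immediately after the luma plane and shares the same stride; real buffers may use per-plane offsets and strides, which should be read from the video metadata instead. The function name is hypothetical.

```python
def strip_nv12_padding(data: bytes, width: int, height: int, stride: int) -> bytes:
    """Return tightly packed NV12 data with per-row padding removed.

    NV12 layout: a full-resolution Y plane (height rows) followed by an
    interleaved UV plane (height // 2 rows), each row holding `width`
    meaningful bytes out of `stride` allocated bytes.

    ASSUMPTION: both planes share one stride and the UV plane begins at
    offset stride * height.
    """
    if width % 2 or height % 2:
        raise ValueError("NV12 requires even width and height")
    if stride < width:
        raise ValueError("stride cannot be smaller than width")
    rows = height + height // 2  # Y rows plus interleaved UV rows
    if len(data) < rows * stride:
        raise ValueError("buffer is too small for the given geometry")
    # Keep the first `width` bytes of every row and drop the rest.
    return b"".join(data[r * stride : r * stride + width] for r in range(rows))
```

For a frame with no padding (stride equal to width) the function returns the first `width * height * 3 // 2` bytes unchanged, which makes it safe to call unconditionally.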

Known Issues

| Issue | Issue Description |
| --- | --- |
| Preview Architecture 2.0 Samples | Preview Arch 2.0 samples have known issues with inference results. |

Legacy Features and Deprecation Timeline

List of the features and components to be deprecated in the future.

| Feature | End of Support Date | Replacement strategy |
| --- | --- | --- |
| OMZ models | end of Q3’2026 | Use HuggingFace, Ultralytics, TIMM |
| Model-proc-file | end of Q3’2026 | Model API (common with Geti) |
| WSL support | end of Q3’2026 | Native Windows support |
| GstVideoRegionOfInterest meta + Intel extensions | end of Q4’2026 | Use GstAnalyticsMtd |
| Architecture 2.0 elements | end of Q4’2026 | Preserve essential components and ideas |
| FFmpeg integration samples | end of Q2’2026 | No replacement |