Take-Away Order Accuracy: Release Notes#
Version history and changelog for Take-Away Order Accuracy.
Version 2026.0.0 (March 2026)#
General Availability Release
This is the first GA release of Take-Away Order Accuracy, promoted from 2026.0-rc2 with no code changes. All functionality is identical to 2026.0-rc2.
Published Images#
Image |
Tag |
|---|---|
|
|
|
|
|
|
|
|
Version 2026.0-rc2 (March 2026)#
What’s New#
OVMS export scripts updated to OVMS 2026.0 release branch (
releases/2026/0);openvinoandopenvino-tokenizersupdated to2026.0.0rc3YOLO model download added to
setup_models.sh— YOLO models are now downloaded automatically during setupParallel mode VLM scheduler improvements to the
VLMSchedulerbatching logicFrame selector fix — corrected frame selection logic in
frame-selector-serviceOrder recall in Gradio UI — added order recall/replay functionality
RTSP streaming fix — resolved RTSP stream connection issues in the Gradio UI
FastAPI and Starlette version update in the Gradio UI image for security/compatibility
Benchmark duration increased (
BENCHMARK_DURATIONdefault raised)setup_models.shsimplified — script restructured for clarity
Published Images#
Image |
Tag |
|---|---|
|
|
|
|
|
|
|
|
rtsp-streamerimage tag remains2026.0-rc1— no changes in this release.
Version 2026.0-rc1 (March 2026)#
Initial Release Candidate
Highlights#
AI-Powered Order Validation: Real-time take-away order verification using Qwen2.5-VL-7B Vision Language Model
Multi-Station Parallel Processing: Concurrent order validation across multiple stations via RTSP streams
Intelligent Frame Selection: YOLO11-based frame selection with OpenVINO INT8 inference for optimal VLM input
Semantic Matching: Hybrid exact/semantic item matching via dedicated microservice
Docker Registry Support: Pre-built images published to
intel/Docker Hub namespaceStream Density Benchmarking: Automated latency-based stream density testing
Published Images#
Image |
Tag |
Size |
|---|---|---|
|
|
9.64GB |
|
|
1.96GB |
|
|
1.3GB |
|
|
227MB |
Features#
Core Functionality#
Dual Service Mode: Single worker mode for development, parallel worker mode for production
VLM Integration: Qwen2.5-VL-7B-Instruct via OpenVINO Model Server (OVMS) with GPU acceleration
Video Processing: GStreamer-based pipeline with RTSP support and configurable FPS
Frame Selection: YOLO11 nano model with OpenVINO INT8 inference for hand/object detection and frame filtering
Semantic Matching: Hybrid exact/semantic item matching with configurable similarity threshold
EasyOCR Integration: Order number detection from video frames
Architecture#
Station Workers: Production-ready multi-process workers with per-station isolation
VLM Scheduler: Time-window batching for throughput optimization
2PC Pipeline Sync: Two-phase commit synchronization between RTSP streamer and processing pipelines
Circuit Breaker: Resilient RTSP connectivity with auto-recovery
Exponential Backoff: Configurable retry with jitter for transient failures
User Interface#
Gradio UI: Web-based interface for video upload and order validation
REST API: FastAPI-based endpoints with OpenAPI documentation
MinIO Integration: S3-compatible storage for frames and results
Build & Deployment#
Registry Mode:
make buildpulls pre-built images from Docker HubLocal Build Mode:
make build REGISTRY=falsebuilds all images locally from sourceOVMS Auto-Config:
graph.pbtxtauto-generated fromconfig.jsongraph_options for tester-friendly tuningModel Setup Script:
setup_models.shhandles VLM model download, EasyOCR model download, and graph configuration
Benchmarking#
Stream Density Test:
make benchmark-stream-density— automated latency-based stream scalingFixed Workers Benchmark:
make benchmark-oa— throughput testing with configurable workersSingle Video Benchmark:
make benchmark— end-to-end latency testingVLM Metrics Logger: Detailed performance metrics collection and consolidation
Components#
Component |
Image |
Description |
|---|---|---|
Order Accuracy Service |
|
Core orchestration, GStreamer pipelines, VLM scheduling |
Frame Selector |
|
YOLO11 OpenVINO INT8 frame selection |
Gradio UI |
|
Web interface for order validation |
RTSP Streamer |
|
Video-to-RTSP stream conversion with 2PC sync |
OVMS VLM |
|
Qwen2.5-VL-7B model serving |
Semantic Service |
|
Semantic text matching microservice |
MinIO |
|
S3-compatible object storage |
Configuration Defaults#
Variable |
Default |
Description |
|---|---|---|
|
|
Target latency threshold (25s) |
|
|
Warmup time (seconds) |
|
|
Frames before OCR ready signal |
|
|
Pull from registry; set |
Known Issues#
RTSP Reconnection Delay: Initial RTSP connection may take 5-10 seconds
Large Video Upload: Videos >500MB may timeout on slow connections
Order Accuracy Image Size: 9.64GB due to torch+CUDA dependencies (required by EasyOCR)