Release Notes#
Version 2026.0.0 (March 2026)#
General Availability Release
Promoted from 2026.0-rc2 with no code changes. All functionality is identical to 2026.0-rc2.
Published Images#
Image |
Tag |
|---|---|
|
|
Version 2026.0-rc2 (March 2026)#
What’s New#
Image tag updated to
2026.0-rc2;TAGMakefile variable updated to2026.0-rc2FastAPI and Starlette version updates in the application image
setup_models.shupdated to OVMS 2026.0 export branch (releases/2026/0);openvinoandopenvino-tokenizersupdated to2026.0.0rc3init-envtarget added to Makefile for creating.envfrom.env.example
Published Images#
Image |
Tag |
|---|---|
|
|
Version 2026.0-rc1 (March 2026)#
New Features#
Flexible Image Deployment: New
REGISTRYflag for build and deployment controlmake build REGISTRY=false- Build image locally from sourcemake build- Pull pre-built image from registry (default)make up REGISTRY=false- Run with locally built imagemake up- Run with registry image (default)
Docker Image Tagging: Standardized image naming
Image:
intel/order-accuracy-dine-in:2026.0-rc1Configurable via
TAGandDINEIN_IMAGE_NAMEvariables
Configuration Changes#
Variable |
Old Default |
New Default |
|---|---|---|
|
1.0.0 |
2026.0-rc1 |
|
false |
true |
|
- |
intel/order-accuracy-dine-in:2026.0-rc1 |
Migration Notes#
Update deployment scripts to use new
REGISTRYflagImage name changed from
dine-in-dine-intointel/order-accuracy-dine-inUse
REGISTRY=falsefor local development workflows
Version 2.0.0 (February 2026)#
New Features#
Circuit Breaker Pattern: Added fault tolerance for VLM and Semantic services
5 consecutive failures trigger circuit OPEN state
30s recovery timeout for VLM, 15s for Semantic service
Automatic recovery with half-open state testing
Connection Pooling: Shared HTTP clients with optimized settings
Up to 50 concurrent connections for VLM client
HTTP/2 support enabled for improved performance
Keepalive connections (30s expiry)
Bounded Validation Cache: LRU cache prevents memory exhaustion
Maximum 10,000 entries
Thread-safe operations
Automatic eviction of oldest entries
Image Preprocessing Pipeline: Optimized images for faster VLM inference
Smart resizing (672px max dimension)
Adaptive contrast enhancement
Light sharpening for food detail
JPEG compression (82% quality)
Stream Density Benchmark: New testing mode for concurrent validation
Automatic density scaling
Latency-based pass/fail criteria
Comprehensive results export (JSON/CSV)
Improvements#
Thread-safe Singleton: Config manager uses double-checked locking
Async Metrics Collection: Non-blocking system stats retrieval
Token Usage Logging: Detailed TPS and token metrics
Enhanced VLM Prompts: Inventory-aware prompts for better accuracy
Bug Fixes#
Fixed race condition in service initialization
Fixed unbounded Dict causing OOM under load
Fixed blocking
psutil.cpu_percent()callFixed sync HTTP in Gradio callback
Configuration Changes#
Variable |
Old Default |
New Default |
|---|---|---|
|
60 |
300 |
|
1000 |
2000 |
Dependencies#
Added:
httpx[http2]>=0.25.0(HTTP/2 support)Added:
aiofiles>=23.0.0(async file operations)Removed:
requests(replaced by httpx)
Known Issues#
GPU memory utilization metric always reports 0.0 (metrics collector limitation)
First
psutil.cpu_percent()call returns 0.0 (expected behavior)
Version 1.0.0 (January 2026)#
Initial Release#
Gradio UI for interactive validation
FastAPI REST endpoints
OVMS VLM integration (Qwen2.5-VL-7B)
Semantic matching service
Basic metrics collection
Docker Compose deployment
Features#
Single image validation
Batch validation endpoint
Order manifest comparison
Accuracy scoring
Performance metrics