Smart Kiosk Assistant#
This application is part of the Voice Enabled Interactions reference architecture for retail.
Smart Kiosk Assistant is a voice-first, retrieval-augmented kiosk stack for retail, Quick Service Restaurant (QSR), and similar customer-facing deployments. The browser captures microphone audio, the stack transcribes it, retrieves a grounded answer from a local knowledge base, and plays a synthesized reply. All inference runs locally on Intel CPU or GPU via OpenVINO.
Services#
Service |
Port |
Role |
|---|---|---|
|
8010 |
Speech-to-text (Whisper) |
|
8011 |
Speech synthesis (SpeechT5 / Qwen-TTS) |
|
8020 |
Knowledge-base retrieval and answer generation |
|
8012 |
FastAPI session orchestrator |
|
7860 |
Gradio browser interface |
audio-analyzer, text-to-speech, and rag-service host the inference
models. kiosk-core and kiosk-ui are I/O-only.