System Requirements#
This page provides detailed hardware, software, platform requirements, and supported models to help you set up and run the application efficiently.
Software and Hardware Requirements#
OS: Windows 11
Processor: Intel® Core Ultra Series 1 (with integrated GPU support)
Memory: 32 GB RAM (minimum recommended)
Storage: At least 50 GB free (for models and logs)
GPU/Accelerator: Intel® iGPU (Core Ultra Series 1, Arc GPU, or higher) for summarization acceleration
Python: 3.12 or above
Node.js: v18+ (for frontend)
Supported Models#
ASR (Automatic Speech Recognition)#
Whisper (all models supported)
Recommended:
whisper-small
or lower for CPU efficiencyRuns on CPU (Whisper is CPU-centric)
FunASR (Paraformer)
Recommended for Chinese transcription (
paraformer-zh
)
Supports transcription of audio files up to 45 minutes
Summarization (LLMs)#
Qwen Models (OpenVINO / IPEX)
Qwen2.0-7B-Instruct
Qwen2.5-7B-Instruct
Summarization supports up to 7,500 tokens (≈ 45 minutes of audio) on GPU
Supported Weight Formats#
int8 → Recommended for lower-end CPUs (fast + efficient)
fp16 → Recommended for higher-end systems (better accuracy, GPU acceleration)
int4 → Supported, but may reduce accuracy (use only if memory-constrained)
Run summarization on GPU (Intel® iGPU / Arc GPU) for faster performance.