Video Search and Summarization (VSS) Sample Application

Use the Video Search and Summarization (VSS) sample application to search through your videos, summarize them, and more.

This foundational sample application provides three modes:

Mode	Use To	Capability
🔍 Video Search (overview and architecture)	Find specific content within large video datasets through natural language.	Extract and index visual, audio, and textual features from video frames using the LangChain framework, multimodal embedding models, and agentic reasoning. Query using natural language or multimodal models.
📝 Video Summarization (overview and architecture)	Create concise summaries of long-form videos or live streams, automatically.	Improve searchability. Combine insights from different data types using Generative AI Vision Language Models (VLMs), computer vision, and audio analysis.
🔗 Combined Video Search and Summarization (overview and architecture)	Find specific content and create concise summaries of videos, which is ideal for comprehensive video analysis.	Search quickly and directly over generated video summaries. Using the summary as a knowledge base makes the search results more relevant and accurate.

The following documentation helps you get started with, configure, and deploy the sample application along with the required microservices.

Documentation

Get Started
- Get Started: How to get started with the sample application.
- System Requirements: What hardware and software you need to run the sample application.
Deployment
- How to Build from Source: How to build from source code.
- How to Deploy with Helm: How to deploy using the Helm chart.
API Reference
- API Reference: Comprehensive reference for the available REST API endpoints.
Release Notes
- Release Notes: Information on the latest updates, improvements, and bug fixes.