Video Search and Summarization (VSS) Sample Application#

Use the Video Search and Summarization (VSS) sample application to search through your videos, summarize them, and more.

This foundational sample application provides three modes:

Mode

Use To

Capability

šŸ” Video Search (overview and architecture)

Find specific content within large video datasets through natural language.

Extract and index visual, audio, and textual features from video frames using the LangChain framework, multimodal embedding models, and agentic reasoning. Query using natural language or multi-modal models.

šŸ“ Video Summarization (overview and architecture)

Create concise summaries of long-form videos or live streams, automatically.

Improve searchability. Combine insights from different data types using Generative AI Vision Language Models (VLMs), computer vision, and audio analysis.

šŸ”— Combined Video Search and Summarization (overview and architecture)

Find specific content and create concise summaries of videos - ideal for a comprehensive video analysis.

Search quickly and directly over generated video summaries. Using the summary as a knowledge base makes the search results more relevant and accurate.

The detailed documentation to help you get started, configure, and deploy the sample application along with the required microservices are as follows.

Documentation#