How to Build from Source#
This guide provides step-by-step instructions for building the Chat Question-and-Answer Core Sample Application from source.
If you want to build the microservice images locally, refer to the steps in the Building the Backend Image and Building the UI Image sections. These sections provide detailed instructions for building the Docker images for the backend and UI components of the Chat Question-and-Answer Core application separately.
If you want to build the images via docker compose, please refer to the section Build the Images via Docker Compose.
Once all the images are built, you can proceed to start the service using the docker compose command as described in the Get Started page.
Note:
The build instructions are applicable only to Ubuntu systems. Building from source is not supported for the sample application on Edge Microvisor Toolkit; Intel recommends using prebuilt images on Edge Microvisor Toolkit.
Building the Backend Image#
To build the Docker image for the Chat Question-and-Answer Core application, follow these steps:
Ensure you are in the project directory:
cd sample-applications/chat-question-and-answer-core
Build the Docker image using the provided Dockerfile, based on your setup and preferred model framework. Choose one of the following options:
OpenVINO™ toolkit framework
CPU-only inferencing (Default Configuration):
docker build -t chatqna:latest -f docker/Dockerfile .
This builds the image using the OpenVINO toolkit for CPU-based inferencing, suitable for hardware setups without GPU support.
GPU-enabled inferencing (Intel GPUs support):
docker build -t chatqna:latest --build-arg USE_GPU=true --build-arg GPU_TYPE=dgpu -f docker/Dockerfile .
This builds the image using the OpenVINO toolkit with additional GPU support for accelerated inferencing. It remains compatible with CPU-only systems, offering flexibility across different hardware setups. A quick check of the host's GPU devices is shown after the framework options below.
Ollama framework
CPU-only inferencing
docker build -t chatqna:latest -f docker/Dockerfile.ollama .
This builds the image with the Ollama framework for CPU-based inferencing. Currently, Ollama is enabled only for CPU-based inferencing in this sample application.
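If you plan to use the GPU-enabled OpenVINO build above, you may first want to confirm that the host exposes an Intel GPU render device. A minimal check, assuming the standard DRI device paths on Ubuntu, is:
# List DRI devices; Intel GPUs typically appear as /dev/dri/renderD128 or higher.
ls -l /dev/dri/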
Verify that the Docker image has been built successfully:
docker images | grep chatqna
You should see an entry for chatqna with the latest tag.
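For reference, the output should resemble the line below; the image ID, creation time, and size are illustrative and will differ on your system:
chatqna    latest    3f2a1b4c5d6e    2 minutes ago    4.2GB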
Building the UI Image#
To build the Docker image for the chatqna-ui application, follow these steps:
Ensure you are in the ui/ project directory:
cd sample-applications/chat-question-and-answer-core/ui
Build the Docker image using the provided Dockerfile:
docker build -t chatqna-ui:latest .
Verify that the Docker image has been built successfully:
docker images | grep chatqna-ui
You should see an entry for chatqna-ui with the latest tag.
Once you have verified that the image has been built successfully, navigate back to the chat-question-and-answer-core directory:
cd ..
Build the Images via Docker Compose#
This section explains how to build the images using the compose.yaml file via the docker compose command. It also outlines how to enable GPU support during the build process.
Ensure you are in the project directory:
cd sample-applications/chat-question-and-answer-core
Set Up Environment Variables:
Choose one of the following options based on your hardware setup; a quick check of the exported token is shown after these options:
OpenVINO™ toolkit framework
CPU-only setup (Default):
export HUGGINGFACEHUB_API_TOKEN=<your-huggingface-token>
source scripts/setup_env.sh
GPU-enabled setup:
export HUGGINGFACEHUB_API_TOKEN=<your-huggingface-token>
source scripts/setup_env.sh -d gpu
ℹ️ The -d gpu flag enables the GPU-DEVICE profile required for GPU-based execution.
Ollama framework
CPU-only setup (the only supported option for Ollama):
source scripts/setup_env.sh -b ollama
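If you used one of the OpenVINO toolkit options above, you can sanity-check that the token is visible in the current shell (setup_env.sh may export additional variables that are not checked here):
echo "$HUGGINGFACEHUB_API_TOKEN"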
Build the Docker images defined in the compose.yaml file:
docker compose -f docker/compose.yaml build
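If you only need to rebuild one of the images, docker compose can build a single service. The service name below is an assumption; check compose.yaml for the names actually defined in this project:
docker compose -f docker/compose.yaml build chatqna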
Verify that the Docker images have been built successfully:
docker images | grep chatqna
You should see entries for both chatqna and chatqna-ui.
Running the Application Container#
After building the images for the Chat Question-and-Answer Core application, you can run the application container using docker compose by following these steps:
Set Up Environment Variables:
Choose one of the following options based on your hardware setup:
OpenVINO™ toolkit framework
CPU-only setup (Default):
export HUGGINGFACEHUB_API_TOKEN=<your-huggingface-token>
source scripts/setup_env.sh
GPU-enabled setup:
export HUGGINGFACEHUB_API_TOKEN=<your-huggingface-token>
source scripts/setup_env.sh -d gpu
ℹ️ The -d gpu flag enables the GPU-DEVICE profile required for GPU-based execution.
Ollama framework
CPU-only setup (the only supported option for Ollama):
source scripts/setup_env.sh -b ollama
Configure the models to be used (LLM, Embeddings, Rerankers) through a YAML configuration file, as outlined in the Get-Started: Running The Application using Docker Compose section.
Refer to and use the same list of models for the OpenVINO toolkit framework as documented in Chat Question-and-Answer.
Start the Docker containers with the previously built images:
docker compose -f docker/compose.yaml up
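To run the containers in the background instead of attaching to their logs, add the -d (detached) flag:
docker compose -f docker/compose.yaml up -d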
Access the application:
Open your web browser and navigate to http://<host-ip>:8102 to view the application dashboard.
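As a quick connectivity check from the command line (assuming the same host and port as above), you can confirm that the UI responds before opening the browser; the exact status and headers depend on the web server serving the dashboard:
curl -I http://<host-ip>:8102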
Verification#
Ensure that the application is running by checking the Docker container status:
docker ps
Access the application dashboard and verify that it is functioning as expected.
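To narrow the listing to this application's containers, you can filter by name; the filter value below is an assumption, so adjust it to match the container names reported by docker ps:
docker ps --filter "name=chatqna"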
Troubleshooting#
If you encounter any issues during the build or run process, check the Docker logs for errors:
docker logs <container-id>
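If you started the application with Docker Compose, you can also stream the logs of all services at once, which is often the quickest way to spot a failing component:
docker compose -f docker/compose.yaml logs -f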