Architecture#
Trusted Compute is a set of software defined security extensions that utilize hardware security capabilities of the Edge Node within the Edge Manageability Framework.
Objective#
It enhances edge node protection through:
- Continuous Monitoring: Regularly checks the integrity of platform firmware, OS kernel, critical system components, and runtime environment through ongoing measurement and attestation. 
- Isolated Execution: Runs Confidential-workloads securely in a trusted VM environment, preventing unauthorized access or interference and safeguarding the host from the workloads.
- Confidential-workload A workload that handles sensitive or private data and requires protections from unauthorized access or tampering. 
 
 
- Security Objective: Trusted Compute ensures the integrity of code and data within container workloads running in a Trusted VM instance,
- protecting against attacks from unprivileged software, applications, and simple hardware adversaries. It prevents access to sensitive code and data during boot and runtime, detects tampering or unauthorized changes to platform software, and isolates the node from future workloads. 
 
Provision the Edge Node Cluster with Trusted Compute Extension#
- Selection: Users select the edge node to be Trusted Compute enabled during provisioning. 
- Provisioning: Trusted Compute extension artifacts are provisioned to the edge node during its setup. 
Platform Prerequisites for Enabling Trusted Compute#
- Secure Boot: For Trusted compute extensions to be deployed, Secure Boot must be enabled on the edge node. 
- Full Disk Encryption: Platform must also enable full disk encryption. 
 
Continuous Integrity Monitoring Mechanism#
Continuous monitoring involves three key pods:
- Trust Agent Pod 
- Verifier Pod 
- Attestation Manager Pod 
Trust Agent#
The Trust boot process establishes a chain of trust from the firmware to the bootloader, kernel, and userspace by measuring the integrity of each component. These measurements (hashes) are stored in the TPM’s platform configuration registers (PCR), ensuring the system’s trustworthiness. The Trust Agent on the node securely retrieves signed quotes of these measurements from the TPM for remote comparison by the verifier against expected golden values (attestation).
Verifier#
A verifier pod comprises four components:
- AAS (authentication and authorization service) 
- CMS (certificate management service) 
- HVS (host verification service) 
- Nats Server (Neural Autonomic Transport System) 
The scope of the verifier pod is to:
- Securely retrieve signed quotes from the trust agent pod for every attestation and verify against expected golden flavors and publish the trust report. 
- After every reboot of the edge node, the first TPM measurements are set as Golden flavors. 
Flavors#
Flavor creation is the process of adding one or more sets of acceptable golden measurements to the Verification Service database. These measurements correspond to specific system components and are used as the basis of comparison to generate trust attestations. Flavors are automatically matched to hosts based on the Flavor group used by the host and the Flavors, and the Flavor Match Policies of the Flavor group.
Trust Report Generated by Verifier#
Trust report is a JSON-formatted report includes a chain or signatures that provides auditability for the Report. The JSON attestation will include the base trust status of the host, as well as the overall trust for each individual Flavor used in the attestation. The Report will also contain host information, such as TPM version, Operating System name and version, BIOS version, etc
Attestation Manager#
The scope of the Attestation Manager pod is to:
- Set golden flavors with Verifier. 
- Set policy for attestation (cron job). 
- Verify if the attestation result is a pass or a fail. 
- Verify the base dependency (platform metadata), such as secure boot is turned on for every attestation report. 
- Update the Edge Infrastructure Manager Instance status when an attestation fails. 
- Cordon the node by calling Kubernetes* API server API when an attestation fails. 
How Often Should an Attestation Be Run?#
- Users can define a policy in Helm* charts using the “pollDuration:” field in the values.yaml file to specify how often the attestation should run. The default value is set to 60 minutes. Below is a snippet from values.yaml: 
- Every time a node platform reboots, the Trust Agent automatically initiates an attestation. env:pollDuration: “60” # duration in minutes
What happens when the Attestation fails?#
- The node can have one of the following statuses: Verified, Attestation Failed, or Secure Boot Disabled. 
- ** Attestation Failed or Secure Boot Disabled:** When a node is in either of these states, the Attestation Manager will place the node into a Cordon state.
- This prevents the Kubernetes orchestrator from deploying any future applications to the affected edge node. 
 
- ** Resolution and Onboarding:** Once the source of the issue is identified and resolved, the node can undergo a fresh onboarding process to restore its functionality and status. 
Trusted Workload Protection#
Trusted Compute allows orchestrating and executing a Confidential-workload within an isolated & trusted VM on the edge node. The security objectives of this approach are:
- Data confidentiality: Unauthorized entities cannot view data while it is in use within the VM. 
- Data integrity: Unauthorized entities cannot add, remove, or alter data while it is in use within the VM. 
- Code integrity: Unauthorized entities cannot add, remove, or alter code executing in the VM. 
The following sections describe the software architecture for protecting workloads using VM-based Isolated Execution Environments in a Kubernetes environment.
Workload (App & Package) Prerequisites for Trusted Compute#
- Build Container Image: For applications & packages (workloads) that require higher levels of isolation, add configuration to application’s Helm chart to change runtime for container execution. 
- Push to Registry: The user pushes the container image(s) to the workload registry. This is represented as Step 1 in the diagram. 
 
Deploying the Confidential-workload in Edge Orchestrator#
- Deploy Workload: The user deploys the workload using Edge Orchestrator. This is represented as Step 2 in the diagram. 
- Orchestration: Edge Orchestrator schedules the Confidential-workload to target nodes that are Trusted Compute ready. 
- Pull Trusted VM Image: The containerd runtime pulls the trusted VM image along with the workload and stores them on the node’s file storage.
- This is represented as Steps 3 & 4 in the diagram. 
 
- Launch Trusted VM: The trusted VM is launched and booted up. This is represented as Step 5 in the diagram. 
Workload Execution Flow#
- Bootstrap Workload: The Kata runtime shim communicates with the Kata agent within the trusted VM to bootstrap the workload from the mount point. This is represented as Steps 6, 7, and 8 in the diagram. 
- Launch Workload: The Kata agent launches the workload with the necessary resources. This is represented as Step 8 in the diagram.