Secure AI Development with Confidential Computing

Quick Start Security Template

FAQ

A: Yes, but audit them first:

Q: How do I handle secrets during development vs production?

Get Started on Phala Cloud - Deploy your first secure AI application with built-in best practices.

Secure AI Development with Confidential Computing: Best Practices Guide

TL;DR:

Building secure AI applications with [Confidential Computing (TEE)](https://docs.phala.com/phala-cloud/security-and-privacy/security-architecture) requires more than just deployment — it demands a security-first architecture, robust secret management, and continuous [attestation](https://docs.phala.com/phala-cloud/attestation/overview).

This guide outlines best practices for developing production-grade Confidential AI on [Phala Cloud](https://docs.phala.com/phala-cloud/getting-started/overview), including defense-in-depth strategies to protect models, data, and the execution environment from insider threats or misconfigurations.

You’ll learn proven methods for architecting secure AI systems, hardening code and dependencies, and maintaining verifiable trust across every stage of development and deployment.

Introduction

Building confidential AI applications goes beyond simply deploying your code into a TEE. True security requires a comprehensive approach that addresses architecture design, code security, dependency management, secret handling, and continuous verification. A single misconfiguration can compromise the entire confidential computing guarantee.

This guide provides actionable best practices for developing secure AI applications in TEE environments, with specific focus on Phala Cloud deployment. You’ll learn how to architect applications with security-first principles, implement defense-in-depth strategies, and avoid common pitfalls that undermine confidential computing protections.

What you’ll learn:

Security-first architecture patterns for confidential AI
Code hardening and dependency security
Secure secret and credential management
Input validation and output sanitization
Continuous attestation and trust verification
Production deployment security checklist
Common vulnerabilities and how to prevent them

Why Secure Development Matters in TEE Environments

The TEE Security Model

TEEs provide hardware-enforced isolation and attestation, but they’re not a security panacea. Your application’s security depends on:

What TEE protects:

Memory encryption at rest and in transit
Code execution isolated from host OS
Cryptographic attestation of runtime state
Protection against physical attacks

What you must protect:

Application logic and code quality
Dependency security and supply chain
Secret management and key handling
Input validation and data sanitization
Side-channel attack mitigation
Deployment configuration security

The Cost of Security Failures

Real-world confidential computing vulnerabilities:

Vulnerability: Insecure dependency with supply chain attack
Impact: Malicious code injected into TEE application
Root cause: Unvetted npm package with backdoor
Prevention: Dependency scanning, hash verification, minimal dependencies
Vulnerability: Secrets leaked through application logs
Impact: API keys exposed in attestation report
Root cause: Debug logging enabled in production
Prevention: Structured logging, secret redaction, environment-specific configs
Vulnerability: Side-channel attack through timing
Impact: Model weights extracted via inference timing analysis
Root cause: Variable-time operations based on sensitive data
Prevention: Constant-time algorithms, timing jitter, batch processing

The Security-Performance Balance

Confidential computing introduces performance overhead (5-15% typical). Security hardening adds additional overhead. The key is making informed tradeoffs:

Security Measure	Performance Impact	Security Benefit	Recommended
Memory encryption	2-5%	Critical - prevents physical attacks	Always
Attestation verification	<1% per request	High - ensures TEE integrity	Always
Input validation	1-3%	Critical - prevents injection attacks	Always
Constant-time operations	5-20%	Medium - mitigates side-channels	Sensitive data only
Comprehensive logging	3-10%	Medium - aids incident response	Production with filtering
Runtime dependency scanning	10-30% startup	Low - better at build time	Build time only

Security-First Architecture Patterns

1. Principle of Least Privilege

Design your application to operate with minimal permissions and access.

Best Practice:

# Security-first: Non-root user with minimal permissions
FROM python:3.11-slim
# Create non-root user
RUN useradd -m -u 1000 appuser && \
    mkdir -p /app && \
    chown appuser:appuser /app
# Install dependencies as root, run as non-root
COPY requirements.txt /tmp/
RUN pip install --no-cache-dir -r /tmp/requirements.txt && \
    rm /tmp/requirements.txt
# Switch to non-root user
USER appuser
WORKDIR /app
# Copy application code
COPY --chown=appuser:appuser . .
# Run application as non-root
CMD ["python", "app.py"]

2. Defense in Depth Architecture

Implement multiple layers of security controls.

# Phala Cloud deployment with defense-in-depth
app_name: secure-ai-api
tee_type: intel-tdx
# Layer 1: Infrastructure Security
infrastructure:  
  network_isolation: true  
  firewall_rules:    
    - port: 443      
      protocol: tcp      
      source: "0.0.0.0/0"  # HTTPS only    
    - port: 9944      
      protocol: tcp      
      source: "10.0.0.0/8"  # Internal attestation only
# Layer 2: Application Container Security
container:  
  image: "ghcr.io/myorg/secure-ai-api:v1.2.3"  
  image_verification:    
    signature_check: true    
    sbom_scan: true  
  readonly_rootfs: true  
  no_new_privileges: true
# Layer 3: Runtime Security
runtime:  
  seccomp_profile: "runtime/default"  
  apparmor_profile: "docker-default"  
  capabilities_drop:    
    - ALL  
  capabilities_add:    
    - NET_BIND_SERVICE  # Only what's needed
# Layer 4: Application Security
application:  
  environment_variables:    
    - name: LOG_LEVEL      
      value: "INFO"    
    - name: ENABLE_DEBUG      
      value: "false"  
  secrets:    
    - name: API_KEY      
      source: "phala-kms"      
      encrypted: true    
    - name: MODEL_ENCRYPTION_KEY      
      source: "phala-kms"      
      encrypted: true
# Layer 5: Monitoring and Attestation
security:  
  attestation:    
    continuous_verification: true    
    interval_seconds: 300    
    alert_on_failure: true  
  audit_logging:    
    enabled: true    
    destination: "syslog://secure-log-collector:514"    
    pii_redaction: true

3. Secure Service Communication

Application architecture with mTLS:

# secure_ai_service.py
import os
import ssl
import httpx
from fastapi import FastAPI, HTTPException, Depends
from fastapi.security import HTTPBearer
import hashlib
import hmac

app = FastAPI()
security = HTTPBearer()

# Configuration from environment (set securely via Phala Cloud)
API_SECRET = os.environ["API_SECRET"]  # Retrieved from KMS
TRUSTED_CERT_HASH = os.environ["TRUSTED_CERT_HASH"]

# Create SSL context for mTLS
def create_ssl_context():
    """Create SSL context with certificate pinning"""
    context = ssl.create_default_context()
    # Load client certificate (for mutual TLS)
    context.load_cert_chain(
        certfile="/app/certs/client.crt",
        keyfile="/app/certs/client.key"
    )
    # Enable certificate verification
    context.verify_mode = ssl.CERT_REQUIRED
    context.check_hostname = True
    return context

# Verify request authenticity
def verify_request_signature(authorization: str = Depends(security)):
    """Verify HMAC signature on requests"""
    try:
        token = authorization.credentials
        # Verify HMAC signature (implementation depends on your protocol)
        # This prevents tampering even within TEE network
        expected = hmac.new(
            API_SECRET.encode(),
            msg=token.encode(),
            digestmod=hashlib.sha256
        ).hexdigest()
        if not hmac.compare_digest(token[:64], expected):
            raise HTTPException(status_code=401, detail="Invalid signature")
    except Exception as e:
        raise HTTPException(status_code=401, detail="Authentication failed")

# Example: Secure model inference endpoint
@app.post("/v1/inference")
async def secure_inference(
    request: dict,
    auth: None = Depends(verify_request_signature)
):
    """Inference endpoint with security controls"""
    # Input validation (see Input Validation section)
    if not validate_input(request):
        raise HTTPE
...

Code Hardening Best Practices

1. Minimal Dependency Strategy

Every dependency is a potential attack vector. Minimize your attack surface.

Dependency audit process:

#!/bin/bash
# dependency-audit.sh - Run before every deployment
set -e
echo "=== Dependency Security Audit ==="
# 1. Check for known vulnerabilities
echo "Scanning for known vulnerabilities..."
pip-audit --desc
# 2. Verify package hashes
echo "Verifying package integrity..."
pip install --require-hashes -r requirements.txt
# 3. Check for outdated packages
echo "Checking for outdated packages..."
pip list --outdated
# 4. Analyze dependency tree
echo "Analyzing dependency tree..."
pipdeptree --warn fail
# 5. License compliance check
echo "Checking license compliance..."
pip-licenses --summary
echo "=== Audit Complete ==="

Minimal requirements.txt with hash verification:

# requirements.txt - Pin versions and verify hashes
# Generated with: pip-compile --generate-hashes requirements.in
fastapi==0.104.1 \
    --hash=sha256:fc6a8a70c2ad4e0f3f3b7b2b6f8e4a3a3b3c4d5e6f7a8b9c0d1e2f3a4b5c6d7
uvicorn[standard]==0.24.0 \
    --hash=sha256:a1b2c3d4e5f6a7b8c9d0e1f2a3b4c5d6e7f8a9b0c1d2e3f4a5b6c7d8e9f0a1b2
pydantic==2.5.0 \
    --hash=sha256:b1c2d3e4f5a6b7c8d9e0f1a2b3c4d5e6f7a8b9a0b1c2d3e4f5a6b7c8d9e0f1a2
# Cryptography with verified hash
cryptography==41.0.7 \
    --hash=sha256:c1d2e3f4a5b6c7d8e9f0a1b2c3d4e5f6a7b8c9a0b1c2d3e4f5a6b7c8d9e0f1a2
# No transitive dependencies beyond what's listed
# Total: 4 direct dependencies only

2. Secure Coding Patterns

Input validation and sanitization:

# secure_validators.py
from pydantic import BaseModel, Field, validator
import re

class SecureInferenceRequest(BaseModel):
    """Secure input validation for AI inference requests"""
    prompt: str = Field(
        ...,
        min_length=1,
        max_length=4096,  # Prevent memory exhaustion
        description="User prompt for AI model"
    )
    max_tokens: int = Field(
        default=512,
        ge=1,
        le=2048,  # Prevent resource exhaustion
        description="Maximum tokens to generate"
    )
    temperature: float = Field(
        default=0.7,
        ge=0.0,
        le=2.0,  # Reasonable bounds
        description="Sampling temperature"
    )
    model: str = Field(
        default="deepseek-chat",
        pattern=r"^[a-z0-9\-]+$",  # Alphanumeric and hyphens only
        description="Model identifier"
    )

    @validator('model')
    def validate_model(cls, v):
        """Only allow whitelisted models"""
        allowed_models = {
            "deepseek-chat",
            "llama-3-70b",
            "qwen-2.5-72b",
            "gpt-oss-35"
        }
        if v not in allowed_models:
            raise ValueError(f"Model {v} not allowed. Choose from: {allowed_models}")
        return v

    @validator('prompt')
    def sanitize_prompt(cls, v):
        """Sanitize prompt to prevent injection attacks"""
        v = v.replace('\x00', '')
        sql_patterns = [
            r"(\b(SELECT|INSERT|UPDATE|DELETE|DROP|CREATE|ALTER)\b)",
            r"(--|\#|\/\*|\*\/)",
            r"(\bOR\b.*=.*|1=1|' OR ')"
        ]
        for pattern in sql_patterns:
            if re.search(pattern, v, re.IGNORECASE):
                raise ValueError("Potentially malicious SQL pattern detected")
        cmd_patterns = [r"[;&|`$]", r"\$\(", r"\$\{"]
        for pattern in cmd_patterns:
            if re.search(pattern, v):
                raise ValueError("Potential
...

3. Secure Secret Management

Never hardcode secrets. Use [Phala Cloud KMS](https://docs.phala.com/phala-cloud/key-management/get-a-key):

# secure_secrets.py
import os
from dstack_sdk import DstackClient
from functools import lru_cache
import hashlib

class SecretManager:
    """Secure secret management using Phala Cloud KMS"""
    def __init__(self):
        self.client = DstackClient()
        self._verify_tee_environment()

    def _verify_tee_environment(self) -> bool:
        """Verify we're running in genuine TEE"""
        tee_indicators = [
            os.path.exists("/dev/tdx_guest"),
            os.path.exists("/dev/sev"),
            os.path.exists("/var/run/dstack.sock")
        ]
        if not any(tee_indicators):
            raise RuntimeError("Must run in TEE environment")
        return True

    @lru_cache(maxsize=128)
    def get_key(self, key_path: str) -> bytes:
        """Get deterministic key from Dstack KMS

        Args:
            key_path: Unique identifier like 'my-app/encryption/v1'

        Returns:
            32-byte deterministic key
        """
        try:
            result = self.client.get_key(key_path)
            key_bytes = result.decode_key()  # 32 bytes
            return key_bytes
        except Exception as e:
            print(f"Failed to retrieve key: {type(e).__name__}")
            raise RuntimeError(f"Key retrieval failed for path: {key_path}")

    def derive_secret_key(self, purpose: str, version: str = "v1") -> bytes:
        """Derive application-specific secret key

        Args:
            purpose: Key purpose like 'api-auth', 'db-encryption'
            version: Key version for rotation

        Returns:
            32-byte secret key
        """
        key_path = f"app-secrets/{purpose}/{version}"
        return self.get_key(key_path)

# Global secret manager instance
secrets = SecretManager()

# Usage in application
def get_encryption_key():
    """Get encryption key for sensitive data"""
    # Keys are deterministic - same path always return
...

Secure environment configuration:

# phala-secrets.yaml - Secret configuration for Phala Cloud
app_name: secure-ai-api
# Secrets managed by Phala Cloud KMS
secrets:  
  - name: API_KEY    
    description: "External API authentication key"    
    rotation_days: 90  
  - name: API_SECRET    
    description: "HMAC secret for request signing"    
    rotation_days: 90  
  - name: MODEL_ENCRYPTION_KEY    
    description: "AES-256 key for model weight encryption"    
    rotation_days: 180  
  - name: DB_PASSWORD    
    description: "PostgreSQL password"    
    rotation_days: 30  
  - name: OAUTH_CLIENT_SECRET    
    description: "OAuth 2.0 client secret"    
    rotation_days: 60
# Non-secret configuration (can be env vars)
config:  
  LOG_LEVEL: "INFO"  
  MAX_WORKERS: "4"  
  REQUEST_TIMEOUT: "30"  
  MODEL_NAME: "deepseek-chat"

Input Validation and Output Sanitization

Defense Against Injection Attacks

# secure_handlers.py
import re
from typing import Any, Dict
import html

class SecurityValidator:
    """Comprehensive input validation and output sanitization"""
    INJECTION_PATTERNS = {
        'sql': [
            r"(\bSELECT\b.*\bFROM\b)",
            r"(\bINSERT\b.*\bINTO\b)",
            r"(\bUPDATE\b.*\bSET\b)",
            r"(\bDELETE\b.*\bFROM\b)",
            r"(\bDROP\b.*\bTABLE\b)",
            r"(--|\#|\/\*)",
            r"('.*OR.*'.*=.*')"
        ],
        'command': [
            r"[;&|`]",
            r"\$\(",
            r"\$\{",
            r">\s*/",
            r"<\s*script"
        ],
        'path_traversal': [
            r"\.\./",
            r"\.\.\\",
            r"/etc/passwd",
            r"/proc/",
            r"C:\\Windows"
        ]
    }

    @staticmethod
    def validate_input(data: Any, context: str = "general") -> bool:
        """Validate input data for security threats"""
        if isinstance(data, str):
            return SecurityValidator._validate_string(data, context)
        elif isinstance(data, dict):
            return all(
                SecurityValidator.validate_input(v, context)
                for v in data.values()
            )
        elif isinstance(data, list):
            return all(
                SecurityValidator.validate_input(item, context)
                for item in data
            )
        return True

    @staticmethod
    def _validate_string(value: str, context: str) -> bool:
        """Validate string for injection patterns"""
        if len(value) > 100000:  # 100KB max
            raise ValueError("Input exceeds maximum length")
        if '\x00' in value:
            raise ValueError("Null bytes not allowed")
        for category, patterns in SecurityValidator.INJECTION_PATTERNS.items():
            for pattern in patterns:
                if re.search(pattern, value, re.IGNORECASE):
    
...

Continuous Attestation and Trust Verification

Implementing Continuous Attestation

# continuous_attestation.py
import httpx
import asyncio
from datetime import datetime, timedelta
from typing import Optional

class ContinuousAttestationService:
    """Continuously verify TEE attestation and report to Trust Center"""
    def __init__(
        self,
        app_id: str,
        trust_center_url: str = "https://trust-center.phala.network",
        attestation_interval: int = 300  # 5 minutes
    ):
        self.app_id = app_id
        self.trust_center_url = trust_center_url
        self.attestation_interval = attestation_interval
        self.last_attestation: Optional[datetime] = None
        self.attestation_hash: Optional[str] = None

    async def start(self):
        """Start continuous attestation loop"""
        print(f"Starting continuous attestation (interval: {self.attestation_interval}s)")
        while True:
            try:
                await self.perform_attestation()
                await asyncio.sleep(self.attestation_interval)
            except Exception as e:
                print(f"Attestation failed: {e}")
                await self.alert_attestation_failure(str(e))
                await asyncio.sleep(60)  # Retry after 1 minute

    async def perform_attestation(self):
        """Perform single attestation cycle"""
        report = await self.generate_attestation_report()
        response = await self.submit_to_trust_center(report)
        self.last_attestation = datetime.utcnow()
        self.attestation_hash = response['attestation_hash']
        print(f"Attestation successful: {self.attestation_hash[:16]}...")

    async def generate_attestation_report(self) -> dict:
        """Generate TEE attestation report via Dstack SDK"""
        from dstack_sdk import DstackClient

        try:
            client = DstackClient()
            # Get attestation quote from TEE
            quote_result = client.get_quote(report_data=b"")

...

Production Deployment Security Checklist

Pre-Deployment Security Audit

# Security Checklist for Confidential AI Deployment

## Code Security
- [ ] All dependencies audited (pip-audit, npm audit)
- [ ] Package hashes verified in requirements.txt
- [ ] No secrets in source code or environment files
- [ ] Input validation on all user inputs
- [ ] Output sanitization implemented
- [ ] Error messages don't leak sensitive info
- [ ] Logging redacts PII and secrets

## Configuration Security
- [ ] Running as non-root user
- [ ] Read-only root filesystem
- [ ] Minimal capabilities (no unnecessary privileges)
- [ ] Resource limits configured (CPU, memory, disk)
- [ ] Network policies restrict unnecessary connections
- [ ] Firewall rules follow least-privilege

## Secret Management
- [ ] All secrets stored in Phala Cloud KMS
- [ ] No secrets in environment variables (production)
- [ ] Secret rotation policy configured
- [ ] Secrets never logged or printed
- [ ] Secrets loaded from KMS at runtime only

## TEE Security
- [ ] Continuous attestation enabled (5-minute intervals)
- [ ] Attestation reports published to Trust Center
- [ ] Alert system configured for attestation failures
- [ ] Code hash documented for client verification
- [ ] TEE type and version documented

## Application Security
- [ ] mTLS configured for external connections
- [ ] HTTPS enforced (no HTTP)
- [ ] Request authentication implemented
- [ ] Rate limiting configured
- [ ] CORS policy configured correctly
- [ ] SQL injection defenses (if using database)
- [ ] XSS defenses (if rendering HTML)

## Monitoring and Incident Response
- [ ] Structured logging with security events
- [ ] Log aggregation configured
- [ ] Alert system tested
- [ ] Incident response playbook documented
- [ ] Security contact information current
- [ ] Backup and recovery tested

## Compliance (if applicable)
- [ ] GDPR compliance verified
- [ ] HIPAA compliance verified
- [ ] SOC 2 controls documented
- [ 
...

Automated Security Testing

# security_tests.py
import pytest
import httpx

@pytest.mark.asyncio
class TestSecurityControls:
    """Automated security tests for confidential AI service"""

    @pytest.fixture
    def base_url(self):
        return "https://my-ai-service.phala.cloud"

    async def test_https_enforced(self, base_url):
        """Test that HTTP is redirected to HTTPS"""
        http_url = base_url.replace("https://", "http://")
        async with httpx.AsyncClient(follow_redirects=False) as client:
            response = await client.get(http_url)
            assert response.status_code == 301
            assert response.headers['location'].startswith('https://')

    async def test_authentication_required(self, base_url):
        """Test that endpoints require authentication"""
        async with httpx.AsyncClient() as client:
            response = await client.post(
                f"{base_url}/v1/chat",
                json={"message": "test"}
            )
            assert response.status_code == 401

    async def test_sql_injection_blocked(self, base_url, auth_headers):
        """Test SQL injection prevention"""
        malicious_inputs = [
            "' OR '1'='1",
            "1; DROP TABLE users--",
            "' UNION SELECT * FROM secrets--"
        ]
        for payload in malicious_inputs:
            async with httpx.AsyncClient() as client:
                response = await client.post(
                    f"{base_url}/v1/chat",
                    json={"message": payload},
                    headers=auth_headers
                )
                assert response.status_code == 400

    async def test_xss_prevention(self, base_url, auth_headers):
        """Test XSS attack prevention"""
        xss_payloads = [
            "<script>alert('xss')</script>",
            "<img src=x onerror=alert('xss')>",
            "javascript:alert('xss')"
        ]
        fo
...

Common Security Vulnerabilities and Prevention

1. Side-Channel Attacks

Vulnerability: Timing attacks can leak information about model weights or input data.

Prevention:

# timing_protection.py
import time
import random

def constant_time_compare(a: str, b: str) -> bool:
    """Constant-time string comparison to prevent timing attacks"""
    if len(a) != len(b):
        return False
    result = 0
    for x, y in zip(a, b):
        result |= ord(x) ^ ord(y)
    return result == 0

def add_timing_jitter(min_ms: int = 10, max_ms: int = 50):
    """Add random delay to prevent timing analysis"""
    delay = random.uniform(min_ms / 1000, max_ms / 1000)
    time.sleep(delay)

# Use in authentication
def verify_api_key(provided_key: str, expected_key: str) -> bool:
    """Verify API key with timing attack protection"""
    is_valid = constant_time_compare(provided_key, expected_key)
    add_timing_jitter(10, 50)
    return is_valid

2. Memory Disclosure

Vulnerability: Uninitialized memory or buffer overflows can leak data.

Prevention:

# memory_safety.py
import ctypes

def secure_zero_memory(data: bytearray):
    """Securely zero memory before deallocation"""
    length = len(data)
    ctypes.memset(id(data), 0, length)

def secure_string_handling(secret: str) -> bytearray:
    """Handle sensitive strings with secure memory management"""
    secure_data = bytearray(secret.encode('utf-8'))
    return secure_data

# Example usage with API keys
def process_api_key(api_key: str):
    """Process API key with secure memory handling"""
    secure_key = secure_string_handling(api_key)
    try:
        result = authenticate_with_key(bytes(secure_key))
        return result
    finally:
        secure_zero_memory(secure_key)

3. Dependency Vulnerabilities

Vulnerability: Vulnerable dependencies introduce security risks.

Prevention:

#!/bin/bash
# continuous_dependency_scanning.sh
# Run daily as cron job or CI/CD pipeline
set -e
echo "=== Daily Dependency Security Scan ==="
pip install --upgrade pip-audit
echo "Scanning for vulnerabilities..."
pip-audit --desc --format json > audit-report.json
CRITICAL=$(jq '[.vulnerabilities[] | select(.severity=="CRITICAL")] | length' audit-report.json)
HIGH=$(jq '[.vulnerabilities[] | select(.severity=="HIGH")] | length' audit-report.json)
echo "Critical vulnerabilities: $CRITICAL"
echo "High vulnerabilities: $HIGH"
if [ "$CRITICAL" -gt 0 ]; then
    echo "ALERT: Critical vulnerabilities detected!"
    curl -X POST "$ALERT_WEBHOOK" \
        -H "Content-Type: application/json" \
        -d "{\"severity\":\"CRITICAL\",\"message\":\"$CRITICAL critical vulnerabilities detected\",\"report\":\"$(cat audit-report.json)\"}"
    exit 1
fi
if [ "$HIGH" -gt 0 ]; then
    echo "WARNING: High severity vulnerabilities detected"
fi
echo "=== Scan Complete ==="

Summary and Best Practices

Security-First Development Mindset

Assume Breach: Design systems assuming attackers may gain partial access
Defense in Depth: Implement multiple layers of security controls
Least Privilege: Grant minimum necessary permissions
Fail Securely: Ensure failures don’t compromise security
Continuous Verification: Regularly verify security posture

Key Takeaways

Security Layer	Critical Controls	Phala Cloud Implementation
Infrastructure	TEE, network isolation, firewall	Intel TDX, AMD SEV-SNP, H100 TEE
Application	Input validation, output sanitization	Pydantic validation, HTML escaping
Secrets	KMS integration, no hardcoded secrets	Phala Cloud KMS via Dstack SDK
Attestation	Continuous verification, public reporting	Trust Center integration
Monitoring	Structured logging, alerting	Audit logs with PII redaction

Quick Start Security Template

# secure-deployment-template.yaml
# Drop-in template for secure Phala Cloud deployments
app_name: ${YOUR_APP_NAME}
tee_type: intel-tdx  # or amd-sev, nvidia-h100
# Container security
container:  
  image: ${YOUR_REGISTRY}/${YOUR_IMAGE}:${VERSION}  
  image_verification: true  
  readonly_rootfs: true  
  user: 1000  # Non-root
# Resource limits (prevent DoS)
resources:  
  cpu_limit: "4"  
  memory_limit: "8Gi"  
  storage_limit: "20Gi"
# Network security
network:  
  isolation: true  
  allow_egress:    
    - "api.openai.com:443"  # Whitelist only needed endpoints  
  firewall_rules:    
    - port: 443      
      protocol: tcp
# Secret management
secrets:  
  - name: API_KEY    
    source: phala-kms  
  - name: MODEL_KEY    
    source: phala-kms
# Security monitoring
security:  
  attestation:    
    enabled: true    
    interval: 300  # 5 minutes  
  audit_logging:    
    enabled: true    
    redact_pii: true  
  alerts:    
    webhook: ${ALERT_WEBHOOK_URL}

FAQ

Q: How much performance overhead does security hardening add?

A: Typical overhead is 5-15% total:

TEE memory encryption: 2-5%
Input validation: 1-3%
Attestation: <1%
Logging and monitoring: 3-10%

The performance cost is worth the security guarantee.

Q: Can I use existing Python/Node.js packages in TEE?

A: Yes, but audit them first:

Run pip-audit or npm audit
Verify package hashes
Review dependencies for unnecessary scope
Consider vendoring critical dependencies

Q: How do I handle secrets during development vs production?

A: Use environment-aware configuration:

Development: Environment variables (with warnings)
Production: Phala Cloud KMS only
Never: Hardcoded secrets or committed .env files

Q: What if my attestation fails in production?

A: Implement graceful degradation:

Alert immediately (critical security event)
Stop accepting new requests
Complete in-flight requests
Investigate root cause (configuration change, hardware issue, attack)

Q: How often should I rotate secrets?

A: Recommended rotation schedule:

API keys: 90 days
Database passwords: 30 days
TLS certificates: Before expiration (typically 90 days)
Model encryption keys: 180 days

Q: Can attackers extract model weights from a secure TEE?

A: No, if properly configured:

Model weights stay in encrypted TEE memory
Attestation proves code integrity
No debugging interfaces exposed
Timing attacks mitigated with jitter

However, ensure all security best practices are followed.

Q: How do I verify a third-party confidential AI service?

A: Before trusting any service:

Request their attestation hash
Verify with Phala Trust Center
Check code hash matches expected value
Verify attestation is recent (<1 hour)
Check TEE type matches requirements

Q: What’s the difference between TEE security and traditional cloud security?

Aspect	Traditional Cloud	TEE (Phala Cloud)
Trust model	Trust cloud provider	Zero-trust (don’t trust anyone)
Memory protection	Software isolation	Hardware encryption
Verification	Provider claims	Cryptographic attestation
Admin access	Provider has access	No one has access (even Phala)

What’s Next?

Now that you understand secure AI development practices, explore:

**Remote Attestation Deep Dive** - Technical details of TEE verification
**Open Source Tools** - Dstack SDK and confidential computing ecosystem
**Cloud Provider Comparison** - Phala Cloud vs AWS vs Azure vs GCP security

Ready to build secure confidential AI?

Get Started on Phala Cloud - Deploy your first secure AI application with built-in best practices.

Quick Start Security Template

FAQ

Secure AI Development with Confidential Computing: Best Practices Guide

Introduction

Why Secure Development Matters in TEE Environments

The TEE Security Model

The Cost of Security Failures

The Security-Performance Balance

Security-First Architecture Patterns

1. Principle of Least Privilege

2. Defense in Depth Architecture

3. Secure Service Communication

Code Hardening Best Practices

1. Minimal Dependency Strategy

2. Secure Coding Patterns

3. Secure Secret Management

Input Validation and Output Sanitization

Defense Against Injection Attacks

Continuous Attestation and Trust Verification

Implementing Continuous Attestation

Production Deployment Security Checklist

Pre-Deployment Security Audit

Automated Security Testing

Common Security Vulnerabilities and Prevention

1. Side-Channel Attacks

2. Memory Disclosure

3. Dependency Vulnerabilities

Summary and Best Practices

Security-First Development Mindset

Key Takeaways

Quick Start Security Template

FAQ

What’s Next?

Related Resources

Next Steps

Recent Articles

Confidential Computing Trends 2025

Phala Private AI Cloud Guide

Confidential LLMs

Recent Articles

Related Articles

HIPAA-Compliant Medical AI with TEE

Financial Analysis with Confidential Computing

Building Customer Trust Through Transparenc

Related Articles

Recent Articles

Confidential Computing Trends 2025

Phala Private AI Cloud Guide

Confidential LLMs

Related Articles

HIPAA-Compliant Medical AI with TEE

Financial Analysis with Confidential Computing

Building Customer Trust Through Transparenc