NVIDIA H200 with Full-Stack TEE
NVIDIA H200 Tensor Core GPU

Immediate Access to NVIDIA H200 GPUs

From a single GPU to thousands. Industry's largest GPU memory with complete Intel TDX + NVIDIA Confidential Computing protection. On-demand or reserved pricing.

141GB
HBM3e Memory
4.8 TB/s
Bandwidth
2x Faster
vs H100
Full TEE
Intel + NVIDIA
Starting at $3.50/GPU/hr on-demand or $2.56/GPU/hr with 6-month commitment (27% savings)

Flagship AI Performance

Industry's largest GPU memory with Full-Stack TEE security

GPU TEE Bento
33K

tokens/sec

Llama 2 70B Inference (1.51x vs H100)

141GB

HBM3e Memory (1.76x vs H100)
4.8 TB/s Bandwidth

1.9x

Faster training
vs H100

700W

Same power, more performance

Full-Stack TEE

Intel TDX + NVIDIA Confidential Computing with dual attestation

Memory encryption
Verified boot chain
4.8

TB/s Memory Bandwidth

1.4x improvement over H100's 3.35 TB/s

Available Now

US-West & India

1-8 GPUs

Scalable Configurations

Configure Now

H200 vs B200 / H100

Official benchmarks from MLPerf v5.0 and NVIDIA Technical Labs

LLM Inference Performance (Llama 2 70B)

Tokens per second (single GPU) - higher is better

H10021.8K tok/s
29%
H20033.0K tok/s
44%
B20075.0K tok/s
100%

H100: 21.8K tokens/sec (baseline) | H200: 33K tokens/sec (1.51x) | B200: 75K+ tokens/sec (3.4x)

Source: MLPerf Inference v5.0 (2024), NVIDIA Technical Blogs

All benchmarks include Full-Stack TEE protection with <5% overhead

H200 Pricing

Flexible pricing for flagship performance. All prices include Full-Stack TEE protection.

On-Demand

Pay only for what you use

$3.50/GPU/hr

No commitment required

Scale from 1-8 GPUs instantly
US-West & India regions
Full-Stack TEE included
Dual attestation reports
SAVE 27%
Reserved

6-month commitment

$2.56/GPU/hr

Best value for sustained workloads

Guaranteed GPU availability
Priority support
Custom configurations
Enterprise SLA options

Available H200 Configurations

1x H200

Available Now
Total Memory141GB HBM3e
Total Bandwidth4.8 TB/s
TEE ProtectionFull-Stack TEE
RegionsUS-West, India
Flagship performance for demanding AI workloads with complete hardware protection
Configure & Deploy

2x H200

Available Now
Total Memory282GB HBM3e
Total Bandwidth9.6 TB/s
TEE ProtectionFull-Stack TEE
RegionsUS-West, India
Flagship performance for demanding AI workloads with complete hardware protection
Configure & Deploy

4x H200

Available Now
Total Memory564GB HBM3e
Total Bandwidth19.2 TB/s
TEE ProtectionFull-Stack TEE
RegionsUS-West, India
Flagship performance for demanding AI workloads with complete hardware protection
Configure & Deploy

8x H200

Available Now
Total Memory1.1TB HBM3e
Total Bandwidth38.4 TB/s
TEE ProtectionFull-Stack TEE
RegionsUS-West, India
Flagship performance for demanding AI workloads with complete hardware protection
Configure & Deploy
Security

Full-Stack TEE Architecture

Complete hardware protection from CPU to GPU. Intel TDX + NVIDIA Confidential Computing working together.

Full VM Isolation

Intel TDX protects CPU, memory, and VM from host access. Complete isolation.

GPU Memory Encryption

NVIDIA CC encrypts all GPU memory. Model weights and data stay secure.

Dual Attestation

Cryptographic proof from Intel + NVIDIA. Independently verifiable.

End-to-End Protection

Data encrypted in transit (TLS), at rest (AES-256), and during processing (TEE).

Multi-Region Deployment

Deploy in US-West and India. Same Full-Stack TEE protection everywhere.

Compliance Ready

GDPR, HIPAA, SOC 2 compliant. Hardware-backed security guarantees.

What You Can Build with GPU TEE

Real-world applications running on Phala Cloud with complete Intel TDX + NVIDIA Confidential Computing protection

Private Enterprise AI

Train and deploy models on sensitive healthcare, financial, or legal data with complete hardware protection. Your data never leaves the TEE.

User-Owned AI Agents

Build autonomous AI agents that securely manage cryptographic keys and digital assets. Powers platforms like Eliza and Virtuals Game Agents.

ZK Proof Generation

Accelerate zkVM and zkRollup proof generation with GPU TEE. SP1 zkVM runs with <5% TEE overhead—verified with dual attestation.

FHE/MPC Acceleration

Use GPU TEE as 2FA for FHE and MPC systems. Secure key generation, computation integrity, and attestation in one platform. Powers Fairblock and Mind Network.

Multi-Proof Systems

Combine ZK proofs with TEE attestation for double security. Hedge against cryptographic bugs while maintaining verifiability.

Regulatory Compliance

Meet GDPR, HIPAA, and SOC 2 requirements with hardware-backed privacy guarantees. Full audit trail with Intel and NVIDIA attestation.

Three Ways to Deploy GPU TEE

Choose the deployment model that fits your needs—from full control to instant deployment

CVM + GPU: Maximum Flexibility

Deploy your own Docker containers with SSH access to TEE-protected GPUs. Perfect for developers who need complete control.

  • Deploy custom Docker containers with full SSH access
  • Fine-tune models on private data with complete hardware protection
  • Intel TDX + NVIDIA Confidential Computing protection
  • Dual attestation reports (Intel + NVIDIA) for verification
Deploy CVM Now