GPU TEE Marketplace

TEE-ready GPUs for AI builders.

Name: GPU TEE - Confidential GPU Computing
Brand: Phala
Price: 50.37 USD
Availability: InStock
Rating: 4.8 (127 reviews)

H100, H200, and B300 capacity with CVMs, dual attestation, and TEE-aware operations.

Trial a machine for 24 hours, reserve a slot, or quote dedicated clusters. Phala handles the hard part: confidential GPUs, Intel TDX runtime, NVIDIA attestation, and the DevOps required to keep it working.

Trial nowQuote price

Confidential GPU cloud

Capacity first. Proof after the workload runs.

H100 / H200 / B300

hardware proof rail

H100, H200, and B300 move through one verifiable GPU path.

GPU TEE

H100

GPU TEE

H200

GPU TEE

B300

Trial

24h minimum

Reserve

slots and clusters

Verify

CVM + GPU evidence

Marketplace inventory

Capacity with proof built in.

Pick a GPU for a 24-hour trial, reserve a slot for sustained jobs, or quote a dedicated cluster. Every path starts from TEE-ready infrastructure instead of a raw GPU box.

Trial ready

NVIDIA H100

Proven confidential inference and fine-tuning capacity.

Memory

80GB HBM3

Bandwidth

3.35 TB/s

Region

US-West

Scale

1-2 GPUs

On-demand

$3.08/GPU/hr

24h minimum

Slot

$2.38/GPU/hr

reserved

Trial now Details

Slot ready

NVIDIA H200

High-memory runtime for larger private model jobs.

Memory

141GB HBM3e

Bandwidth

4.8 TB/s

Region

US-West / India

Scale

1-8 GPUs

On-demand

$4.80/GPU/hr

24h minimum

Slot

$3.20/GPU/hr

reserved

Trial now Details

Quote now

NVIDIA B300

Blackwell Ultra confidential capacity for frontier inference.

Memory

288GB HBM3e

Bandwidth

8 TB/s

Region

US-East / US-West

Scale

1-8 GPUs

1-month

$6.50/GPU/hr

30d minimum

Slot

$5.60/GPU/hr

reserved

Trial now Details

Prices include Intel TDX + NVIDIA confidential computing readiness. Volume and enterprise pricing are quoted by workload.

Quote price

用于隐私 AI GPU 规划的性能指标

H100

H200

B300

relative index

H100

1.9x

H200

3.2x

B300

LLM 推理

model + KV cache

80GB

H100

141GB

H200

288GB

B300

GPU 内存

feed batches

3.35TB/s

H100

4.8TB/s

H200

8TB/s

B300

内存带宽

NVIDIA H100

80GB HBM3

NVIDIA H200

141GB HBM3e

NVIDIA B300

288GB HBM3e

GPU 对比

H100、H200 与 B300

在报价前先比较容量形态。H100 适合快速试用，H200 提供更大显存余量，B300 则是面向前沿推理和专属集群的 Blackwell Ultra 路径。

实际吞吐量取决于模型、批大小、精度和运行时。Phala 会将 GPU 与隐私虚拟机路径、GPU CC 就绪度和证明操作一起报价。

GPU 云示意图

带有证明状态的容量通道。

市场视图应当让购买路径一目了然：试用、预留，然后扩展到带有 TEE 就绪状态的专属集群。

H100

80GB HBM3

from

$3.08/hr

H200

141GB HBM3e

from

$4.80/hr

B300

288GB HBM3e

from

$6.50/hr

已验证的

CVM 运行时

已验证的

GPU CC 模式

已验证的

双重证明

GPU TEE 证明路径

Phala 为 CVM 路径处理的内容。

只有当整个路径——运行时、GPU 模式和证据采集——端到端可验证时，GPU 隔离才有意义。Phala 将三者一起交付。

                                                                                
                                                                                
                                                                                
                                                                                
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++                         
                         ++++++++++++++++++++++++++++++

cvm-enclave · 80×24 · 24fpsdensity: .:-=+*#%@

CVM 运行时

Docker 工作负载在带 GPU 直通的 Intel TDX 机密虚拟机中运行。运行时在工作负载启动前会被封存以对抗操作员，并由固件度量。

                                                                                
                                                                                
      @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@      
      @                                                                  @      
      @  @@@@ @@@@ @@@@ @@@@ @@@@ @@@@ @@@@ @@@@ @@@@ @@@@ @@@@ @@@@ @   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @  ...: :::: :... .::: :::: ...: :::: ::.. .::: :::: ...: :::: :   @      
      @                                                                  @      
      @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@

gpu-cc · 80×22 · 24fpsdensity: .:-=+*#%@

GPU CC 模式

NVIDIA Confidential Computing 将模型权重、激活值和 KV cache 封装在受保护的 GPU 内存中。GPU 与 CPU TEE 一起执行计算隔离。

                                                                                
                                                                                
                                                                                
     @@@@@@@@@@@@                                                               
     @=--::--=++@+-:.                                                           
     @=--::--=++@#*+-::....                                                     
     @@@@@@@@@@@@        ...........                                            
                                   ..........                  @@@@@@@@@@@@@    
                                            ...........        @======++***@    
                                                     ..........@=====++****@    
                                                     ..........@====++*****@    
                                       .::::...........        @===++******@    
                                  .:-*%@@%*-:.                 @@@@@@@@@@@@@    
     @@@@@@@@@@@@        ...........::::.                                       
     @:::-==++++@..........                                                     
     @:::-==++++@                                                               
     @@@@@@@@@@@@

dual-attestation · 80×20 · 24fpsdensity: .:-=+*#%@

双重证明

Intel TDX 和 NVIDIA 都会输出签名 quote。Phala 将两者收集并通过一个验证器对外呈现，让 CVM 和 GPU 一起证明自身。

购买路径

先小规模启动，验证后再预留。

这个市场的结构与 AI 构建者实际采购 GPU 的方式一致：先快速测试，在工作负载验证后预留容量，然后在集群变成生产关键时转入企业交易。

01 / 按需

24 小时内试用机密 GPU。

供构建者验证隐私推理、模型服务或证明生成的短时测试窗口。

立即试用

02 / 席位

在下一次运行前预留容量。

为持续训练、微调和基准测试窗口提供可预测的 GPU 访问。

获取报价

03 / 企业版

具备 TEE 操作的专用集群。

支持 TEE 感知基础设施和部署规划的定制 H100、H200 或 B300 合作方案。

联系销售

AI 解决方案路径

Use GPU TEE where AI touches secrets.

GPU capacity is one part of the privacy boundary. The same confidential compute path supports private inference, agents, training, and data workflows.

LLM API