GPU TEE Marketplace

TEE-ready GPUs for AI builders.

Name: GPU TEE - Confidential GPU Computing
Brand: Phala
Price: 50.37 USD
Availability: InStock
Rating: 4.8 (127 reviews)

H100, H200, and B300 capacity with CVMs, dual attestation, and TEE-aware operations.

Trial a machine for 24 hours, reserve a slot, or quote dedicated clusters. Phala handles the hard part: confidential GPUs, Intel TDX runtime, NVIDIA attestation, and the DevOps required to keep it working.

Trial now / Quote price

Confidential GPU cloud

Capacity first. Proof after the workload runs.

H100 / H200 / B300

hardware proof rail

H100, H200, and B300 move through one verifiable GPU path.

Trial: 24h minimum
Reserve: slots and clusters
Verify: CVM + GPU evidence

Marketplace inventory

Capacity with proof built in.

Pick a GPU for a 24-hour trial, reserve a slot for sustained jobs, or quote a dedicated cluster. Every path starts from TEE-ready infrastructure instead of a raw GPU box.

Trial ready

NVIDIA H100

Proven confidential inference and fine-tuning capacity.

Memory: 80GB HBM3
Bandwidth: 3.35 TB/s
Region: US-West
Scale: 1-2 GPUs
On-demand: $3.08/GPU/hr (24h minimum)
Slot: $2.38/GPU/hr (reserved)

Trial now / Details

Slot ready

NVIDIA H200

High-memory runtime for larger private model jobs.

Memory: 141GB HBM3e
Bandwidth: 4.8 TB/s
Region: US-West / India
Scale: 1-8 GPUs
On-demand: $4.80/GPU/hr (24h minimum)
Slot: $3.20/GPU/hr (reserved)

Trial now / Details

Quote now

NVIDIA B300

Blackwell Ultra confidential capacity for frontier inference.

Memory: 288GB HBM3e
Bandwidth: 8 TB/s
Region: US-East / US-West
Scale: 1-8 GPUs
1-month: $6.50/GPU/hr (30d minimum)
Slot: $5.60/GPU/hr (reserved)

Trial now / Details

Prices include Intel TDX + NVIDIA confidential computing readiness. Volume and enterprise pricing are quoted by workload.
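The listed rates can be turned into a quick budget check. The sketch below is only arithmetic on the card prices; it assumes the minimum term is billed in full and that "30d" means 720 hours, neither of which the page states. The function names are illustrative.

```python
# Listed per-GPU hourly prices from the cards above (USD).
# "listed" is the non-reserved rate (on-demand for H100/H200, 1-month for B300).
PRICES = {
    "H100": {"listed": 3.08, "slot": 2.38, "min_hours": 24},
    "H200": {"listed": 4.80, "slot": 3.20, "min_hours": 24},
    "B300": {"listed": 6.50, "slot": 5.60, "min_hours": 30 * 24},  # assumes 30d = 720h
}


def minimum_commit(gpu: str, gpus: int = 1) -> float:
    """Smallest spend at the listed rate: rate x minimum hours x GPU count.

    Assumes the minimum term is billed in full.
    """
    p = PRICES[gpu]
    return round(p["listed"] * p["min_hours"] * gpus, 2)


def slot_discount(gpu: str) -> float:
    """Fractional saving of the reserved slot rate versus the listed rate."""
    p = PRICES[gpu]
    return round(1 - p["slot"] / p["listed"], 3)


if __name__ == "__main__":
    for name in PRICES:
        print(name, minimum_commit(name), slot_discount(name))
```

Under these assumptions a single-GPU H100 trial comes to $73.92, and the slot rate is roughly 23% (H100), 33% (H200), and 14% (B300) below the listed rate.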

Quote price

Performance data for private AI GPU planning

LLM inference (relative index): H100 baseline, H200 1.9x, B300 3.2x
GPU memory (model + KV cache): H100 80GB HBM3, H200 141GB HBM3e, B300 288GB HBM3e
Memory bandwidth (feed batches): H100 3.35TB/s, H200 4.8TB/s, B300 8TB/s
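One way to read the memory bandwidth figures is as a ceiling on single-stream decoding: if every generated token has to stream all model weights from GPU memory once, tokens per second cannot exceed bandwidth divided by weight size. The sketch below applies that common rule of thumb to the listed bandwidths. It ignores KV cache reads, compute limits, batching, and any confidential computing overhead, and the 70B FP8 model is a hypothetical example, not a Phala benchmark.

```python
# Listed memory bandwidth per GPU, in TB/s.
BANDWIDTH_TBPS = {"H100": 3.35, "H200": 4.8, "B300": 8.0}


def decode_ceiling_tokens_per_s(gpu: str, params_billion: float,
                                bytes_per_param: float) -> float:
    """Rough upper bound on batch-1 decode speed.

    Assumes each token requires reading all weights once from GPU memory,
    so the ceiling is bandwidth / weight bytes. Real throughput is lower.
    """
    weight_bytes = params_billion * 1e9 * bytes_per_param
    return BANDWIDTH_TBPS[gpu] * 1e12 / weight_bytes


if __name__ == "__main__":
    # Hypothetical 70B-parameter model stored at 1 byte per parameter (FP8).
    for name in BANDWIDTH_TBPS:
        print(name, round(decode_ceiling_tokens_per_s(name, 70, 1), 1))
```

The ceilings scale with bandwidth alone, so this estimate will not reproduce the relative LLM inference index, which presumably also reflects compute and precision differences.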

GPU comparison

H100 vs H200 vs B300

Compare the shape of the capacity before you request a quote. H100 is the fast trial path, H200 adds memory headroom, and B300 is the Blackwell Ultra path for frontier inference and dedicated clusters.

Exact throughput depends on model, batch size, precision, and runtime. Phala lists each GPU together with its confidential VM path, GPU CC readiness, and attestation operations.
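To make "memory headroom" concrete, a first-pass sizing check can compare weights plus KV cache against the listed per-GPU memory. The sketch below uses the standard KV cache estimate (two tensors per layer, one key and one value) and decimal gigabytes. The model shape, the 90% usable-memory headroom, and the function names are assumptions for illustration; it also ignores activations and runtime overhead.

```python
from typing import Optional

# Listed per-GPU memory, in GB (decimal, 1 GB = 1e9 bytes).
GPU_MEMORY_GB = {"H100": 80, "H200": 141, "B300": 288}


def weights_gb(params_billion: float, bytes_per_param: float) -> float:
    """Weight footprint: 1e9 params x bytes per param / 1e9 bytes per GB."""
    return params_billion * bytes_per_param


def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                context_tokens: int, batch: int, bytes_per_value: int = 2) -> float:
    """KV cache footprint: key and value tensors for every layer and token."""
    values = 2 * layers * kv_heads * head_dim * context_tokens * batch
    return values * bytes_per_value / 1e9


def smallest_fit(required_gb: float, headroom: float = 0.9) -> Optional[str]:
    """Smallest single GPU whose usable memory covers the requirement.

    headroom is an assumed usable fraction, not a vendor figure.
    """
    for gpu, capacity in sorted(GPU_MEMORY_GB.items(), key=lambda kv: kv[1]):
        if required_gb <= capacity * headroom:
            return gpu
    return None


if __name__ == "__main__":
    # Hypothetical 70B model at FP8 with 80 layers, 8 KV heads of dim 128.
    need = weights_gb(70, 1) + kv_cache_gb(80, 8, 128, 8192, 4)
    print(round(need, 1), smallest_fit(need))
```

For this hypothetical shape the requirement is about 80.7 GB, which misses a single H100 and lands on the H200. A `None` result means the job needs multiple GPUs or a smaller footprint.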

GPU cloud mockup

Capacity lanes with proof status.

The marketplace view should make the purchase flow clear: trial, reserve, then scale up to a dedicated cluster with TEE readiness included.

H100: 80GB HBM3, from $3.08/hr
H200: 141GB HBM3e, from $4.80/hr
B300: 288GB HBM3e, from $6.50/hr

Verified: CVM runtime
Verified: GPU CC mode
Verified: dual attestation

GPU TEE proof path

What Phala handles for the CVM path.

GPU isolation is only useful when the full path (runtime, GPU mode, and evidence collection) is verifiable end to end. Phala delivers all three together.

                                                                                
                                                                                
                                                                                
                                                                                

CVM runtime

Docker workloads run in an Intel TDX confidential VM with GPU passthrough. The runtime is shielded from the operator and is measured by firmware before the workload starts.
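"Measured before the workload starts" generally means each boot component is hashed into a register that can only be extended, never overwritten, so the final value commits to every component and to their order. The sketch below shows that hash-chain idea with SHA-384 in the style of a measurement register. The component names are hypothetical, and this is not Phala's or Intel's actual measurement layout.

```python
import hashlib
from typing import Iterable

REGISTER_SIZE = 48  # SHA-384 digest length in bytes


def extend(register: bytes, component: bytes) -> bytes:
    """Fold one component into the register: SHA384(register || SHA384(component))."""
    digest = hashlib.sha384(component).digest()
    return hashlib.sha384(register + digest).digest()


def measure(components: Iterable[bytes]) -> bytes:
    """Start from an all-zero register and extend it with each component in order."""
    register = bytes(REGISTER_SIZE)
    for component in components:
        register = extend(register, component)
    return register


if __name__ == "__main__":
    # Hypothetical boot chain; any byte strings work.
    expected = measure([b"firmware-v1", b"kernel-v1", b"docker-compose.yml"])
    tampered = measure([b"firmware-v1", b"kernel-v1-patched", b"docker-compose.yml"])
    print(expected.hex()[:16], tampered.hex()[:16], expected == tampered)
```

A verifier that knows the expected components can recompute the register and compare it with the value reported in the attestation; any swapped or reordered component yields a different result.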

                                                                                
                                                                                

GPU CC mode

NVIDIA Confidential Computing seals model weights, activations, and KV cache in protected GPU memory. The GPU enforces compute isolation alongside the CPU TEE.

                                                                                
                                                                                
                                                                                

Dual attestation

Intel TDX and NVIDIA each issue a signed quote. Phala collects both and exposes them through a single verifier, so the CVM and the GPU jointly prove they are genuine.
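The acceptance rule behind dual attestation can be sketched as a small policy: trust the machine only if both pieces of evidence verify and both belong to the same request. The code below models just that decision. The `Evidence` type, the source labels, and the shared nonce check are assumptions for illustration; it does not call Phala's verifier or parse real Intel or NVIDIA quotes, and signature checking is represented by a boolean computed elsewhere.

```python
import secrets
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    source: str            # hypothetical labels: "intel-tdx" or "nvidia-gpu"
    signature_valid: bool  # outcome of vendor signature verification done elsewhere
    nonce: str             # freshness value the evidence was bound to (assumed)


def dual_attestation_ok(cpu: Evidence, gpu: Evidence, expected_nonce: str) -> bool:
    """Accept only when CPU and GPU evidence both verify for this request."""
    return (
        cpu.source == "intel-tdx"
        and gpu.source == "nvidia-gpu"
        and cpu.signature_valid
        and gpu.signature_valid
        and cpu.nonce == expected_nonce
        and gpu.nonce == expected_nonce
    )


if __name__ == "__main__":
    nonce = secrets.token_hex(16)  # fresh per verification request
    cpu = Evidence("intel-tdx", True, nonce)
    gpu = Evidence("nvidia-gpu", True, nonce)
    print(dual_attestation_ok(cpu, gpu, nonce))
```

Requiring both checks to pass means a genuine CVM paired with an unverified GPU, or evidence replayed from an earlier request, is rejected.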

Purchase routes

Start small. Reserve once it works.

The marketplace is organized around how AI builders actually buy GPUs: test quickly, reserve capacity once a workload is proven, then move to enterprise deals when the cluster becomes production-critical.

01 / On-demand

Test a confidential GPU within 24 hours.

Short test windows for builders validating private inference, model serving, or proof generation.

Trial now

02 / Slot

Reserve capacity before the next run.

Predictable GPU access for long-running training, fine-tuning, and benchmark windows.

Quote price

03 / Enterprise

Dedicated clusters with TEE operations.

Custom deals for H100, H200, or B300 with TEE-aware infrastructure support and deployment planning.

Contact sales

AI solution paths

Use GPU TEE where AI touches secrets.

GPU capacity is one part of the privacy boundary. The same confidential compute path supports private inference, agents, training, and data workflows.

LLM API

Private AI inference

Offer OpenAI-compatible model calls where prompts, outputs, and customer context need encryption-in-use protection.

Open solution

Agents

Private AI agents

Run agents with keys, tools, memory, and actions inside a verified runtime instead of a visible automation cloud.

Open solution

Training

Private model training

Adapt models on proprietary data while datasets, gradients, checkpoints, and evaluation traces stay inside the boundary.

Open solution

Data

Private AI data

Move models to sensitive records and return approved outputs without exposing raw data to the model operator.

Open solution

Private execution. Verifiable results.


GPU TEE Cloud — H100/H200/B300 Confidential AI | Phala