OpenRouter
enterprise tier · drop-in
“Drop-in OpenAI-compatible endpoint with verifiable, no-log routing. The receipt is the audit trail.”
18B+ tokens
no-log · verified routing
Private Inference
OpenAI-compatible. Signed receipts. By construction, no logs.
Private LLM catalog
OpenAI-compatible models with hardware-backed privacy and verification. Keep your SDK flow, change the endpoint, and copy the real call when you need it.
What you say to the model stays between your client and an attested CVM. Three primitives — encryption, TEE, no-logs — make that a property of the build, not a promise.
Toggle dstack off to see exactly which guarantee disappears.
Two-hop RA-TLS into a fleet of attested model CVMs — verifiable, no-log by construction
Client SDK fetches each candidate CVM’s TDX quote and runs dcap-qvl locally — confirms the build matches a no-log entry in DstackApp.sol. The trust decision is client-side; Phala is not asked to vouch for itself.
cURL · drop-in
Hit api.redpill.ai/v1/chat/completions with the OpenAI request shape. Receipt headers come back on every response — even from curl.
OpenAI Python SDK
`base_url="https://api.redpill.ai/v1"` and you’re done. Existing code keeps working; receipts attach to the response object.
One unified verifier
Whether the model runs on Intel TDX + H100 or AMD SEV + B300, the receipt format is identical. One verification path covers your whole TEE-LLM fleet — even when you mix providers.
phalaLlama 3.1
near aiDeepSeek V3
tinfoilQwen2.5
chutesMistralNo premium for privacy
Confidential routes through Phala on OpenRouter price the same as the open route. Privacy is no longer a procurement line item — just a header you opted into.
two-hop RA-TLS · X.509 with TDX-quote extension
tunneled · no plaintext intermediary
hop 01 · client → gateway
hop 02 · gateway → model CVM
The first TLS hop terminates inside the dstack-gateway CVM (whose certificate carries its TDX quote). The second terminates inside the model CVM. There is no plaintext intermediary — just two confidential VMs whose X.509 certificates ARE their attestations.
response · /v1/chat/completions
200Every response carries x-phala-receipt-sig + x-phala-compose-hash. The signature chains to the TDX root and the on-chain DstackApp.sol entry — verify offline that the build that ran is the build that was registered.
in production today · 3 live partners
OpenRouter routes its enterprise tier through Phala. NEAR AI ships verifiable agent inference. OODA AI runs decentralized GPU TEE.
enterprise tier · drop-in
“Drop-in OpenAI-compatible endpoint with verifiable, no-log routing. The receipt is the audit trail.”
18B+ tokens
no-log · verified routing
verifiable agent inference
“Verifiable agent inference for autonomous, on-chain workflows. Every model call lands on-chain with proof.”
100% receipts
on-chain verified · zk inference
NASDAQ-listed · decentralized GPUs
“Decentralized GPUs with hardware attestation guarantees. No host root, no off-band access, no policy promises.”
12M tokens / day
TDX + H100 · hardware-attested
OpenAI-compatible
drop-in /v1 surface
TDX + H100/H200/B300
CPU + GPU TEE
5–15% overhead
vs bare-metal
No host root
compose-hash IS the policy
AI solution paths
The private model endpoint is the first entry point. The same privacy primitive extends to agents, data workflows, and training.
Run agents with keys, tools, memory, and actions inside a verified runtime instead of a visible automation cloud.
Adapt models on proprietary data while keeping datasets, gradients, checkpoints, and evaluation traces inside the boundary.
private training run
01
sealed
02
running
03
private
04
verified
loss curve
proof attached
attestation.json
Move models to sensitive records and return approved outputs without exposing raw data to the model operator.
source
EHR data
source
Customer records
source
Internal docs
TEE clean room
approved output
Deploy private inference
Drop-in with the OpenAI SDK you already use. Point at api.redpill.ai. Get a signed receipt with every response.