Models
ReadyIntel TDX

OpenAI: GPT-4.1

Model IDopenai/gpt-4.1

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.

input

$2.00/M

output

$8.00/M

context

1.0M

created

Apr 15, 2025

Supported API shape

input

image · text · file

output

text

tools

Supported

json mode

Supported

Verification

signature

response ID

attestation

Intel TDX

provider

1 routes

Providers

openai

TDX route

input

$2.00/M

output

$8.00/M

context

1.0M

More models

Other private inference routes.

View catalog
encrypted

OpenAI: GPT OSS 20B

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.

context

131K

input

$0.04/M

encrypted

OpenAI: GPT OSS 120B

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.

context

131K

input

$0.10/M

encrypted

OpenAI: GPT-5.4 Mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

context

400K

input

$0.75/M

encrypted

OpenAI: GPT-5.5

GPT-5.5 is OpenAI's frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

context

1.1M

input

$5.00/M