Inference API

Change two lines of code. Get hardware-enforced confidentiality and a receipt for every request.

Optional Tresor SDK

Your OpenAI SDK works as-is. Ours verifies the enclave before a single byte leaves your machine.

Use the OpenAI SDK

import osfrom openai import OpenAIclient = OpenAI(    base_url="https://api.tresor.co/v1",    api_key=os.environ["TRESOR_API_KEY"],)resp = client.chat.completions.create(    model="eu/auto/gpt-oss-120b",    messages=[{"role": "user", "content": "Hello!"}],)print(resp.choices[0].message.content)

import OpenAI from "openai";const client = new OpenAI({  baseURL: "https://api.tresor.co/v1",  apiKey: process.env.TRESOR_API_KEY,});const resp = await client.chat.completions.create({  model: "eu/auto/gpt-oss-120b",  messages: [{ role: "user", content: "Hello!" }],});console.log(resp.choices[0]?.message?.content);

package mainimport (    "context"    "fmt"    "os"    openai "github.com/sashabaranov/go-openai")func main() {    cfg := openai.DefaultConfig(os.Getenv("TRESOR_API_KEY"))    cfg.BaseURL = "https://api.tresor.co/v1"    client := openai.NewClientWithConfig(cfg)    resp, err := client.CreateChatCompletion(        context.Background(),        openai.ChatCompletionRequest{            Model: "eu/auto/gpt-oss-120b",            Messages: []openai.ChatCompletionMessage{                {Role: "user", Content: "Hello!"},            },        },    )    if err != nil {        panic(err)    }    fmt.Println(resp.Choices[0].Message.Content)}

Use the Tresor SDK for attestation

import osimport httpxfrom openai import OpenAIfrom attest import AttestedTransportclient = OpenAI(    base_url="https://api.tresor.co/v1",    api_key=os.environ["TRESOR_API_KEY"],    http_client=httpx.Client(transport=AttestedTransport()),)resp = client.chat.completions.create(    model="eu/auto/gpt-oss-120b",    messages=[{"role": "user", "content": "Hello!"}],)print(resp.choices[0].message.content)

import { createAttestedFetch } from "@tresorhq/attest";import OpenAI from "openai";const client = new OpenAI({  baseURL: "https://api.tresor.co/v1",  apiKey: process.env.TRESOR_API_KEY,  fetch: createAttestedFetch(),});const resp = await client.chat.completions.create({  model: "eu/auto/gpt-oss-120b",  messages: [{ role: "user", content: "Hello!" }],});console.log(resp.choices[0]?.message?.content);

package mainimport (    "fmt"    "time"    "github.com/tresorhq/attest/go/attest")func main() {    releaseRoot, err := attest.LoadBundledReleaseRoot()    if err != nil {        panic(err)    }    tuple, err := attest.Verify(attest.Inputs{        Envelope:          envelope,  // GET /attestation        BundleJWS:         bundleJWS, // GET <trust_bundle_url>        ReleaseRootPubKey: releaseRoot,        LiveTLSSPKI:       spki, // SHA-256 from live TLS        Now:               time.Now(),    })    if err != nil {        panic(err)    }    fmt.Println(tuple.WorkloadIdentityTag)}

Familiar API. Verifiable runtime.

Every call goes through the same path: into an attested enclave, back with a receipt you can check.

Drop in your OpenAI client

Change two lines. Keep the rest.

Set base_url to api.trytresor.com/v1 and use a Tresor API key; chat completions, streaming, and transcription work exactly as your existing SDK expects. Compatible with Python, Node, Go, and any OpenAI client lib.

Code sample artwork for dropping the Tresor API into an OpenAI client.

Verify what's running first

See the enclave. Pin the runtime. Then send.

Fetch live attestation evidence, match it against the signed trust bundle, and pin TLS to the expected enclave so no request goes out until the destination checks out. GET /attestation. Trust bundle at /.well-known/trust.json.

Runtime verification artwork for the Inference API product page.

Signed receipts by default

Proof attached to the request, not promised after.

Every successful call returns a receipt_id for a JWS that binds the request and response to live attestation evidence. JWS/ES256. Verify with any JWT library or the tresorhq-attest SDK.

Signed receipt artwork for the Inference API product page.

Route on your terms

Route on purpose. Or let auto handle it.

Address compound IDs like lux/tresor/kimi-k2.5 to pin every detail, or pass auto to let the router pick from a set you've approved. Region/provider/model selection per request or per key.

Routing control artwork for the Inference API product page.

Set your own failover rules

Resilience without silent rerouting.

Declare ordered fallback routes per key or per request, and the router only switches within the alternatives you explicitly named. Every failover event is recorded in the receipt and usage log.

Failover policy artwork for the Inference API product page.

Keys per service or team

Separate environments without sharing secrets.

Create, name, and revoke keys per service or environment, and every call carries a key_prefix so usage stays attributable. Per-key attribution in the dashboard and usage API.

API key management artwork for the Inference API product page.

Powered by frontier models.

The best of open source, isolated and verifiable in Zero-Access TEEs.

See the catalog

API Plans

Developer

Self-serve API access that scales with your usage.

Pay as you go

prepaid credits, no subscription

Instant self-serve activation
Prepaid credits from €5
Optional auto-recharge

Pay by card
Standard rate limits
Dashboard, invoices & receipts

Start Building

Enterprise

Custom contracts with negotiated rates, SLA, and compliance.

Custom

monthly commitment + overage

Negotiated model discounts
Committed spend
Metered overage

Custom rate limits
Invoice billing
SLA & priority support

Talk to Sales

API Pricing

Live route catalogue as used by current API calls.

deepseek-v4-pro

Chat

1 route available

Route ID	Input	Output
global/tinfoil/deepseek-v4-pro	$1.50/M	$5.25/M

gemma-4-31b

Chat

2 routes available

Route ID	Input	Output
eu/privatemode/gemma-4-31b	€0.77/M	€1.27/M
global/tinfoil/gemma-4-31b	$0.45/M	$1.00/M

glm-5.2

Chat

1 route available

Route ID	Input	Output
global/redpill/glm-5.2	$1.40/M	$4.40/M

gpt-oss-120b

Chat

2 routes available

Route ID	Input	Output
eu/privatemode/gpt-oss-120b	€0.43/M	€1.70/M
global/tinfoil/gpt-oss-120b	$0.15/M	$0.60/M

gpt-oss-20b

Chat

1 route available

Route ID	Input	Output
global/redpill/gpt-oss-20b	$0.04/M	$0.15/M

kimi-k2.6

Chat

3 routes available

Route ID	Input	Output
eu/privatemode/kimi-k2.6	€1.55/M	€7.74/M
global/chutes/kimi-k2.6	$0.95/M	$4.00/M
global/tinfoil/kimi-k2.6	$1.50/M	$5.25/M

llama3-3-70b

Chat

1 route available

Route ID	Input	Output
global/tinfoil/llama3-3-70b	$1.75/M	$2.75/M

mistral-24b-uncensored

Chat

1 route available

Route ID	Input	Output
global/redpill/mistral-24b-uncensored	$0.20/M	$0.90/M

qwen-2.5-7b-instruct

Chat

1 route available

Route ID	Input	Output
global/redpill/qwen-2.5-7b-instruct	$0.04/M	$0.10/M

qwen3.5-27b

Chat

1 route available

Route ID	Input	Output
global/redpill/qwen3.5-27b	$0.30/M	$2.40/M

voxtral-mini-3b

Transcription

1 route available

Route ID	Input	Output
eu/privatemode/voxtral-mini-3b	—	€0.0040/min

voxtral-small-24b

Transcription

1 route available

Route ID	Input	Output
global/tinfoil/voxtral-small-24b	$0.20/M	$0.60/M

whisper-large-v3

Transcription

1 route available

Route ID	Input	Output
eu/privatemode/whisper-large-v3	—	€0.014/min

whisper-large-v3-turbo

Transcription

1 route available

Route ID	Input	Output
global/tinfoil/whisper-large-v3-turbo	—	$0.010/req

API Questions

Privacy without the trade-off.

Public AI tools read everything you send. On-prem is private but impractical. Tresor gives you both: cloud convenience, infrastructure-grade privacy.

Powerful Models

Public Cloud AI

On-Prem

Tresor AI

EU-hosted

Public Cloud AI

On-Prem

Tresor AI

Zero-Access

Public Cloud AI

On-Prem

Tresor AI

No Ops

Public Cloud AI

On-Prem

Tresor AI

No Setup

Public Cloud AI

On-Prem

Tresor AI

Verifiable Proof

Public Cloud AI

On-Prem

Tresor AI

Capability

Public Cloud AI
(ChatGPT, Claude, etc)

On-
Prem

Tresor
AI

Powerful Models

EU-hosted

Zero-Access

No Ops

No Setup

Verifiable Proof

Trusted by people who don't trust easily.

I can only use AI in coaching when it fully protects the sacred trust between practitioner and client; confidentiality is non-negotiable.

Tove Thyes

Transformational Coach and Energy Medicine Practitioner

As a software strategist, I treat privacy as the blueprint that lets my team turn client visions into AI people can trust.

Igor Miazek

CEO & Founder, Techs

At Dance, we build experiences people trust. Tresor’s approach to privacy-first AI is a simply a great match.

Christian Springub

CEO & Co-Founder, Dance

As a therapist, I can only use AI that protects client privacy. Tresor delivers exactly that.

Magali Cahen

Psychologist and Therapist, Independent

The only way to earn trust in AI is to make privacy a design principle, not a feature. Tresor’s system does exactly that.

Ingmar Schuster

CEO & Co-Founder, Provolut

In the AI era, rigorous risk-based security and GDPR-aligned privacy are non-negotiable foundations for any trustworthy system.

Davy Cox

Founder, Brainframe

Understand the technical details.

Zero-Access AI Conversations:
How Tresor Protects Your Privacy

Executive Summary

Tresor is built on a simple promise: your conversations belong to you, not us. Every message you type is protected by end-to-end encryption and processed only inside secure computing environments that even Tresor cannot inspect. Teams can now collaborate inside shared workspaces without ever handing Tresor access to their plaintext. This whitepaper explains the principles and safeguards behind Tresor’s zero-access design, showing how we deliver practical confidentiality without trade-offs in usability.

Download White Paper

Inference API

Optional Tresor SDK

Use the OpenAI SDK

Use the Tresor SDK for attestation

Familiar API. Verifiable runtime.

Drop in your OpenAI client

Verify what's running first

Signed receipts by default

Route on your terms

Set your own failover rules

Keys per service or team

Powered by frontier models.

API Plans

Developer

Enterprise

API Pricing

deepseek-v4-pro

gemma-4-31b

glm-5.2

gpt-oss-120b

gpt-oss-20b

kimi-k2.6

llama3-3-70b

mistral-24b-uncensored

qwen-2.5-7b-instruct

qwen3.5-27b

voxtral-mini-3b

voxtral-small-24b

whisper-large-v3

whisper-large-v3-turbo

API Questions

How is the Inference API priced?

Does the price depend on which model I use?

How do I track usage and costs?

Can I verify my bill against actual usage?

What's the difference between Standard and Enterprise pricing?

Privacy without the trade-off.

Powerful Models

EU-hosted

Zero-Access

No Ops

No Setup

Verifiable Proof

Trusted by people who don't trust easily.

Understand the technical details.

Zero-Access AI Conversations: How Tresor Protects Your Privacy

Frequently asked questions.

What is Tresor?

Should I use the Workspace or the API?

How much does Tresor cost?

How is my data protected?

What happens if Tresor is breached?

Is my data used to train AI models?

Which AI models can I use?

Can my team collaborate without compromising privacy?

Is there a free trial?

How do I verify Tresor's claims for myself?

Zero-Access AI Conversations:
How Tresor Protects Your Privacy