Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Supported Models

This page lists all models supported by the Archia server. Use the Model ID value when specifying a model in the Responses API or in agent configuration files.


Remote Models

Remote models are hosted by their respective providers and require API keys to be configured.

OpenAI

Model IDDisplay NameDescriptionCapabilities
gpt-5.2GPT-5.2OpenAI’s most advanced modelnlp, vision, tools, reasoning
gpt-5.1GPT-5.1Improved GPT-5 with better reasoning, coding, and visionnlp, vision, tools, reasoning
gpt-5GPT-5Flagship general model for coding, reasoning, and agentic tasksnlp, vision, tools, reasoning
gpt-5-miniGPT-5 miniFast, affordable small model for focused tasksnlp, vision
gpt-5-nanoGPT-5 nanoSmallest GPT-5 variant; optimized for cost and latencynlp
gpt-4.1GPT-4.1Improved GPT-4 with better reasoning, coding, and visionnlp, vision, tools, reasoning
gpt-4oGPT-4oOmni multimodal model; strong general performance and tool callingnlp, vision, tools
gpt-4o-miniGPT-4o miniFast, affordable small model for focused tasksnlp, vision
o4-miniOpenAI o4-miniLatest small o-series model optimized for efficient reasoning and codingnlp, vision, reasoning

Google

Model IDDisplay NameDescriptionCapabilities
gemini-3-pro-previewGemini 3.0 ProGoogle’s flagship Gemini 3.0 Pro model with advanced reasoning and multimodal capabilitiesnlp, vision, tools, reasoning
gemini-2.5-proGemini 2.5 ProGoogle’s high-performance model with strong reasoning and multimodal capabilitiesnlp, vision, tools, reasoning
gemini-2.5-flashGemini 2.5 FlashGoogle’s lightweight model optimized for speed and efficiencynlp, vision, tools, reasoning

Anthropic

Model IDDisplay NameDescriptionCapabilities
claude-opus-4-5-20251101Opus 4.5Claude Opus 4.5 Premium model combining maximum intelligence with practical performancenlp, vision, tools, reasoning
claude-opus-4-1-20250805Opus 4.1Latest flagship Claude with strongest reasoning/coding and visionnlp, vision, reasoning, tools
claude-sonnet-4-5-20250929Sonnet 4.5Claude 4.5: improved reasoning, coding, and multimodal capabilities with lower latencynlp, vision, reasoning, tools
claude-sonnet-4-20250514Sonnet 4High-performance Claude with 200k–1M context and visionnlp, vision, reasoning, tools
claude-3-7-sonnet-20250219Sonnet 3.7Fast, capable Claude with extended thinking optionsnlp, vision, reasoning, tools
claude-haiku-4-5-20251001Haiku 4.5Anthropic’s fastest model with near-frontier intelligencenlp, vision, tools
claude-3-5-haiku-20241022Haiku 3.5Small/fast Claude for chat + basic visionnlp, vision, tools

Private Models (No Data Retention)

Private models route through Archia’s secure infrastructure with zero data retention guarantees. These are ideal for sensitive workloads where data privacy is critical.

Anthropic (Private)

Model IDDisplay NameDescriptionCapabilities
priv-claude-opus-4-5-20251101Opus 4.5 (Private)Claude Opus 4.5 with no data retentionnlp, vision, tools, reasoning
priv-claude-opus-4-1-20250805Opus 4.1 (Private)Latest Claude Opus with no data retentionnlp, vision, reasoning, tools
priv-claude-sonnet-4-5-20250929Sonnet 4.5 (Private)Claude 4.5 with no data retentionnlp, vision, tools, reasoning
priv-claude-sonnet-4-20250514Sonnet 4 (Private)Claude Sonnet 4 with no data retentionnlp, vision, reasoning, tools
priv-claude-3-7-sonnet-20250219Sonnet 3.7 (Private)Fast, capable Claude with no data retentionnlp, vision, reasoning, tools
priv-claude-haiku-4-5-20251001Haiku 4.5 (Private)Anthropic’s fastest model with no data retentionnlp, vision, tools, reasoning
priv-claude-3-5-haiku-20241022Haiku 3.5 (Private)Fast/cheap Claude with no data retentionnlp, vision, tools

Open Source (Private)

Model IDDisplay NameDescriptionCapabilities
archia::openai/gpt-oss-20bGPT OSS 20BOpen-source 20B parameter model hosted on Archia infrastructurevision, nlp, reasoning
archia::openai/gpt-oss-120bGPT OSS 120BOpen-source 120B parameter model with enhanced capabilitiesvision, nlp, reasoning
archia::moonshotai/kimi-k2-instruct-0905Kimi K2Open source model from Moonshot AI (~1 trillion parameters)vision, nlp, reasoning

Local Models

Local models run directly on your machine using llama.cpp. They require downloading the model weights and sufficient system memory.

Model IDDisplay NameDescriptionCapabilitiesSizeMin Memory
gpt-oss-20b-F16.ggufgpt-oss-20bOpen-source 20B parameter model optimized for performancevision, nlp, reasoning12 GB32 GB
gpt-oss-120b-F16.ggufgpt-oss-120bOpen-source 120B parameter model for complex reasoning tasksvision, nlp, reasoning60 GB100 GB
Qwen3-4B-Instruct-2507-F16.ggufqwen3-4bOpen-source 4B instruction-tuned model for general tasksnlp8 GB32 GB
Qwen3-4B-Q5_K_S.ggufqwen3-4b-think4B model with enhanced thinking, 5-bit quantizednlp8 GB16 GB
Qwen3-32B-Q5_K_S.ggufqwen3-32b-think32B reasoning model with advanced thinking depthnlp22 GB32 GB

Capabilities

Models are tagged with capabilities that indicate their supported features:

CapabilityDescription
nlpNatural language processing (text generation, understanding)
visionImage understanding and analysis
toolsFunction/tool calling support
reasoningExtended thinking/reasoning support (see Reasoning)

Usage Examples

Responses API

{
  "model": "claude-sonnet-4-5-20250929",
  "input": "Hello, how can you help me today?"
}

Agent Configuration

In your agent TOML configuration file:

[assistant]
model = "gpt-5.2"
system_prompt = "You are a helpful assistant."

Using Private Models

For sensitive workloads, use the private model variants:

{
  "model": "priv-claude-sonnet-4-5-20250929",
  "input": "Process this confidential document..."
}

Model Selection Guide

Use CaseRecommended Models
General tasksgpt-5.2, claude-sonnet-4-5-20250929, gemini-2.5-pro
Fast responsesgpt-5-mini, claude-haiku-4-5-20251001, gemini-2.5-flash
Complex reasoningclaude-opus-4-5-20251101, gpt-5.2 with reasoning, o4-mini
Cost optimizationgpt-5-nano, claude-3-5-haiku-20241022, gpt-4o-mini
Privacy-sensitivepriv-claude-* models, archia::* models
Offline/localgpt-oss-20b-F16.gguf, Qwen3-* models
Vision tasksAny model with vision capability

Next Steps