Skip to main content
A Model is a specific LLM your agents can pick — main engine, secondary engine, or as the tenant’s text / multimodal default. Each model belongs to a Provider. Models are managed in Command Center → Settings → Models.
Models catalog

Fields

FieldRequiredDescription
ProviderYesThe provider that serves this model
Model IDYesMust match the model’s deployment name on the provider exactly — this is the identifier sent to the provider’s API, not a label you choose
KeyNoDisplay name shown in Command Center and referenced by agents when selecting an engine. Auto-derived from the Model ID if blank. Edit this field to rename
Context lengthYesMaximum tokens the model accepts. Drives context-management guidance in Engine & Agents
Accepts image inputNoEnable for vision-capable models. Image-capable models become selectable as the tenant’s Multimodal model
EnabledNoOnly enabled models appear as selectable engines. Disable to retire without losing configuration or history
Advanced sampling parameters (temperature, top P, top K, presence/frequency penalty, max output tokens, seed, stop sequences) can be set per model. You can also record pricing plans per model and run an on-demand performance test that reports reachability, time to first token, and throughput.

Thinking effort

A thinking-effort level trades depth of reasoning against latency and cost:
LevelDescription
OffNo reasoning tokens — fastest, lowest cost
MinimalVery light reasoning
LowBasic reasoning pass
MediumBalanced reasoning depth
HighDeep reasoning for complex tasks
xHighMaximum reasoning; highest cost
Some custom or compatible providers do not accept a reasoning argument and reject any request that includes one. If your provider does not support reasoning, leave thinking effort unset — do not select any level.

Tenant defaults

The tenant’s default Text LLM model and Multimodal model are chosen in Settings → Tenant from the catalog’s enabled models. Only image-capable models appear as Multimodal options. These defaults power gateway routing, the AI Assistant, knowledge ingestion, and any agent that doesn’t pick its own engines. See Engine & Agents for how agents combine main and secondary models, and where the tenant defaults are used.

Deletion rule

A model assigned to any agent — main or secondary — cannot be deleted. Reassign those agents or disable the model instead.

Deploy-time seeding

On deploy, the installer seeds models from the providers detected in the engine environment. After that, management moves to Command Center — no engine restart needed.