Skip to content

Models

Models define the language models that power agent conversations. Evonic supports multiple model providers and local inference engines.

RunnerBest ForHardware Requirements
OllamaQuick setup, multi-model supportModerate (CPU/GPU)
llama.cppMaximum portability, edge devicesLow to Moderate
vLLMHigh-throughput production workloadsHigh (GPU recommended)

Models are managed via:

  • CLI: evonic model list, evonic model add, evonic model rm
  • API: REST endpoints for programmatic management
  • Web UI: Manage models from the Models page (/system/models)

Each model has a provider (e.g., openai, anthropic, llama.cpp), a base URL, API key, and configuration options like max tokens, timeout, and temperature.

Introduced in v0.7.0.

The LLM client now supports translating between OpenAI and Anthropic API formats, configurable per-model via an API Format dropdown in the model modal.

FormatDescription
OpenAIStandard chat completions format (default for most providers)
AnthropicAnthropic’s native Messages API format

When you set a model’s API format to Anthropic, the LLM client:

  1. Takes the internal OpenAI-format request (messages, tools, system prompt)
  2. Translates it to Anthropic’s Messages API format
  3. Sends the translated request to the model’s base URL
  4. Translates the Anthropic-format response back to the internal format

This means you can connect Claude and other Anthropic-compatible models natively without a proxy. Just set the base URL to an Anthropic-compatible endpoint and switch the API format to Anthropic.

  1. Go to System → Models (/system/models)
  2. Create a new model or edit an existing one
  3. Find the API Format dropdown in the model form
  4. Select Anthropic for Anthropic-compatible models
  5. Configure the provider, base URL, and API key as usual

The format is set per-model, so you can mix OpenAI-format and Anthropic-format models in the same deployment.

Introduced in v0.2.6.

Instead of creating a new model from scratch, you can duplicate an existing model and tweak its settings. This is useful when you want to:

  • Create a variant of a model with different parameters (e.g., higher temperature)
  • Set up the same model with a different API key or base URL
  • Copy a model from one provider setup to another
  1. Go to System → Models (/system/models)
  2. Find the model you want to duplicate
  3. Click the Duplicate button (copy icon)
  4. A new model form opens pre-filled with the original’s settings
  5. Adjust the settings as needed (name, provider, parameters, etc.)
  6. Click Save
Terminal window
evonic model clone <source_model_id> --id <new_model_id> [--name "New Name"]

Example:

Terminal window
evonic model clone gpt-4 --id gpt-4-hightemp --name "GPT-4 High Temp" --temperature 0.9

This clones the gpt-4 model configuration and creates a new model called gpt-4-hightemp with a higher temperature setting.

AspectCloneCreate from scratch
TimeInstant — all fields pre-filledManual entry of every field
Error-proneLow — settings are copiedHigh — easy to mistype config
Use caseTweaking parametersAdding completely new providers

Introduced in v0.2.0.

Evonic supports the following additional model providers beyond the standard OpenAI-compatible endpoints:

ProviderDescriptionConfiguration
OpenCode ZenOptimized for code generation and analysisSet provider to opencode-zen in model config
OpenCode GoLightweight code-focused model for edge devicesSet provider to opencode-go in model config
Ollama CloudCloud-hosted Ollama models for remote inferenceSet provider to ollama-cloud with your API key

These providers follow the same configuration structure as other models but may have provider-specific parameters available in the Advanced Settings section.