Agent API Plugin

Overview

The Agent API plugin exposes Evonic agents through an OpenAI-compatible REST API. This allows external applications to interact with your agents using the familiar /v1/chat/completions endpoint pattern, with bearer-token authentication, quota management, and model-scoping.

Installation

The plugin is available as a built-in plugin under plugins/agentapi/. Enable it from the Plugins page in the Web UI or via the CLI.

How It Works

The plugin creates an OpenAI-compatible API layer that maps external model names to your internal Evonic agents:

External client calls /plugin/agentapi/v1/chat/completions with a model name like gpt-4-assistant
The plugin looks up the model → agent mapping to find which Evonic agent to use
The request is forwarded to the mapped agent
The agent processes the request and returns a response in OpenAI-compatible format

Session Behavior

Introduced in v0.3.43.

The Agent API is stateless by default — each request is treated as an independent conversation. The agent does not remember previous requests, and no conversation history is preserved between calls.

Stateful Sessions

To enable stateful sessions (where the agent remembers the conversation), include the X-Session-Id header with a unique session identifier:

X-Session-Id: <your-session-id>

When you use a session ID:

The agent maintains the conversation history across requests
Each request appends to the same conversation, so the agent remembers previous messages
The session persists until explicitly cleared or expired

This is useful for multi-turn conversations, like chatbots that need to remember context from earlier messages.

Stateless Usage (Default)

Without a session ID, each request is isolated — the agent has no memory of previous calls. This is ideal for:

One-off queries where context isn’t needed
Load-balanced scenarios where requests may go to different instances
Simple API integrations that only need a single response

Configuration

Model → Agent Mapping

The MODEL_AGENT_MAP variable defines how external model names map to internal agent IDs:

{
  "gpt-4-assistant": "linus",
  "claude-assistant": "siwa",
  "custom-agent": "my_agent"
}

Each key is a public-facing model name (used in API requests), and each value is the ID of an Evonic agent.

API Endpoints

POST `/plugin/agentapi/v1/chat/completions`

Send a chat completion request to an agent.

Headers:

Authorization: Bearer <api_token>
Content-Type: application/json
X-Session-Id: <session-id>         # Optional — enables stateful session

Request body:

{
  "model": "gpt-4-assistant",
  "messages": [
    {"role": "user", "content": "Hello, how are you?"}
  ],
  "temperature": 0.7
}

Parameters:

Parameter	Required	Description
`model`	Yes	The public model name (maps to an agent via `MODEL_AGENT_MAP`)
`messages`	Yes	Array of message objects with `role` and `content`
`temperature`	No	Sampling temperature (passed to the underlying model)

GET `/plugin/agentapi/v1/models`

List all available model names (agent mappings).

Returns the keys from MODEL_AGENT_MAP as available models.

Token Management

Introduced in v0.3.19.

The Agent API now provides a token management UI for creating, editing, deleting, and inspecting API tokens.

Creating Tokens

Go to the Agent API plugin settings in the Web UI
Navigate to the Tokens tab
Click Create Token
Set a label/name for the token (e.g., “Production App”)
Optionally set a usage quota
Copy the generated token — it will not be shown again

Token Features

Feature	Description
Create	Generate new bearer tokens with optional labels
Edit	Update token labels or quotas
Delete	Revoke tokens to block access
Inspect	View token metadata (created date, last used, usage count)

Token Usage

Tokens are passed as Bearer tokens in the Authorization header:

Authorization: Bearer evo_abc123def456...

Quota Management

Each token can have an optional usage quota, limiting how many requests it can make. This is useful for:

Rate-limiting API access for specific applications
Controlling usage before billing
Preventing runaway scripts

Example: Using the Agent API

Stateless Request (Default)

Each request is independent — the agent has no memory of previous calls.

With cURL

curl -X POST https://your-evonic-instance.com/plugin/agentapi/v1/chat/completions \
  -H "Authorization: Bearer evo_abc123def456" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4-assistant",
    "messages": [
      {"role": "user", "content": "What is the weather today?"}
    ]
  }'

With Python

import requests

response = requests.post(
    "https://your-evonic-instance.com/plugin/agentapi/v1/chat/completions",
    headers={
        "Authorization": "Bearer evo_abc123def456",
        "Content-Type": "application/json",
    },
    json={
        "model": "gpt-4-assistant",
        "messages": [{"role": "user", "content": "Hello!"}],
    },
)

print(response.json())

Stateful Request (with `X-Session-Id`)

Include the X-Session-Id header to maintain conversation history across requests. The agent will remember previous messages in the same session.

With cURL

# First message
curl -X POST https://your-evonic-instance.com/plugin/agentapi/v1/chat/completions \
  -H "Authorization: Bearer evo_abc123def456" \
  -H "Content-Type: application/json" \
  -H "X-Session-Id: my-chat-session-1" \
  -d '{
    "model": "gpt-4-assistant",
    "messages": [
      {"role": "user", "content": "My name is Alice."}
    ]
  }'

# Second message — the agent remembers "Alice" from the first call
curl -X POST https://your-evonic-instance.com/plugin/agentapi/v1/chat/completions \
  -H "Authorization: Bearer evo_abc123def456" \
  -H "Content-Type: application/json" \
  -H "X-Session-Id: my-chat-session-1" \
  -d '{
    "model": "gpt-4-assistant",
    "messages": [
      {"role": "user", "content": "What is my name?"}
    ]
  }'

With Python

import requests

session_id = "my-chat-session-1"
base_url = "https://your-evonic-instance.com/plugin/agentapi/v1/chat/completions"
headers = {
    "Authorization": "Bearer evo_abc123def456",
    "Content-Type": "application/json",
    "X-Session-Id": session_id,
}

# First message
response = requests.post(
    base_url,
    headers=headers,
    json={
        "model": "gpt-4-assistant",
        "messages": [{"role": "user", "content": "My name is Alice."}],
    },
)
print(response.json())

# Second message — the agent remembers the conversation
response = requests.post(
    base_url,
    headers=headers,
    json={
        "model": "gpt-4-assistant",
        "messages": [{"role": "user", "content": "What is my name?"}],
    },
)
print(response.json())  # The agent should reply "Alice"

Security

Bearer token authentication — all requests must include a valid token
Per-token quotas — limit usage per application
Token revocation — instantly block compromised tokens
Model scoping — agents are only accessible through their mapped model name

Agent API Plugin

Overview

Installation

How It Works

Session Behavior

Stateful Sessions

Stateless Usage (Default)

Configuration

Model → Agent Mapping

API Endpoints

POST /plugin/agentapi/v1/chat/completions

GET /plugin/agentapi/v1/models

Token Management

Creating Tokens

Token Features

Token Usage

Quota Management

Example: Using the Agent API

Stateless Request (Default)

With cURL

With Python

Stateful Request (with X-Session-Id)

With cURL

With Python

Security

POST `/plugin/agentapi/v1/chat/completions`

GET `/plugin/agentapi/v1/models`

Stateful Request (with `X-Session-Id`)