Database Schema

The application uses a single SQLite database (evonic.db) with tables for evaluation results, test definitions, and the agent platform.

Evaluation Tables

Top-level table for each evaluation run.

Column	Type	Description
`run_id`	TEXT PK	UUID identifier
`started_at`	DATETIME	When the run started
`completed_at`	DATETIME	When the run finished (null if in progress)
`model_name`	TEXT	Model label for this run
`summary`	TEXT	Run summary text
`overall_score`	REAL	Aggregate score (0.0-1.0)
`total_tokens`	INTEGER	Total tokens consumed
`total_duration_ms`	INTEGER	Total wall-clock time

Aggregated results per domain/level.

Column	Type	Description
`id`	INTEGER PK	Auto-increment
`run_id`	TEXT FK	References `evaluation_runs`
`domain`	TEXT	Domain identifier
`level`	INTEGER	Complexity level (1-5)
`score`	REAL	Aggregate score for this cell
`status`	TEXT	`passed`, `failed`, `running`, `pending`
`prompt`	TEXT	Last prompt (legacy)
`response`	TEXT	Last response (legacy)

Per-test results with full details.

Aggregated scores per domain/level.

Cached domain metadata from domain.json files.

Per-level configuration.

Individual test definitions.

Evaluator configuration registry.

Column	Type	Description
`id`	TEXT PK	Evaluator identifier
`type`	TEXT	`regex`, `custom`, `hybrid`
`eval_prompt`	TEXT	LLM evaluation prompt template
`extraction_regex`	TEXT	Regex pattern for score extraction
`uses_pass2`	BOOLEAN	Whether to use two-pass extraction
`config`	TEXT (JSON)	Additional configuration

Tool definition registry.

Column	Type	Description
`id`	TEXT PK	Tool identifier
`name`	TEXT	Display name
`function_def`	TEXT (JSON)	OpenAI function schema
`mock_response`	TEXT	Mock response for evaluation
`mock_response_type`	TEXT	`json` or `javascript`

Agent definitions.

Column	Type	Description
`id`	TEXT PK	Slug identifier (e.g., `bookstore_bot`)
`name`	TEXT	Display name
`description`	TEXT	Short description
`system_prompt`	TEXT	Agent persona and instructions
`model`	TEXT	Model override (null = use default)
`created_at`	TIMESTAMP	Creation time
`updated_at`	TIMESTAMP	Last update time

Many-to-many mapping of agents to tools.

Column	Type	Description
`agent_id`	TEXT PK, FK	References `agents`
`tool_id`	TEXT PK	Tool identifier

Per-agent channel configurations.

Column	Type	Description
`id`	TEXT PK	UUID identifier
`agent_id`	TEXT FK	References `agents`
`type`	TEXT	`telegram`, `whatsapp`, `discord`
`name`	TEXT	Display name
`config`	TEXT (JSON)	Channel-specific config (e.g., bot token)
`enabled`	BOOLEAN	Whether the channel is active

Per-user conversation sessions.

Column	Type	Description
`id`	TEXT PK	UUID identifier
`agent_id`	TEXT FK	References `agents`
`channel_id`	TEXT FK	References `channels` (nullable for web chat)
`external_user_id`	TEXT	User identifier from the channel

Sessions are uniquely identified by the tuple (agent_id, channel_id, external_user_id).

Conversation message history.

Column	Type	Description
`id`	INTEGER PK	Auto-increment
`session_id`	TEXT FK	References `chat_sessions`
`role`	TEXT	`user`, `assistant`, `tool`, `system`
`content`	TEXT	Message content
`tool_calls`	TEXT (JSON)	Tool call objects (for assistant messages)
`tool_call_id`	TEXT	Tool call ID (for tool result messages)
`created_at`	TIMESTAMP	Message timestamp