Configuration Reference

This page documents all properties available in the YAML frontmatter of an AGENT.md file.

Frontmatter property names can be written in camelCase, snake_case, or kebab-case. We recommend kebab-case for readability. All examples on this page use kebab-case.
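
The equivalence between the three spellings can be pictured as a normalization step. The sketch below is illustrative only: `to_kebab_case` is a hypothetical helper, and this page does not say how Stencila itself normalizes property names.

```python
import re


def to_kebab_case(key: str) -> str:
    """Map a camelCase, snake_case, or kebab-case key to kebab-case.

    Hypothetical helper for illustration; not part of Stencila.
    """
    # snake_case -> kebab-case
    key = key.replace("_", "-")
    # camelCase -> insert a hyphen before each interior capital letter
    key = re.sub(r"(?<=[a-z0-9])([A-Z])", r"-\1", key)
    return key.lower()


# All three spellings of the same property collapse to one key
print(to_kebab_case("whenNotToUse"))
print(to_kebab_case("when_not_to_use"))
print(to_kebab_case("when-not-to-use"))
```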

Required Properties

`name`

Type: string — Required

The name of the agent. Must be lowercase kebab-case: 1–64 characters, only lowercase alphanumeric characters and hyphens, no leading/trailing/consecutive hyphens.

name: code-engineer
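
The naming rule above can be expressed as a single pattern plus a length check. This is a minimal sketch of the stated constraints, assuming "alphanumeric" means ASCII letters and digits; `is_valid_agent_name` is a hypothetical helper, not Stencila's validator.

```python
import re

# One or more lowercase alphanumeric runs joined by single hyphens.
# This rules out leading, trailing, and consecutive hyphens.
NAME_PATTERN = re.compile(r"[a-z0-9]+(?:-[a-z0-9]+)*")


def is_valid_agent_name(name: str) -> bool:
    """Check a name against the rule stated above (hypothetical helper)."""
    return 1 <= len(name) <= 64 and NAME_PATTERN.fullmatch(name) is not None
```

Under these assumptions, `code-engineer` passes, while `Code-Engineer`, `-code`, `code-`, and `code--engineer` are rejected.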

`description`

Type: string — Required

A brief description of the agent. Must be non-empty and at most 1024 characters.

description: A general-purpose coding agent for software engineering tasks

Discovery and Delegation

`keywords`

Type: string[]

Keywords or tags for discovery and routing. Helps managers find and rank this agent when deciding which agent to delegate to.

keywords:
  - code
  - review
  - security

`when-to-use`

Type: string[]

Positive selection signals describing when this agent should be used. Each entry is a short sentence describing a scenario where this agent is the right choice.

when-to-use:
  - when the user asks to review or audit code
  - when a pull request needs automated review

`when-not-to-use`

Type: string[]

Negative selection signals describing when this agent should not be used. Helps managers avoid delegating to the wrong agent.

when-not-to-use:
  - when the user wants to write or generate new code
  - when the task is refactoring rather than review

Models and Providers

`models`

Type: string[]

Ordered list of preferred model identifiers. Stencila tries each model in order until one can be routed with the currently configured credentials. If none of the listed models can be routed, routing stops with an error unless the list includes `any`.

You can include the special value `any` in the list to allow fallback to the next routing stage instead of failing immediately. In practice, Stencila tries the named models first and, if none can be used, continues with `model-size`, then `providers`, then defaults.

models:
  - claude-sonnet-4-5
  - gpt-5.2-codex
  - any

Use `models` when you want ordered preferences across specific model IDs. Use the singular `model` when you only need one explicit model.
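
The fallback behavior of the list can be sketched as a small function. This is an illustration under stated assumptions, not Stencila's router: `route_by_models` and the `routable` set (standing in for "can be routed with the currently configured credentials") are hypothetical.

```python
from typing import Optional, Sequence, Set


def route_by_models(models: Sequence[str], routable: Set[str]) -> Optional[str]:
    """Return the first routable named model.

    Returns None when no named model is routable but the list contains
    "any", meaning the caller should continue with the next routing stage.
    Raises LookupError when there is no such fallback.
    """
    # Named models are tried first, in order; "any" is not a model ID
    for model in models:
        if model != "any" and model in routable:
            return model
    if "any" in models:
        return None  # fall through to model-size, then providers, then defaults
    raise LookupError("none of the listed models can be routed")
```

With the example list above and credentials for only the second model, the sketch returns `gpt-5.2-codex`; with no usable credentials it returns `None` because the list ends in `any`.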

`providers`

Type: string[]

Ordered list of preferred providers. Stencila tries providers in order and uses the default model for the first one with valid credentials.

You can include the special value `any` in the list to allow fallback to the default provider-selection behavior if none of the named providers are available.

providers:
  - anthropic
  - openai
  - any

Use `providers` when you want ordered provider preferences without pinning the agent to a specific model.

`model-size`

Type: string

Preferred model size tier, e.g. `small`, `medium`, or `large`. Use this when you want to select a broad class of model rather than naming a specific model ID.

Stencila treats model size as a cross-provider classification. It groups provider models into broad size tiers so you can express intent like “use a small, fast, cheap model” or “use a larger, more capable model” without tightly coupling the agent to one provider's naming scheme.

These size tiers are a Stencila abstraction, not a provider-standard guarantee. A small model from one provider may differ from a small model from another provider in latency, price, or capability. The goal is approximate normalization of those tradeoffs across providers.

When both `providers` and `model-size` are set, Stencila first constrains selection to the specified providers, then selects a model in the preferred size tier. If `models` is also set, the explicit `models` list takes precedence over `model-size`.

model-size: medium
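
The interaction between `providers` and `model-size` can be sketched as a filter over a model catalog. Everything here is an assumption made for illustration: the `CatalogEntry` type, the placeholder catalog, the provider and model names, and the choice to honor provider order when several providers offer the requested tier. Handling of `any` and of an explicit `models` list is left out.

```python
from typing import NamedTuple, Optional, Sequence


class CatalogEntry(NamedTuple):
    model_id: str
    provider: str
    size: str  # "small", "medium", or "large"


def select_by_size(
    catalog: Sequence[CatalogEntry],
    providers: Sequence[str],
    model_size: str,
) -> Optional[str]:
    """Pick a model in the requested size tier from the listed providers."""
    # Constrain to the listed providers (in preference order),
    # then look for a model in the requested tier.
    for provider in providers:
        for entry in catalog:
            if entry.provider == provider and entry.size == model_size:
                return entry.model_id
    return None


# Placeholder catalog; these are not real provider or model names
CATALOG = [
    CatalogEntry("provider-a-small", "provider-a", "small"),
    CatalogEntry("provider-a-large", "provider-a", "large"),
    CatalogEntry("provider-b-medium", "provider-b", "medium"),
    CatalogEntry("provider-c-medium", "provider-c", "medium"),
]
```

In this sketch, asking for a `medium` model from `provider-a` then `provider-b` yields `provider-b-medium`, because the first provider has no model in that tier.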

`reasoning-effort`

Type: string

Reasoning effort level: `low`, `medium`, `high`, or a custom provider-specific value. Controls how much the model reasons before responding. Higher effort uses more tokens but can improve quality. When not specified, the provider's default is used.

This is different from `model-size`: `model-size` selects the class of model, while `reasoning-effort` controls how much the chosen model deliberates before answering.

reasoning-effort: high

Safety and Access Control

`trust-level`

Type: string

Trust level controlling how strictly the agent's tool calls are guarded. See Tool Guards for details.

| Value | Description |
| --- | --- |
| `low` | Shell is default-deny; strictest file and web rules |
| `medium` | Default-allow with destructive behavior blocking (default) |
| `high` | Default-allow with relaxed blocking |

trust-level: low

`allowed-tools`

Type: string[]

Tool names this agent is allowed to use. When set, only the listed tools are sent to the model and allowed to execute. When unset, all tools registered for the provider are available.

allowed-tools:
  - read_file
  - write_file
  - edit_file
  - grep
  - glob
  - shell
  - web_fetch

When validating an agent (via stencila agents validate), the validator cross-references this list against the allowed-tools declared by the agent's skills. If a skill needs a tool not in the agent's allowed-tools, a warning is shown. See Creating Agents — Validation for details.
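
The cross-reference check can be pictured as a set difference per skill. This is a rough sketch, not the validator's code: `missing_skill_tools` is a hypothetical helper, and treating an unset agent list as "no warnings" is an assumption based on the rule that all tools are available when `allowed-tools` is unset.

```python
from typing import Dict, List, Optional


def missing_skill_tools(
    agent_allowed_tools: Optional[List[str]],
    skill_allowed_tools: Dict[str, List[str]],
) -> Dict[str, List[str]]:
    """Map each skill name to the tools it needs that the agent does not allow."""
    if agent_allowed_tools is None:
        # Assumption: an unset list means every tool is available
        return {}
    allowed = set(agent_allowed_tools)
    warnings: Dict[str, List[str]] = {}
    for skill, tools in skill_allowed_tools.items():
        missing = [tool for tool in tools if tool not in allowed]
        if missing:
            warnings[skill] = missing
    return warnings
```

For an agent that allows only `read_file` and `grep`, a skill declaring `read_file` and `shell` would be reported with `shell` as the missing tool.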

`allowed-domains`

Type: string[]

Domain allowlist for `web_fetch`. Supports exact hosts and `*.` wildcard subdomain entries. When set, domains not in this list are denied.

allowed-domains:
  - docs.rs
  - "*.github.com"
  - crates.io

`disallowed-domains`

Type: string[]

Domain denylist for `web_fetch`. Supports exact hosts and `*.` wildcard subdomain entries. When both `allowed-domains` and `disallowed-domains` are set, the allowlist takes precedence.

disallowed-domains:
  - internal.corp.example.com
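
One way to read the allowlist and denylist rules is sketched below. The helpers `host_matches` and `is_fetch_allowed` are hypothetical, and two points are assumptions rather than documented behavior: that a `*.example.com` entry matches subdomains but not the bare `example.com`, and that "the allowlist takes precedence" means the denylist is ignored whenever an allowlist is set.

```python
from typing import List, Optional


def host_matches(host: str, pattern: str) -> bool:
    """Match a host against an exact entry or a "*." wildcard entry."""
    host, pattern = host.lower(), pattern.lower()
    if pattern.startswith("*."):
        # "*.github.com" -> any host ending in ".github.com"
        # (assumption: the bare "github.com" is not matched)
        return host.endswith(pattern[1:])
    return host == pattern


def is_fetch_allowed(
    host: str,
    allowed: Optional[List[str]] = None,
    disallowed: Optional[List[str]] = None,
) -> bool:
    """Decide whether a host may be fetched under the two lists."""
    if allowed is not None:
        # Allowlist set: only listed domains pass (assumed to override the denylist)
        return any(host_matches(host, p) for p in allowed)
    if disallowed is not None:
        return not any(host_matches(host, p) for p in disallowed)
    return True
```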

Skills

`allowed-skills`

Type: string[]

Skill names this agent can use. When unset, all discovered skills are available. When set to a non-empty array, only the listed skills are available. When set to an empty array, skills are disabled entirely. If the array contains exactly one skill, Stencila automatically preloads that skill's full instructions into the initial system prompt in addition to exposing it via `use_skill`. See the skills documentation for more on creating and using skills.

allowed-skills:
  - testing
  - documentation
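
The four cases (unset, non-empty, empty, exactly one) can be summarized in a short sketch. `resolve_skills` is a hypothetical helper, and `None` stands in for an unset property; the real resolution logic is not shown on this page.

```python
from typing import List, Optional, Tuple


def resolve_skills(
    allowed: Optional[List[str]],
    discovered: List[str],
) -> Tuple[List[str], Optional[str]]:
    """Return (available skills, skill to preload or None)."""
    if allowed is None:
        # Unset: every discovered skill is available, nothing is preloaded
        return list(discovered), None
    # Set: keep only listed skills; an empty list disables skills entirely
    available = [skill for skill in discovered if skill in allowed]
    # Exactly one listed skill: its instructions are preloaded
    preload = available[0] if len(allowed) == 1 and available else None
    return available, preload
```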

MCP Integration

`enable-mcp`

Type: boolean — Default: false

Whether to register MCP server tools directly in the agent's tool registry. Each tool from every connected MCP server is registered individually. This is simple but token-expensive; prefer `enable-mcp-codemode` for most agents.

enable-mcp: true

`enable-mcp-codemode`

Type: boolean — Default: true

Whether to register a single `mcp_codemode` tool for MCP orchestration. The model writes JavaScript to orchestrate MCP calls in a sandboxed environment. TypeScript declarations are included in the system prompt. This is much more token-efficient than direct MCP tool registration.

enable-mcp-codemode: true

`allowed-mcp-servers`

Type: string[]

MCP server IDs this agent is allowed to use. When unset, all discovered and connected MCP servers are available. When set, only the listed server IDs are used.

allowed-mcp-servers:
  - context7
  - my-database

Session Limits

`max-turns`

Type: integer — Default: 0 (unlimited)

Maximum total conversation turns. When reached, the session ends.

max-turns: 20

`max-tool-rounds`

Type: integer

Maximum tool-call rounds per user input. Limits how many times the model can call tools before it must respond to the user.

max-tool-rounds: 10

`tool-timeout`

Type: integer

Default timeout for tool execution, in seconds. Must be greater than 0.

tool-timeout: 60

`max-subagent-depth`

Type: integer — Default: 1

Maximum nesting depth for subagents. Controls how many levels deep subagents can spawn their own subagents.

max-subagent-depth: 2

Context Management

`history-thinking-replay`

Type: string — Default: none

Controls whether chain-of-thought content is included when replaying assistant turns in subsequent requests.

| Value | Description |
| --- | --- |
| `none` | Strip all thinking and reasoning from history, saving context space (default) |
| `full` | Replay thinking and reasoning content as-is |

history-thinking-replay: full

`truncation-preset`

Type: string — Default: balanced

Named preset for tool output truncation limits. Controls how aggressively tool outputs are truncated before being included in conversation context.

| Value | Description |
| --- | --- |
| `strict` | Tighter limits, preserves more context budget for conversation |
| `balanced` | Moderate limits suitable for most agents (default) |
| `verbose` | No additional truncation beyond spec defaults |

truncation-preset: strict

`compaction-trigger-percent`

Type: integer — Default: 70

Context usage percentage that triggers proactive history compaction. When estimated context usage exceeds this percentage of the model's context window, the agent proactively compacts conversation history. Set to 0 to disable proactive compaction.

compaction-trigger-percent: 80
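
The trigger condition amounts to a percentage comparison. The sketch below assumes token counts as the unit of context usage and reads "exceeds" as strictly greater than; `should_compact` is a hypothetical helper.

```python
def should_compact(
    estimated_tokens: int,
    context_window: int,
    trigger_percent: int = 70,
) -> bool:
    """Return True when estimated usage exceeds the trigger percentage."""
    if trigger_percent == 0:
        # 0 disables proactive compaction
        return False
    # Integer arithmetic avoids floating-point rounding at the boundary
    return estimated_tokens * 100 > trigger_percent * context_window
```

With the default of 70 and a 100,000-token window, compaction would start once estimated usage passes 70,000 tokens.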

Metadata

`compatibility`

Type: string

Environment requirements for the agent, at most 500 characters. Use it to indicate the intended product, required system packages, network access needs, and similar constraints. Most agents do not need this field.

compatibility: Requires Python 3.11+ and access to a PostgreSQL database

Markdown Body (Instructions)

The Markdown content after the closing `---` of the frontmatter is the agent's system instructions. These are appended to the system prompt as the highest-precedence user instruction layer.

---
name: code-reviewer
description: Reviews code for issues
---

You are a code reviewer. Follow these principles:

- Focus on correctness and security
- Suggest improvements with concrete code examples
- Be concise — flag only meaningful issues
- Do not modify files, only read and analyze

The body is optional. An agent with only frontmatter is valid and useful for model/provider configuration without custom instructions.
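
The file layout (frontmatter between two `---` lines, then an optional body) can be sketched as a simple splitter. `split_agent_md` is a hypothetical helper that only separates the two parts; it does not parse the YAML, and it is not how Stencila reads the file.

```python
from typing import Tuple


def split_agent_md(text: str) -> Tuple[str, str]:
    """Split an AGENT.md document into (frontmatter YAML, Markdown body)."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("document must start with a '---' line")
    # The next '---' line closes the frontmatter
    for index in range(1, len(lines)):
        if lines[index].strip() == "---":
            frontmatter = "\n".join(lines[1:index])
            body = "\n".join(lines[index + 1:]).strip()
            return frontmatter, body
    raise ValueError("frontmatter is missing its closing '---' line")
```

For a file containing only frontmatter, the returned body is an empty string, matching the case where the agent has no custom instructions.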