Agent Loop

Opal.Agent is the runtime loop that drives prompt handling, model streaming, tool execution, retries, and completion. The loop is now implemented as an OTP :gen_statem (lib/opal/agent/agent.ex) with explicit lifecycle states, while helper modules under lib/opal/agent/ keep stream parsing, tool orchestration, retries, and compaction concerns separated.

Interface

The public API remains stable and routes into the state machine:

Opal.Agent.start_link(opts)
Opal.Agent.prompt(agent, text) #=> %{queued: boolean()}
Opal.Agent.abort(agent)
Opal.Agent.get_state(agent)
Opal.Agent.get_context(agent)
Opal.Agent.set_model(agent, model)
Opal.Agent.set_provider(agent, provider_module)
Opal.Agent.sync_messages(agent, messages)
Opal.Agent.configure(agent, %{features: ..., enabled_tools: ...})

The runtime callback model is explicit:

@behaviour :gen_statem

callback_mode() :: :state_functions
idle(event_type, event_content, state)
running(event_type, event_content, state)
streaming(event_type, event_content, state)
executing_tools(event_type, event_content, state)

FSM States

State	Meaning	External commands
`:idle`	Waiting for prompt input	prompt, abort, calls
`:running`	Building context and starting provider stream	prompt queued, abort, calls
`:streaming`	Processing provider SSE events	prompt queued, abort, calls
`:executing_tools`	Running tool calls through supervised tasks	prompt queued, abort, calls

stateDiagram-v2
    direction LR
    [*] --> idle

    idle --> running: prompt
    running --> streaming: start provider stream

    streaming --> running: finalize response
    running --> executing_tools: tool calls emitted
    executing_tools --> running: tool batch complete

    running --> running: retry timer

    running --> idle: turn complete
    running --> idle: abort or terminal error
    streaming --> idle: abort or stream error
    executing_tools --> idle: abort

    note right of running
      Busy prompts are queued
      in pending_messages until safe handoff.
    end note

How it works

1. Prompt intake and gating

prompt/2 uses :gen_statem.call. In :idle, input is appended as a user message, the state transitions to :running, and the caller receives %{queued: false}. In non-idle states, prompts are queued in pending_messages and the caller receives %{queued: true}.

2. Turn start in `:running`

run_turn/1 builds the message list, applies compaction checks, resolves active tools, and starts streaming through the configured provider. The machine then transitions to :streaming.

3. Streaming in `:streaming`

The loop consumes SSE chunks via Req.parse_message/2, then passes each JSON line through provider.parse_stream_event/1.

Opal.Agent.Stream normalizes provider events (:text_delta, :tool_call_done, :usage, :response_done, etc.) and updates accumulated response fields.

4. Finalization and branch

On stream completion, the assistant message is appended and the machine re-enters :running. If tool calls are present, control moves to :executing_tools. Otherwise the loop emits {:agent_end, ...} and returns to :idle.

5. Tool execution in `:executing_tools`

Tool calls are started concurrently using Task.Supervisor.async_nolink. Results are received through state-machine :info messages, converted to :tool_result messages, and the machine returns to :running for the next provider turn.

flowchart LR
    A[handle_event callback] --> C[handle_call/info]
    C --> E[next_state transition]

    E --> F[streaming]
    F --> G[Opal.Agent.Stream]
    E --> H[executing_tools]
    H --> I[Opal.Agent.ToolRunner]
    E --> J[running]
    J --> K[Opal.Agent.UsageTracker + Opal.Agent.Retry]

6. Resilience paths

Opal.Agent.Retry classifies transient provider errors and schedules exponential backoff.
Opal.Agent.UsageTracker and Opal.Agent.Overflow handle auto-compaction and overflow recovery before retrying turns.
abort/1 cancels in-flight stream/tool work and forces :idle.

Agent module layout

The agent runtime now follows a responsibility-first layout under lib/opal/agent/:

agent.ex — :gen_statem loop and state transitions
state.ex — runtime state struct/types
stream.ex — provider event parsing and stream-state updates
tool_runner.ex — concurrent tool lifecycle orchestration
retry.ex — retry policy and backoff classification
usage_tracker.ex + overflow.ex — usage tracking, compaction, and overflow handling
repair.ex + system_prompt.ex + emitter.ex — message repair, prompt assembly, and event broadcasting
spawner.ex + collector.ex — sub-agent orchestration and response collection

References

Erlang gen_statem — OTP state machine behaviour used by Opal.Agent.
Elixir GenServer — messaging model still used by sibling subsystems and APIs around the loop.
Erlang/OTP Supervisor Principles — supervision strategy used by session-local processes and tool tasks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent Loop

Interface

FSM States

How it works

1. Prompt intake and gating

2. Turn start in `:running`

3. Streaming in `:streaming`

4. Finalization and branch

5. Tool execution in `:executing_tools`

6. Resilience paths

Agent module layout

References

FilesExpand file tree

agent-loop.md

Latest commit

History

agent-loop.md

File metadata and controls

Agent Loop

Interface

FSM States

How it works

1. Prompt intake and gating

2. Turn start in :running

3. Streaming in :streaming

4. Finalization and branch

5. Tool execution in :executing_tools

6. Resilience paths

Agent module layout

References

2. Turn start in `:running`

3. Streaming in `:streaming`

5. Tool execution in `:executing_tools`