Spec

Router

Purpose

The router is the only place in the system where the decision “what should happen now?” is made for an incoming event. It is built to be the smallest layer that does that job — most events route by a literal type check, and only the genuinely ambiguous ones (comments mentioning the bot, free-form chat messages) reach an LLM.

There is no LLM in any path that decides whether to act on issue.opened or pr.opened. Those go to skills directly. The LLM enters only when a human has typed natural language at the bot.

Public contract

export async function routeEvent(
  envelope: EventEnvelope,
  deps: RouterDeps = {},
): Promise<RoutingResult>;

export type RoutingResult =
  | { action: "skill";  skill: string; context: Record<string, unknown> }
  | { action: "reply";  message: string }
  | { action: "ignore"; reason: string };

Three possible outcomes — dispatch a skill, send a direct reply (no agent involved), or drop the event. The harness consumes the result in src/index.ts:560–1124 and routes to the matching handler.

RouterDeps carries the DB handle (for the reply-gate lookup), the managed-repos set, optional model overrides for the classifier and screener, and a github client used solely to enrich a dependency-PR mention comment with its check state before classification (see the comment table). The router is otherwise side-effect-free — this one read-only fetch is gated to Dependabot / Renovate PR comments.

Defined in src/engine/router.ts:8–40.

Deterministic routes

These run before any LLM call. For every envelope, the first matching rule wins.

The @last-light handle shown throughout this table is the default. The mention handle is derived from the configured bot slug (botName, default last-light; set via overlay config.yaml or GITHUB_APP_BOT_NAME), so a deployment with botName: nearform-lastlight triggers on @nearform-lastlight instead. Only the configured handle matches — there is no legacy fallback (see Configuration).

Trigger	Result	Notes
`issue.opened` / `issue.reopened`	`skill: issue-triage`	`reopened=true` for the latter
`pr.opened` / `pr.synchronize` / `pr.reopened`	`skill: pr-review`
`pr.checks_failed`	classifier → the workflow claiming the intent (else `ignore`)	A failing `check_suite` — the connector only emits this for a dependency-update PR (deterministic commit-author / branch-prefix gate, mirroring `pr.checks_passed`), so a human’s red PR never reaches here. The classifier then routes the recognised bump to `dependabot-ci-fix`. Unlike other structured events, routing goes through the intent classifier so workflows self-register via `classification`
`pr.checks_passed`	the workflow claiming the `dependabot-pr-merge` intent (else `ignore`)	A green `check_suite` on a dependency-update PR (the connector already pre-filtered to Dependabot / Renovate). Routed deterministically via `getWorkflowByIntent("dependabot-pr-merge")` — no classifier call, since the dependency-PR gate is the connector’s job
`comment.created` with pending reply gate	`skill: explore-reply`	Reply-gate short-circuit — see below
`comment.created` on a pre-build issue, plain (no `@last-light`)	`skill: issue-triage` (`mode: retriage`)	Reporter-driven re-triage — see below
`comment.created` without `@last-light`	`ignore`	reason: “no bot mention”
`comment.created` from non-maintainer	`reply: "only maintainers can trigger builds"`	`authorAssociation` not in `MAINTAINER_ROLES`
`comment.created` matching `@last-light approve\|reject [reason]`	`skill: approval-response`	Regex parse, no classifier
`comment.created` matching `@last-light security-review`	`skill: security-review`
`comment.created` matching `@last-light verify <claim>`	`skill: verify`	Text after the keyword becomes `commentBody`; works on issues + PRs
`comment.created` matching `@last-light qa-test <target>`	`skill: qa-test`	Text after the keyword becomes `commentBody`; works on issues + PRs
`comment.created` matching `@last-light demo <notes>`	`skill: demo`	Text after the keyword becomes `commentBody`; works on issues + PRs
`comment.created` on issue with `security-scan` label	`skill: security-feedback`	Overrides classifier — every comment on a scan summary issue is feedback
`@last-light`-mention on a Dependabot / Renovate PR	classifier → `dependabot-ci-fix` (red) / `dependabot-pr-merge` (green)	For a dependency-authored PR the router fetches the PR + `getChecksConclusion` and passes `prAuthor` + `checksState` to the classifier, so an ambiguous “@bot can you look at this?” routes like the webhooks would. Gated on the cheap author/title predicate (`isDependencyPr`) and best-effort — a fetch failure or an explicit “review this” falls back to normal classification. Needs `github` in `RouterDeps`
`message` with pending reply gate on this Slack thread	`skill: explore-reply`	Same short-circuit as GitHub
`pr_review.submitted` / `pr_review_comment.created`	`ignore`	”not yet handled” — placeholder

The remaining comment types — maintainer @-mentions without a special command, and free-form Slack messages — reach the classifier.

Reply-gate short-circuit

Before any mention parsing, the router asks the DB:

deps.db.getPendingReplyGateByTrigger(triggerId)

triggerId is "${repo}#${issueNumber}" for GitHub events and "slack:${teamId}:${channelId}:${threadId}" for Slack messages. If a paused workflow is waiting on this conversation, the comment becomes the next loop iteration’s input — no @last-light mention required, no maintainer check, no classifier call. This is the mechanism that lets the explore workflow have a natural back-and-forth with a human.

(router.ts:97–112 for GitHub, 272–288 for Slack.)

Reporter-driven re-triage

Sitting just above the mention gate (so it can catch the plain replies the mention gate would otherwise drop), this branch lets new information re-open triage on an issue before it has entered a build. It fires only for a GitHub comment.created that is on an issue (not a PR), carries no @last-light mention, and whose issue has no build run in workflow_runs (deps.db.runs.hasRunForTrigger("${repo}#${n}", "build") — any status, so a started/failed/completed build closes the window). Within that gate:

needs-info issue + original author or a maintainer replies → re-triage. Answering a needs-info request always re-opens triage.
Any other state + the original author replies → re-triage only if a cheap classifier (classifyCommentAddsInfo) judges the comment to add substantive information (new detail / repro / clarification / scope change) rather than social noise (“thanks”). Safe default on classifier error is no re-triage.

Re-triage reuses the issue-triage handler with context.mode = "retriage"; the triage agent re-reads the whole thread regardless. Bot comments are filtered at the connector, so this can’t self-loop. Author identity comes from envelope.issueAuthor vs envelope.sender. (router.ts, comment.created branch.)

Maintainer gate

// src/engine/router.ts:28
const MAINTAINER_ROLES = new Set(["OWNER", "MEMBER", "COLLABORATOR"]);

Only these authorAssociation values can trigger work via @last-light on a GitHub comment. Anyone else gets action: "reply" with a polite decline (router.ts:123–130) — the router itself emits the reply, not a workflow.

This check fires only after the @last-light mention check, so a non-maintainer chatting in an issue without summoning the bot just gets the “no bot mention” ignore — no rejection noise.

Slack messages have no equivalent gate; the messaging connector’s SLACK_ALLOWED_USERS allowlist (enforced at the connector layer) is the only access control on chat.

Prompt-injection screening

For events that reach a classifier, the router runs a cheap LLM screener in parallel with the classifier, not sequentially. Both finish in roughly the same time so combined latency is max(a, b), not a + b.

// src/engine/screen/screen.ts:47
async function screenForInjection(text: string, model?: string): Promise<ScreenResult>;
// ScreenResult = { flagged: boolean; reason?: string }

Behaviour:

Short input (< 60 chars) skips the screener and returns flagged: false.
Failure (timeout, parse error) returns flagged: false. The screener is fail-open. A broken screener never blocks an event.
When flagged: true, the router prepends [lastlight-flag: <reason>] to the comment body before passing it to the workflow context (router.ts:180–182). The flag is advisory — the agent still runs, but agent-context/security.md instructs it to treat flagged content with suspicion.

Build-intent classifier

The classifier turns a free-form comment or message into one discrete intent. By default there are fourteen:

BUILD | EXPLORE | QUESTION | TRIAGE | REVIEW | SECURITY |
VERIFY | QATEST | DEMO | APPROVE | REJECT | STATUS | RESET | CHAT

(VERIFY → the verify workflow, QATEST → qa-test, DEMO → demo. The structured @last-light verify / @last-light qa-test / @last-light demo keyword matches above short-circuit before the classifier; natural-language requests like “does this actually fix the crash?” reach verify via this classifier path, and “record a demo of this” reaches demo.)

The prompt is composed, forkable, and workflow-driven (issue #164). The system prompt is assembled at runtime by buildClassifierPrompt(), not hardcoded:

A forkable base template — workflows/prompts/classifier.md, resolved through the same overlay machinery as any other prompt (overlay wins by name; lastlight fork classifier copies it into instance/). It holds the framing, the global disambiguation rules, the five harness-owned control categories (APPROVE/REJECT/STATUS/RESET/CHAT), and the {{categories}} / {{examples}} / {{intentTokens}} slots.
Per-workflow categories. Each workflow YAML contributes its own category via a classification: block (intent + description + optional examples); build.yaml owns BUILD, pr-review.yaml owns REVIEW, and so on. The classifier enumerates listAgentWorkflows(), merges the blocks in canonical order, and derives the token→intent vocabulary (token = intent.toUpperCase().replace(/-/g,""), so qa-test → QATEST) from the same blocks. The composition is memoised and rebuilt when the asset layers change.

So a deployment can add a routable intent by adding a workflow — an overlay incident.yaml with classification.intent: incident teaches the classifier the INCIDENT category, the parser the token, and the router the route (see data-driven routing) with no core edit. The loader validates at boot that every classification.intent is unique and doesn’t shadow a control intent.

The model must reply in exactly four lines:

INTENT: BUILD
REPO: owner/name
ISSUE: 42
REASON: NONE

Heuristics worth knowing:

BUILD requires an object. Either an explicit repo (Slack) or an implicit one (the GitHub issue the comment lives on).
EXPLORE requires brainstorm/spec/design language. “Let’s think about” / “what would it look like if”. Otherwise BUILD or CHAT.
APPROVE/REJECT only matter if a gate is pending — they’re emitted but the harness verifies the gate before acting.
Ambiguous → CHAT. The default bias is conservative.

Failure modes:

Timeout (30 s default) or parse error → { intent: "chat" }. Conservative fallback — the user gets a chat reply instead of an accidental build.
Regex fallback for repo extraction: if the LLM misses an obvious github.com/owner/repo/issues/N URL in the text, classifier.ts parses it directly (182–189).

Called only on (a) GitHub comment.created with maintainer @-mention, and (b) Slack message. Never on deterministic events.

Data-driven routing for new intents

The router keeps its bespoke, context-dependent branches for the well-known intents (build → pr-fix on a PR vs build on an issue; explore is a no-op on a PR; the security-scan diversion). For an intent outside that well-known set — a new one an overlay workflow introduced — the trailing generic default falls back to getWorkflowByIntent(intent) (the workflow whose classification.intent matches), routing to it on both GitHub comments and Slack. It routes to that single workflow uniformly across surfaces; a deployment needing surface-specific routing for its new intent can still add routes.github / routes.slack overrides. Well-known intents never hit the fallback, so their established routing is untouched.

New issues (issue.opened) route through the same composed classifier: classifyIssueIntent() runs the main classifier over the issue title + body and sends a QUESTION intent to the answer workflow, everything else to triage (the safe default). This replaced a separate hardcoded question-vs-work prompt — answer.yaml now owns the QUESTION category, so the two share one vocabulary.

A second, smaller helper — classifyCommentAddsInfo — answers a single yes/no question (does a reporter’s plain comment add substantive information, or is it social noise?) and gates the reporter-driven re-triage branch for non-needs-info issues. Its prompt is the forkable workflows/prompts/classify-adds-info.md. Same cheap-helper path; fail-closed (error → no re-triage).

`llm.ts` — the cheap-helper path

Both screener and classifier dispatch through src/engine/llm.ts, which does direct HTTP POSTs to provider APIs (Anthropic Messages, OpenAI Chat Completions, OpenRouter passthrough). No agent SDK, no tools, no streaming — single-turn calls only.

Fast-model resolution (defaultFastModel(taskType) in llm.ts), in order:

The config models: map for the task key (models.classifier, models.screener) — set it in config.yaml like any other per-task model. Env OPENCODE_MODELS / LASTLIGHT_MODELS is layered into this map at config-load, so it’s covered here too (env wins over config.yaml).
The env OPENCODE_MODELS JSON read directly — a fallback for contexts where runtime config isn’t loaded (some CLI / test paths).
First configured provider’s fast model, in registry order: ANTHROPIC_API_KEY → anthropic/claude-haiku-4-5-20251001, else OPENAI_API_KEY → openai/gpt-5.4-mini, else OPENROUTER_API_KEY → openrouter/google/gemini-2.5-flash.

Only an explicit per-task entry counts — never models.default — so the cheap helpers stay cheap unless a deployment deliberately pins them.

Single retry on 429 / 5xx with a 750 ms back-off; never retries on other 4xx (those are real errors).

This path is intentionally separate from the agentic-pi / pi-ai runtimes used by the sandbox and chat surfaces. Routing decisions should not pay the cost of those richer code paths.

Skill enumeration

The full set of skill strings routeEvent() can emit, and where each is handled in the harness:

Skill	Handler in `src/index.ts`
`chat`	`577–650` — in-process chat runner
`chat-reset`	`654–661` — deactivate session, ack
`status-report`	`664–675` — list running executions
`approval-response`	`839–893` — resume or fail paused run
`explore-reply`	`750–836` — feed comment into paused explore loop
`pr-fix`, `dependabot-ci-fix`	`handlePrFix` — lightweight fix-and-push (both are `PR_FIX_SHAPED_WORKFLOWS`; resolves the PR head branch + failed-check summary, skips fork PRs, dispatches the named workflow)
`build`	`896–976` — full build cycle on an issue
`answer`	`982–1014` — generic `dispatchWorkflow()` for `answer.yaml`; answers a question issue directly (routed via `routes.github.issue_answer` / `routes.slack.answer`)
`pr-review`, `pr-comment`, `issue-triage`, `issue-comment`, `explore`, `security-review`, `security-feedback`, `verify`, `qa-test`, `demo`	`982–1014` — generic `dispatchWorkflow()` + ack

The generic-dispatch lane runs the YAML workflow whose name matches the skill string. Anything bespoke (e.g. build first records an execution row and reacts 🚀 on the comment before dispatching) gets its own branch.

Introspection — the route playground

Because routeEvent performs no side effects, a synthetic event can be threaded through the real classifier and router to preview its decision without ever dispatching a workflow. Two admin endpoints expose this:

GET /admin/api/route-graph — the static map the dashboard draws: inputs (GitHub + Slack), each connector’s event types tagged deterministic vs classifier, the handler set (workflows + in-process handlers), and the deterministic + intent edges (derived from getRoutes() + listAgentWorkflows()).
POST /admin/api/route-test — a hermetic dry-run. It builds an EventEnvelope from the request (with an inert reply no-op and empty raw), calls routeEvent(envelope, {}) — no db, no github, so zero external reads/writes — and, for comment-text types, calls classifyComment(…, { explain: true }) to surface the model’s one-sentence reasoning. It returns the Route, the classification, and a composed explanation. It never touches dispatch / dispatchWorkflow (structurally absent from the admin surface), so the only external effect is the cheap classifier call. Powers the dashboard Router Playground page.

The explain option on classifyComment is test/introspection-only: the REASON: line is already parsed for every intent, so it only nudges the model to populate it — production never sets it, leaving classifier output and token cost unchanged.

Invariants

No LLM in deterministic routes. The opening / synchronize / open events route by event type. The LLM never decides whether to triage an issue.
Reply gate beats mention parsing. If the DB says a workflow is waiting on this thread, the comment goes there — regardless of whether it mentions the bot, contains a slash command, or anything else. The natural-language continuation is the point.
Maintainer gate is a router decision. Workflows assume their caller has been authorized. A re-implementation that lets non- maintainer events reach workflows will leak.
Screener is fail-open, classifier is fail-CHAT. The screener failing should never silence the bot; the classifier failing should never accidentally launch a build cycle. These defaults are not symmetric on purpose.
ignore is silent. No reply, no log entry beyond the router’s console line, no DB write. The contract with the user is “if the bot doesn’t react, the bot didn’t see it”.
Bot self-loop guards live in connectors, not here. The router does not re-check senderIsBot. Adding a duplicate check would mask bugs in the connector layer.

Current implementation

Piece	File
`routeEvent`, `RoutingResult`, `MAINTAINER_ROLES`, `BOT_MENTION` regex	`src/engine/router.ts`
Build-intent classifier (compose + parse)	`src/engine/screen/classifier.ts`
Composable base prompt + adds-info prompt	`workflows/prompts/classifier.md`, `workflows/prompts/classify-adds-info.md`
Per-workflow category source	`classification:` block in each `workflows/<name>.yaml`
Intent → workflow fallback	`getWorkflowByIntent()` in `src/workflows/loader.ts`
Injection screener	`src/engine/screen/screen.ts`
Direct provider calls + model auto-detect	`src/engine/llm.ts`
Harness consumer (skill → handler)	`src/index.ts:560–1124`
URL extraction fallback	`extractGithubRefFromText()` in `classifier.ts`
Route playground endpoints (`/route-graph`, `/route-test`)	`src/admin/routes.ts`

Rebuild notes

Parallelise the two LLM calls. Screener and classifier run in parallel for a reason — they’re both single-shot and roughly the same size. Doing them sequentially would double user-visible latency on every chat turn.
Use a discriminated union for the result. RoutingResult as { action } & ... lets the harness’s switch be exhaustive and type-safe. A re-implementation that returns “skill string or null” is losing information.
Keep the LLM behind a feature gate. A re-implementation may need to operate in environments without any LLM provider key. The classifier should be replaceable with a “default to CHAT” stub so the rest of the system still works.
Don’t centralise auth here. The router checks authorAssociation for @last-light build commands because the check is intent-specific (“is this a maintainer asking for write actions?”). Generic authorization belongs in connectors. Move it out and the router gets cluttered.
Reply-gate lookup is a single indexed DB query. It runs on every comment and every Slack message, so cost it. SQLite handles it trivially; a re-implementation on a remote DB should cache the active set of triggerIds in memory.
Treat the classifier prompt as code. A change to the base template’s output format or fallback rules ripples through every chat surface; the intent set itself is now data (one classification: block per workflow). Version both like config; test with golden cases (buildClassifierPrompt() is pure and snapshot-friendly). Keep the base template and the per-workflow blocks in sync with the token→intent vocabulary they compose into.