docs(exploration): substrate option — multi-agent text MUD (Evennia + N×LLM NPCs)#31
Merged
Merged
Conversation
… N×LLM NPCs) A new exploration evaluating whether the ladder (adr-0007) should replace its deferred AI Town slot with a self-hosted Evennia + N≥3 autonomous LLM NPCs + operator + main agent. Surfaces three core tensions, gates resolution on three concrete spikes (S1/S2/S3) and an operator UX ack. Why this isn't a no-op: - Per the companion zh prior-art supplement, GenerativeAgentsCN empirically validates N=25 LLM agents on local Ollama + Qwen3-4B — materially de-risks the local-cost question that previously made AI Town's slot effectively dormant. Combined with telnet/Evennia integration cost being lower than AI Town's Convex+TS stack and project-stack fit (Python+Ollama already in tree), the option *strictly dominates* AI Town on the dimensions AI Town was placed on the ladder for. Why this isn't proposed as a P0/P1 replacement: - Melvor P0 lock (adr-0003/0007) holds — multi-agent + non-determinism before the apparatus is proven violates the clean-room principle. - Stardew P1 is on the ladder for *operator co-presence warmth* (named NPCs, calendar, gifts, graphical) — Option B doesn't replicate that. The three core tensions, recorded so future readers can interrogate them: 1. Dyad-vs-multi-agent attention split — load-bearing mitigation: foreground/background architectural asymmetry (operator + main agent fully-modeled; NPCs small local + short context + not in reflection loop). 2. Setting choice cuts across criteria — wuxia gives a zh asset trove but imports a skill-grinding culture that risks adr-0004; cannot resolve until S2 + operator ack. 3. Determinism at multi-LLM × persistent world × seeded replay is unprecedented — Paracosm/Miniverse close on a subset. Gating spikes: - S1: fork GenerativeAgentsCN, measure tokens/tick at N=25 on operator HW. - S2: import a DikuMUD area into Evennia + skim wuxia-MUD lib for content decision input. - S3: seeded local Ollama + logical tick + content-cached LLM responses; byte-equal replay of a 50-tick scripted scenario. - S0 (after S1–S3): AgentScope vs ElizaOS-core comms bench. Decision gates and out-of-scope set in note. Status stays `open` pending operator co-presence UX ack — the blocking signal before any spike. Sources include LIGHT (Meta 2019, closest historical MUD-AI precedent), GenerativeAgentsCN, AgentScope, AgentVerse, Concordia, Paracosm, Miniverse, vLLM reproducibility, DikuMUD `tbamud`, AI People (GoodAI, closed commercial parallel), and the zh ecosystem links (pkuxkx wiki, mudcore, mudchina, mud.ren炎黄 MUD). Companion PR adds the underlying zh supplement to exploration/prior-art-l2-l3.md. Session-Id: 019e9e62-7e3f-7286-9de2-7b3bc7b9369d Agent: cc-rc-bot
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
New exploration note evaluating whether the ladder (adr-0007) should replace its deferred AI Town slot with a self-hosted text MUD (Evennia, possibly DikuMUD-area-fed) populated by N≥3 autonomous LLM-driven NPCs + one operator + one designated "main agent."
Surfaces three core tensions, gates resolution on three concrete spikes (S1/S2/S3) and an operator UX ack. Status remains
openpending operator co-presence ack.Why this is more than a no-op
Per the companion PR's zh prior-art supplement, GenerativeAgentsCN empirically validates N=25 LLM agents running on local Ollama + Qwen3-4B. Combined with telnet/Evennia integration cost being lower than AI Town's Convex+TS stack and project-stack fit (Python+Ollama already in tree), Option B strictly dominates AI Town on the dimensions AI Town was placed on the ladder for.
Why this isn't a P0/P1 replacement
The three tensions
Gating spikes (must all pass)
tbamudarea into Evennia + skim wuxia-MUD lib (pkuxkx wiki / mudcore).Companion PR
#30 — adds the zh supplement to
exploration/prior-art-l2-l3.mdthat this note references via [[prior-art-l2-l3]]. The two PRs are intentionally separate: the prior-art update is independently useful regardless of how Option B resolves.Test plan
[[name]]links resolve to existing notes or are intentional forward references.harness-cisignal (path-filtered; no harness/ touched, but runs on every PR per recent change).🤖 Generated with Claude Code