InvolutionHell
diff --git a/‎app/docs/all-projects/ai-town.en.mdx‎
Lines changed: 74 additions & 0 deletions b/‎app/docs/all-projects/ai-town.en.mdx‎
Lines changed: 74 additions & 0 deletions
diff --git a/‎app/docs/all-projects/multimodal-rl.en.mdx‎
Lines changed: 94 additions & 0 deletions b/‎app/docs/all-projects/multimodal-rl.en.mdx‎
Lines changed: 94 additions & 0 deletions
@@ -0,0 +1,74 @@
+---
+title: AI Town Design Document
+description: ""
+date: "2025-10-18"
+tags:
+  - ai-project
+docId: bkxwg1m9p9rnm8062wsm020w
+lang: en
+translatedFrom: zh
+translatedAt: 2026-04-15T08:00:00Z
+translatorAgent: claude-sonnet-4-6
+---
+
+# AI Town Design Document
+
+## 1. Project Overview
+
+- **Type**: A lightweight simulation + social + quest-based mini-game driven by multi-agent NPCs
+- **Core selling points**: NPCs "remember you" and collaborate with each other through dialogue; players can use **community contribution points** (earned by posting, submitting PRs, etc.) to obtain in-game currency and abilities, driving town events
+- **Technology foundation**: Godot 4 (Microverse-style) + multi-agent framework (O-R-P-A: Observe → Retrieve → Plan → Act) + local model first (with template fallback)
+
+## 2. Goals (MVP)
+
+1. Single map + 3 NPCs (merchant / messenger / editor) + quest board (fetch / relay / check-in)
+2. Dialogue with **short-term memory + end-of-day summary**
+3. **Minimal community integration**: support entering a "redemption code" to receive coins / action points (future: automatic issuance via webhook)
+4. Use points (or redeemed coins) to trigger 2–3 **visible world changes** (discount day / extra quests / expanded dialogue budget)
+
+## 3. Core Gameplay (Version 1)
+
+- **Loop**: Accept quest → Dialogue / collaborate with NPCs → Complete to earn coins / AP → Nightly summary generated → Events refresh next day
+- **Uses of points / coins** (pick 2–3 to implement first)
+  - Unlock a **discount day** at the shop (all prices -10%)
+  - Purchase **action points** (one extra quest per day)
+  - Purchase **dialogue budget** (3 additional conversation turns with an NPC that day)
+  - Trigger a **theme-day announcement** (published by the editor NPC; NPC dialogue becomes more active)
+
+## 4. Open-Source Community Integration (Two Phases)
+
+### Phase A (MVP) — Redemption Code Verification
+
+- The community backend issues one-time **redemption codes** (containing point value and expiry); player enters code in-game → server verifies and voids → returns coins / AP
+- **Advantage**: No login or account binding required; maximally stable and ready to ship
+
+### Phase B (Mid-term) — Automatic Issuance via Webhook
+
+- Posts, PR merges on GitHub / the site trigger a Webhook → write to `pending_rewards`
+- Game launch or clicking "Sync" → fetch pending rewards → automatically credited
+- Optional: bind Steam / GitHub account for stronger identity verification
+
+## 5. System Architecture (Minimal Modules)
+
+- **Client (Godot)**
+  - `Wallet` (authoritative entry point for coins / AP)
+  - `TaskManager`, `DialogManager`, `MemoryManager`, `CharacterManager`
+  - `TownEventBus` (broadcasts shop open / midday break / close / theme day)
+  - `RedeemPanel` (redemption code UI)
+
+- **Services (can be merged into community backend)**
+  - `/api/v1/redeem` (one-time verification and voiding)
+  - (Reserved) `/api/v1/rewards/pending`, `/webhooks/github`
+
+**Data Flow (MVP)**  
+Community issues code → Player enters it in-game → `redeem` verifies → Returns coins / AP → `Wallet` credits → `TownEventBus` triggers discount / quest refresh
+
+## 6. Scoring and Spending (Initial Draft)
+
+| Action           | Community Points Earned | In-game Conversion (Example) |
+| ---------------- | ----------------------: | ---------------------------- |
+| Post approved    |                     +80 | 80 pts = 400 coins           |
+| PR merged        |                     +80 | 80 pts = 400 coins           |
+| Article featured |                     +50 | 50 pts = 1 "theme day" item  |
+
+> Conversion rates are stored in a config file; events can apply temporary bonuses (e.g., 1.2× on weekends)
@@ -0,0 +1,94 @@
+---
+title: Multimodal Reinforcement Learning Project (MVP Goals)
+description: Build a lightweight multimodal understanding and generation system that closes the loop from visual perception to language expression, incorporating reinforcement learning and answer-to-image generation.
+date: "2025-10-17"
+tags:
+  - projects
+  - multimodal
+  - reinforcement-learning
+  - RLHF
+docId: ifwz8sqxqsgjrafa79pycrcm
+lang: en
+translatedFrom: zh
+translatedAt: 2026-04-15T08:00:00Z
+translatorAgent: claude-sonnet-4-6
+---
+
+# Multimodal Group – MVP Specification
+
+**Project version:** v0.1  
+**Repository:** [involutionhell](https://github.com/InvolutionHell/involutionhell)
+
+---
+
+<a id="vision"></a>
+## 1. Vision
+
+Build a lightweight multimodal understanding and generation system that enables the model to interpret images, retrieve relevant information, and produce logically coherent text output.  
+The goal is to close the full loop from visual perception to language expression, and further develop the ability to explain answers through generated images.
+
+<a id="mvp-goals"></a>
+## 2. MVP Phase Goals
+
+<a id="phase-1"></a>
+### Phase 1: Basic Multimodal Pipeline
+
+- Image content recognition (objects, scenes, semantic labels).
+- Semantic retrieval (image → text / text → image).
+- Generative understanding and text output.
+- Model references: CLIP / SigLIP / BLIP-2 / LLaVA / Qwen-VL.
+
+<a id="phase-2"></a>
+### Phase 2: Multimodal Reinforcement Learning
+
+- Incorporate user feedback and reward signals to optimise model generation and retrieval performance.
+- Main directions:
+  1. RLHF / DPO fine-tuning to learn user preferences.
+  2. Retrieval strategy optimisation based on behavioural data.
+  3. Generation quality control and consistency improvement.
+
+- Goal: give the system the ability to self-improve and adapt to user preferences.
+
+<a id="phase-2-5"></a>
+### Phase 2.5: Answer-to-Image Generation
+
+- Automatically generate illustrative images from the model's text answers to aid comprehension.
+- Implementation: use Stable Diffusion / SDXL to convert answer text into image prompts.
+- Application examples:
+  - Answer "the process of black hole formation" → generate a structural diagram.
+  - Explain a scene from a novel → generate a conceptual illustration.
+
+- Goal: enable the system not only to understand images and answer questions, but also to explain answers through generated images.
+
+<a id="architecture"></a>
+## 3. System Architecture
+
+```
+[Frontend] → Upload image / Display results
+      ↓
+[Backend API] → FastAPI + LangChain + Vector Search
+      ↓
+[Multimodal Models] → CLIP / BLIP / LLaVA / Qwen-VL
+      ↓
+[RL Module + Answer-to-Image] (Phase 2 and 2.5)
+```
+
+<a id="milestones"></a>
+## 4. Milestones
+
+| Phase     | Goal                                  | Deliverables                                  |
+| --------- | ------------------------------------- | --------------------------------------------- |
+| Phase 1   | Multimodal recognition and generation | Image recognition, retrieval, text generation |
+| Phase 2   | Reinforcement learning optimisation   | RLHF / DPO, retrieval strategy optimisation   |
+| Phase 2.5 | Answer-to-image generation            | Automatic illustration generation             |
+| Phase 3   | Scaling and deployment                | Web demo and API interface                    |
+
+<a id="team"></a>
+## 5. Team Responsibilities
+
+| Module                                          | Owner    |
+| ----------------------------------------------- | -------- |
+| Image recognition and encoding                  | Member A |
+| Semantic retrieval and data processing          | Member B |
+| Generation module and model integration         | Member C |
+| Reinforcement learning and visualisation output | Member D |