refactor(scheduler): modularize handlers and search pipelines #1004

fancyboi999 · 2026-02-03T11:35:20Z

Description

Summary:

Extract scheduler handlers into dedicated modules and register them via a registry.
Split retriever into search/enhance/rerank/filter pipelines for clearer responsibilities.
Centralize text search logic so API and scheduler share the same implementation.

Problem:

Scheduler logic was monolithic and tightly coupled, hard to test/extend.
Search logic duplicated between scheduler and API.

Approach:

Introduce mem_scheduler/handlers with DI-style context.
Introduce retriever pipelines and keep SchedulerRetriever as a facade.
Add memos/search/search_service.py for shared text search.

Dependencies:

None.

Related Issue (Required): Fixes #1003

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Refactor (does not change functionality, e.g. code style improvements, linting)
Documentation update

How Has This Been Tested?

Unit Test
Test Script Or Test Steps (please provide)
Pipeline Automated API Test (please provide)

Test steps:

Start server: cd src && python -m uvicorn memos.api.server_api:app --host 0.0.0.0 --port 8001
Test endpoints:
- GET /product/scheduler/allstatus
- POST /product/add
- POST /product/search (fast / fine / mixture)
- POST /product/scheduler/wait
- GET /product/scheduler/task_queue_status

Test Results:

Endpoint	Status
GET /product/scheduler/allstatus	✅ 200 OK
POST /product/add	✅ 200 OK, data persisted to Neo4j
GET /product/scheduler/task_queue_status	✅ 200 OK
POST /product/search (fast)	✅ 200 OK, returns correct memories
POST /product/search (fine)	✅ 200 OK
POST /product/search (mixture)	✅ 200 OK
POST /product/scheduler/wait	✅ Works as expected
Unit tests (pytest)	✅ 61 passed
Ruff lint checks	✅ All passed

Checklist

I have performed a self-review of my own code | 我已自行检查了自己的代码
I have commented my code in hard-to-understand areas | 我已在难以理解的地方对代码进行了注释
I have added tests that prove my fix is effective or that my feature works | 我已添加测试以证明我的修复有效或功能正常
I have created related documentation issue/PR in MemOS-Docs (if applicable) | 我已在 MemOS-Docs 中创建了相关的文档 issue/PR（如果适用）
I have linked the issue to this PR (if applicable) | 我已将 issue 链接到此 PR（如果适用）
I have mentioned the person who will review this PR | 我已提及将审查此 PR 的人

Reviewer Checklist

closes fix: Scheduler architecture is monolithic and tightly coupled (hard to test/extend) #1003
Made sure Checks passed
Tests have been provided

- extract scheduler handlers into dedicated modules - split retriever into pipelines (search/enhance/rerank/filter) - centralize text search logic for API and scheduler Refs MemTensor#1003

Copilot

Pull request overview

This PR refactors the scheduler and retrieval stack to make handler logic modular, split retrieval into clearer pipelines, and centralize text-search behavior so that the API and scheduler share a single implementation.

Changes:

Introduces a shared search_service module and updates API/scheduler callers to use centralized text-memory search.
Refactors SchedulerRetriever into composable search/enhancement/rerank/filter pipeline classes and extracts corresponding mixins from BaseScheduler.
Modularizes scheduler handlers into dedicated classes wired via a SchedulerHandlerRegistry and DI-style SchedulerHandlerContext.

Reviewed changes

Copilot reviewed 27 out of 27 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
`src/memos/search/search_service.py`	Adds `SearchContext` and `search_text_memories` to centralize text-memory search arguments and behavior.
`src/memos/search/__init__.py`	Exposes the new search service types/functions as public API.
`src/memos/multi_mem_cube/single_cube.py`	Replaces inline text search logic with calls to `search_text_memories` for fast search.
`src/memos/mem_scheduler/optimized_scheduler.py`	Switches scheduler search paths to use centralized search functions and shared `SearchContext`.
`src/memos/mem_scheduler/memory_manage_modules/search_pipeline.py`	Extracts the search step of the scheduler retriever into a standalone `SearchPipeline`.
`src/memos/mem_scheduler/memory_manage_modules/retriever.py`	Refactors `SchedulerRetriever` into a façade composed of search/enhancement/rerank/filter pipelines.
`src/memos/mem_scheduler/memory_manage_modules/rerank_pipeline.py`	Encapsulates LLM-based memory reranking and combined memory post-processing in `RerankPipeline`.
`src/memos/mem_scheduler/memory_manage_modules/filter_pipeline.py`	Wraps `MemoryFilter` operations in a dedicated `FilterPipeline`.
`src/memos/mem_scheduler/memory_manage_modules/enhancement_pipeline.py`	Moves enhancement and recall logic into `EnhancementPipeline`, including batching and parallelization.
`src/memos/mem_scheduler/handlers/registry.py`	Adds `SchedulerHandlerRegistry` that instantiates handler classes and builds the label-to-handler dispatch map.
`src/memos/mem_scheduler/handlers/query_handler.py`	Implements `QueryMessageHandler` to log user queries and re-label them to memory-update tasks.
`src/memos/mem_scheduler/handlers/pref_add_handler.py`	Implements `PrefAddMessageHandler` to process preference-add messages concurrently and write to `PreferenceTextMemory`.
`src/memos/mem_scheduler/handlers/memory_update_handler.py`	Extracts long-memory update flow and retrieval triggering logic into `MemoryUpdateHandler`.
`src/memos/mem_scheduler/handlers/mem_reorganize_handler.py`	Adds `MemReorganizeMessageHandler` to process memory reorganization and emit merge logs.
`src/memos/mem_scheduler/handlers/mem_read_handler.py`	Adds `MemReadMessageHandler` to drive mem-reader processing and downstream knowledge-base logs.
`src/memos/mem_scheduler/handlers/feedback_handler.py`	Implements `FeedbackMessageHandler` to process feedback payloads and emit KB update logs in cloud environments.
`src/memos/mem_scheduler/handlers/context.py`	Defines `SchedulerHandlerContext` and `SchedulerHandlerServices` to inject scheduler dependencies into handlers.
`src/memos/mem_scheduler/handlers/base.py`	Introduces `BaseSchedulerHandler` to store the injected handler context.
`src/memos/mem_scheduler/handlers/answer_handler.py`	Implements `AnswerMessageHandler` to log assistant responses as messages.
`src/memos/mem_scheduler/handlers/add_handler.py`	Extracts add-memory logging (local vs cloud KB) into `AddMessageHandler`.
`src/memos/mem_scheduler/handlers/__init__.py`	Re-exports handler context and registry for use by the scheduler.
`src/memos/mem_scheduler/general_scheduler.py`	Simplifies `GeneralScheduler` to assemble handler context/registry and register handlers instead of inlined consumers; delegates memory ops to mixins.
`src/memos/mem_scheduler/base_scheduler.py`	Refactors core scheduler behavioral methods into separate mixins and updates `BaseScheduler` inheritance accordingly.
`src/memos/mem_scheduler/base_mixins/web_log_ops.py`	Moves web-log publishing and normalization logic into `BaseSchedulerWebLogMixin`.
`src/memos/mem_scheduler/base_mixins/queue_ops.py`	Moves message submission, dispatch, consumer loop, and queue monitoring into `BaseSchedulerQueueMixin`.
`src/memos/mem_scheduler/base_mixins/memory_ops.py`	Moves working/activation memory management and monitoring helpers into `BaseSchedulerMemoryMixin`.
`src/memos/mem_scheduler/base_mixins/__init__.py`	Re-exports the new scheduler base mixins.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/memos/search/search_service.py

src/memos/mem_scheduler/handlers/memory_update_handler.py

src/memos/mem_scheduler/base_mixins/memory_ops.py

src/memos/mem_scheduler/memory_manage_modules/search_pipeline.py

src/memos/mem_scheduler/memory_manage_modules/enhancement_pipeline.py

src/memos/mem_scheduler/base_mixins/memory_ops.py

- Fix TC001/TC002/TC003: move type-only imports into TYPE_CHECKING blocks - Fix RUF059: prefix unused variables with underscore - Fix typos: "Memorires" -> "Memories", "exeption" -> "exception" - Remove self-assignment: `text_mem_base = text_mem_base` - Remove unused `user_context` param from `build_search_context` - Restore original `QUERY_TASK_LABEL` in activation memory update - Apply ruff format to all modified files

refactor(scheduler): modularize handlers and search

695e597

- extract scheduler handlers into dedicated modules - split retriever into pipelines (search/enhance/rerank/filter) - centralize text search logic for API and scheduler Refs MemTensor#1003

Copilot AI review requested due to automatic review settings February 3, 2026 11:35

Copilot started reviewing on behalf of fancyboi999 February 3, 2026 11:35 View session

Copilot AI reviewed Feb 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(scheduler): modularize handlers and search pipelines #1004

refactor(scheduler): modularize handlers and search pipelines #1004

Uh oh!

fancyboi999 commented Feb 3, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

refactor(scheduler): modularize handlers and search pipelines #1004

Are you sure you want to change the base?

refactor(scheduler): modularize handlers and search pipelines #1004

Uh oh!

Conversation

fancyboi999 commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

How Has This Been Tested?

Checklist

Reviewer Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fancyboi999 commented Feb 3, 2026 •

edited

Loading