docs: update default embedding model to jina-embeddings-v2-small-en
Replace Xenova/bge-m3 (1024 dims, ~560 MB) with
Xenova/jina-embeddings-v2-small-en (512 dims, ~33 MB) as the default
embedding model. Update all docs, examples, site content, and tests.
Also fix bge-small/bge-base pooling from cls to mean per HuggingFace docs.
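The `cls` and `mean` pooling settings mentioned above differ only in how per-token vectors are collapsed into one embedding. A minimal sketch of both, plus the L2 normalization that `normalize: true` refers to, is shown here in TypeScript. The function names and the plain `number[][]` input are hypothetical illustrations and are not taken from the project or from any embedding library.

```typescript
// One row per token, one column per embedding dimension (hypothetical input shape).
type TokenEmbeddings = number[][];

// "cls" pooling: use the vector at the first token position.
function clsPool(tokens: TokenEmbeddings): number[] {
  const first = tokens[0];
  if (first === undefined) throw new Error("no tokens to pool");
  return [...first];
}

// "mean" pooling: average each dimension over tokens whose mask entry is non-zero,
// so padding tokens do not dilute the result.
function meanPool(tokens: TokenEmbeddings, mask: number[]): number[] {
  if (mask.length !== tokens.length) throw new Error("mask length must match token count");
  const kept = tokens.filter((_, i) => mask[i] !== 0);
  if (kept.length === 0) throw new Error("mask excludes every token");
  const dims = kept[0]?.length ?? 0;
  return Array.from({ length: dims }, (_, d) =>
    kept.reduce((acc, vec) => acc + (vec[d] ?? 0), 0) / kept.length,
  );
}

// L2 normalization: scale the vector to unit length (zero vectors are returned unchanged).
function l2Normalize(vec: number[]): number[] {
  const norm = Math.hypot(...vec);
  return norm === 0 ? [...vec] : vec.map((v) => v / norm);
}
```

Which pooling mode is correct depends on how the model was trained, so the value should follow the model card rather than be chosen freely.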
docs/concepts-docs-indexing.md (+1 -1: 1 addition, 1 deletion)
@@ -76,7 +76,7 @@ This means you can search for code examples by the symbols they define, or by se

 ### Step 4: Embed everything

-Each chunk is embedded into a vector using the configured model (default: `Xenova/bge-m3`). The embedding captures the **semantic meaning** of `title + content`, enabling similarity-based search.
+Each chunk is embedded into a vector using the configured model (default: `Xenova/jina-embeddings-v2-small-en`). The embedding captures the **semantic meaning** of `title + content`, enabling similarity-based search.

 Root nodes additionally get a `fileEmbedding` — embedded from `file path + h1 title` — used for file-level search ("find docs about authentication").
docs/configuration.md (+8 -8: 8 additions, 8 deletions)
@@ -2,7 +2,7 @@

 ## Zero-config mode

-No config file needed. Just run `graphmemory serve` in your project directory — the current directory becomes the project with sensible defaults (BGE-M3 q8 model, all graphs enabled).
+No config file needed. Just run `graphmemory serve` in your project directory — the current directory becomes the project with sensible defaults (jina-small q8 model, all graphs enabled).

 ## Config file

@@ -60,8 +60,8 @@ server:
 search: 120
 auth: 10
 model:
-  name: "Xenova/bge-m3"
-  pooling: "cls"
+  name: "Xenova/jina-embeddings-v2-small-en"
+  pooling: "mean"
   normalize: true
   dtype: "q8"
   queryPrefix: ""
@@ -100,7 +100,7 @@ projects:
 name: "Project Bot"
 email: "bot@example.com"
 model:
-  name: "Xenova/bge-m3"
+  name: "Xenova/jina-embeddings-v2-small-en"
 embedding:
   maxChars: 24000
 access:
@@ -111,7 +111,7 @@ projects:
 include: "**/*.md"
 exclude: "**/drafts/**"
 model:
-  name: "Xenova/bge-m3"
+  name: "Xenova/bge-m3" # override: use multilingual model for docs
docs/docker.md (+1 -1: 1 addition, 1 deletion)
@@ -76,7 +76,7 @@ docker compose up -d

 ### Model cache

-The default embedding model (`Xenova/bge-m3`, ~560 MB) downloads on first startup. Use a **named volume** so the model persists across container restarts.
+The default embedding model (`Xenova/jina-embeddings-v2-small-en`, ~33 MB) downloads on first startup. Use a **named volume** so the model persists across container restarts.
**jinaai/jina-embeddings-v2-base-code** — the default code graph model:
@@ -21,7 +21,7 @@ The embedding system converts text into high-dimensional vectors for semantic se
 - Pooling: `mean`
 - Normalization: L2-normalized

-The code graph uses a separate model inheritance chain (`codeModel`) so it can use a code-optimized model by default while other graphs use BGE-M3.
+The code graph uses a separate model inheritance chain (`codeModel`) so it can use a code-optimized model by default while other graphs use jina-small.
-The `model` parameter selects which embedding model to use: `"default"`(general, BGE-M3) or `"code"` (code-optimized, jina-code). Both models are loaded when the embedding API is enabled.
+The `model` parameter selects which embedding model to use: `"default"`(general, jina-small) or `"code"` (code-optimized, jina-code). Both models are loaded when the embedding API is enabled.

 ### Embedding API configuration

@@ -271,19 +280,19 @@ projects:
   my-app:
     projectDir: "/path/to/my-app"
     model:
-      name: "Xenova/bge-m3" # default for most graphs
+      name: "Xenova/bge-m3" # multilingual model for most graphs
       pooling: "cls"
       normalize: true
     graphs:
       files:
         model:
-          name: "Xenova/bge-small-en-v1.5" # smaller model for file paths
-          pooling: "cls"
+          name: "Xenova/jina-embeddings-v2-small-en" # lighter model for file paths
+          pooling: "mean"
           normalize: true
       code:
         model:
           name: "Xenova/bge-base-en-v1.5" # different model for code
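The YAML above sets a `model` block at the project level and overrides it per graph. One way such a lookup could work is sketched below in TypeScript. Everything here is an assumption made for illustration: the interface names, the `resolveModel` helper, and the rule that the most specific level replaces the block as a whole (the source does not say whether individual fields are merged across levels).

```typescript
// Hypothetical shape mirroring the keys visible in the YAML; not the project's real types.
interface ModelConfig {
  name: string;
  pooling?: "mean" | "cls";
  normalize?: boolean;
  dtype?: string;
}

// The three levels at which a `model` block appears in the documented config.
interface ModelLevels {
  server?: ModelConfig;
  project?: ModelConfig;
  graph?: ModelConfig;
}

// Assumption: the most specific level that defines a model wins as a whole block.
function resolveModel(levels: ModelLevels): ModelConfig {
  const resolved = levels.graph ?? levels.project ?? levels.server;
  if (resolved === undefined) throw new Error("no model configured at any level");
  return resolved;
}
```

Under this assumption, a graph without its own `model` block falls back to the project's block, and a project without one falls back to the server's.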
docs/indexer.md (+2 -2: 2 additions, 2 deletions)
@@ -25,8 +25,8 @@ flowchart TD
 During initial indexing, the three queues run **sequentially by phase** rather than concurrently. This ensures only one embedding model is loaded at a time, reducing peak memory:
docs/overview.md (+2 -2: 2 additions, 2 deletions)
@@ -10,7 +10,7 @@
 - **Stores knowledge** (facts, notes, decisions) in a dedicated knowledge graph with typed relations, file attachments, and cross-graph links
 - **Tracks tasks** with kanban workflow, priorities, due dates, estimates, assignees, and cross-graph links
 - **Manages skills** (reusable recipes/procedures) with steps, triggers, usage tracking, and cross-graph links
-- **Embeds every node** locally using `Xenova/bge-m3` by default (no external API calls); supports per-graph models with configurable pooling, normalization, dtype, and prefixes
+- **Embeds every node** locally using `Xenova/jina-embeddings-v2-small-en` by default (no external API calls); supports per-graph models with configurable pooling, normalization, dtype, and prefixes
 - **Answers search queries** via hybrid search (BM25 keyword + vector cosine similarity) with BFS graph expansion
 - **Watches for file changes** and re-indexes incrementally in real time

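The feature list above names vector cosine similarity as one half of hybrid search. A minimal sketch of that half alone follows, in TypeScript. The function names are hypothetical and not taken from the project, and the BM25 half and any score weighting are deliberately left out because the source does not describe them.

```typescript
// Dot product of two equal-length vectors.
function dot(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  return a.reduce((acc, v, i) => acc + v * (b[i] ?? 0), 0);
}

// Cosine similarity: dot product divided by the product of the vector lengths.
// Returns 0 when either vector has zero length.
function cosineSimilarity(a: number[], b: number[]): number {
  const denom = Math.hypot(...a) * Math.hypot(...b);
  return denom === 0 ? 0 : dot(a, b) / denom;
}
```

When embeddings are already L2-normalized, both vector lengths are 1, so `cosineSimilarity` gives the same result as `dot` up to floating-point rounding.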
@@ -45,7 +45,7 @@
 ## Requirements

 - **Node.js** >= 22
-- The default embedding model (`Xenova/bge-m3`, ~560 MB) downloads on first startup
+- The default embedding model (`Xenova/jina-embeddings-v2-small-en`, ~33 MB) downloads on first startup