
Commit 9090bca

localai-bot authored
feat: Add documentation for undocumented API endpoints (mudler#8852)
* feat: add documentation for undocumented API endpoints

  Creates comprehensive documentation for 8 previously undocumented endpoints:

  - Voice Activity Detection (/v1/vad)
  - Video Generation (/video)
  - Sound Generation (/v1/sound-generation)
  - Backend Monitor (/backend/monitor, /backend/shutdown)
  - Token Metrics (/tokenMetrics)
  - P2P endpoints (/api/p2p/* - 5 sub-endpoints)
  - System Info (/system, /version)

  Each documentation file includes HTTP method, request/response schemas, curl examples, sample JSON responses, and error codes.

* docs: remove token-metrics endpoint documentation per review feedback

  The token-metrics endpoint is not wired into the HTTP router and should not be documented, per reviewer request.

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* docs: move system-info documentation to reference section

  Per review feedback, the system-info endpoint docs are better suited to the reference section than to features.

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

---------

Co-authored-by: localai-bot <localai-bot@noreply.github.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
1 parent ec8f2d7 commit 9090bca

7 files changed

Lines changed: 665 additions & 0 deletions

File tree

docs/content/features/_index.en.md

Lines changed: 6 additions & 0 deletions
```diff
@@ -14,6 +14,10 @@ LocalAI provides a comprehensive set of features for running AI models locally.
 - **[Text Generation](text-generation/)** - Generate text with GPT-compatible models using various backends
 - **[Image Generation](image-generation/)** - Create images with Stable Diffusion and other diffusion models
 - **[Audio Processing](audio-to-text/)** - Transcribe audio to text and generate speech from text
+- **[Text to Audio](text-to-audio/)** - Generate speech from text with TTS models
+- **[Sound Generation](sound-generation/)** - Generate music and sound effects from text descriptions
+- **[Voice Activity Detection](voice-activity-detection/)** - Detect speech segments in audio data
+- **[Video Generation](video-generation/)** - Generate videos from text prompts and reference images
 - **[Embeddings](embeddings/)** - Generate vector embeddings for semantic search and RAG applications
 - **[GPT Vision](gpt-vision/)** - Analyze and understand images with vision-language models
 
@@ -24,6 +28,7 @@ LocalAI provides a comprehensive set of features for running AI models locally.
 - **[Constrained Grammars](constrained_grammars/)** - Control model output format with BNF grammars
 - **[GPU Acceleration](GPU-acceleration/)** - Optimize performance with GPU support
 - **[Distributed Inference](distributed_inferencing/)** - Scale inference across multiple nodes
+- **[P2P API](p2p/)** - Monitor and manage P2P worker and federated nodes
 - **[Model Context Protocol (MCP)](mcp/)** - Enable agentic capabilities with MCP integration
 - **[Agents](agents/)** - Autonomous AI agents with tools, knowledge base, and skills
 
@@ -34,6 +39,7 @@ LocalAI provides a comprehensive set of features for running AI models locally.
 - **[Stores](stores/)** - Vector similarity search for embeddings
 - **[Model Gallery](model-gallery/)** - Browse and install pre-configured models
 - **[Backends](backends/)** - Learn about available backends and how to manage them
+- **[Backend Monitor](backend-monitor/)** - Monitor backend status and resource usage
 - **[Runtime Settings](runtime-settings/)** - Configure application settings via web UI without restarting
 
 ## Getting Started
```
Lines changed: 93 additions & 0 deletions
+++
disableToc = false
title = "Backend Monitor"
weight = 20
url = "/features/backend-monitor/"
+++

LocalAI provides endpoints to monitor and manage running backends. The `/backend/monitor` endpoint reports the status and resource usage of loaded models, and `/backend/shutdown` allows stopping a model's backend process.

## Monitor API

- **Method:** `GET`
- **Endpoints:** `/backend/monitor`, `/v1/backend/monitor`

### Request

The request body is JSON:

| Parameter | Type     | Required | Description                  |
|-----------|----------|----------|------------------------------|
| `model`   | `string` | Yes      | Name of the model to monitor |

### Response

Returns a JSON object with the backend status:

| Field              | Type     | Description                                                               |
|--------------------|----------|---------------------------------------------------------------------------|
| `state`            | `int`    | Backend state: `0` = uninitialized, `1` = busy, `2` = ready, `-1` = error |
| `memory`           | `object` | Memory usage information                                                  |
| `memory.total`     | `uint64` | Total memory usage in bytes                                               |
| `memory.breakdown` | `object` | Per-component memory breakdown (key-value pairs)                          |

If the gRPC status call fails, the endpoint falls back to local process metrics:

| Field            | Type     | Description                    |
|------------------|----------|--------------------------------|
| `memory_info`    | `object` | Process memory info (RSS, VMS) |
| `memory_percent` | `float`  | Memory usage percentage        |
| `cpu_percent`    | `float`  | CPU usage percentage           |

### Usage

```bash
curl http://localhost:8080/backend/monitor \
  -H "Content-Type: application/json" \
  -d '{"model": "my-model"}'
```

### Example response

```json
{
  "state": 2,
  "memory": {
    "total": 1073741824,
    "breakdown": {
      "weights": 536870912,
      "kv_cache": 268435456
    }
  }
}
```
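A client can turn the numeric `state` into a readable label and scale the byte counts for display. A minimal sketch in Python (the `summarize_monitor` helper is hypothetical, not part of LocalAI; field names follow the example response above):

```python
# Interpret a /backend/monitor response client-side (illustrative sketch).
STATE_LABELS = {0: "uninitialized", 1: "busy", 2: "ready", -1: "error"}

def summarize_monitor(resp: dict) -> str:
    """Render a monitor response as a one-line human-readable summary."""
    label = STATE_LABELS.get(resp.get("state"), "unknown")
    total = resp.get("memory", {}).get("total", 0)
    return f"state={label} memory={total / (1024 ** 2):.0f} MiB"

example = {
    "state": 2,
    "memory": {"total": 1073741824,
               "breakdown": {"weights": 536870912, "kv_cache": 268435456}},
}
print(summarize_monitor(example))  # state=ready memory=1024 MiB
```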
## Shutdown API

- **Method:** `POST`
- **Endpoints:** `/backend/shutdown`, `/v1/backend/shutdown`

### Request

| Parameter | Type     | Required | Description                    |
|-----------|----------|----------|--------------------------------|
| `model`   | `string` | Yes      | Name of the model to shut down |

### Usage

```bash
curl -X POST http://localhost:8080/backend/shutdown \
  -H "Content-Type: application/json" \
  -d '{"model": "my-model"}'
```

### Response

Returns `200 OK` with a shutdown confirmation message on success.

## Error Responses

| Status Code | Description                       |
|-------------|-----------------------------------|
| 400         | Invalid or missing model name     |
| 500         | Backend error or model not loaded |
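Together, the two endpoints support a simple lifecycle: poll `/backend/monitor` until the model reports ready (`state` = 2), do the work, then call `/backend/shutdown`. A minimal sketch with the HTTP transport abstracted behind callables so the logic is self-contained (`run_with_model` is a hypothetical helper, not part of LocalAI):

```python
def run_with_model(monitor, shutdown, work, max_polls=30):
    """Wait until a backend reports ready, do some work, then shut it down.

    monitor  -- callable returning a parsed /backend/monitor response
    shutdown -- callable issuing POST /backend/shutdown for the model
    work     -- callable run once the backend is ready
    """
    for _ in range(max_polls):
        if monitor().get("state") == 2:  # 2 = ready (see table above)
            break
    else:
        raise TimeoutError("backend never became ready")
    try:
        return work()
    finally:
        shutdown()  # always release the backend, even if work() raises

# Simulated transport: the backend becomes ready on the second poll.
states = iter([1, 2])
calls = []
result = run_with_model(
    monitor=lambda: {"state": next(states)},
    shutdown=lambda: calls.append("shutdown"),
    work=lambda: "done",
)
print(result, calls)  # done ['shutdown']
```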

docs/content/features/p2p.md

Lines changed: 175 additions & 0 deletions
+++
disableToc = false
title = "P2P API"
weight = 22
url = "/features/p2p/"
+++

LocalAI supports peer-to-peer (P2P) networking for distributed inference. The P2P API endpoints allow you to monitor connected worker and federated nodes, retrieve the P2P network token, and get cluster statistics.

For an overview of distributed inference setup, see [Distributed Inference](/features/distributed_inferencing/).

## Endpoints

### List all P2P nodes

- **Method:** `GET`
- **Endpoint:** `/api/p2p`

Returns all worker and federated nodes in the P2P network.

#### Response

| Field             | Type    | Description             |
|-------------------|---------|-------------------------|
| `nodes`           | `array` | List of worker nodes    |
| `federated_nodes` | `array` | List of federated nodes |

Each node object:

| Field           | Type     | Description                          |
|-----------------|----------|--------------------------------------|
| `Name`          | `string` | Node name                            |
| `ID`            | `string` | Unique node identifier               |
| `TunnelAddress` | `string` | Network tunnel address               |
| `ServiceID`     | `string` | Service identifier                   |
| `LastSeen`      | `string` | ISO 8601 timestamp of last heartbeat |

#### Usage

```bash
curl http://localhost:8080/api/p2p
```

#### Example response

```json
{
  "nodes": [
    {
      "Name": "worker-1",
      "ID": "abc123",
      "TunnelAddress": "192.168.1.10:9090",
      "ServiceID": "worker",
      "LastSeen": "2025-01-15T10:30:00Z"
    }
  ],
  "federated_nodes": [
    {
      "Name": "federation-1",
      "ID": "def456",
      "TunnelAddress": "192.168.1.20:9090",
      "ServiceID": "federated",
      "LastSeen": "2025-01-15T10:30:05Z"
    }
  ]
}
```
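The response above can be consumed with any JSON client. A minimal sketch that groups node names by `ServiceID` (the `nodes_by_service` helper is hypothetical; field names follow the example response above):

```python
def nodes_by_service(resp: dict) -> dict:
    """Group node names from an /api/p2p response by their ServiceID."""
    groups = {}
    for node in resp.get("nodes", []) + resp.get("federated_nodes", []):
        groups.setdefault(node["ServiceID"], []).append(node["Name"])
    return groups

example = {
    "nodes": [{"Name": "worker-1", "ID": "abc123",
               "TunnelAddress": "192.168.1.10:9090",
               "ServiceID": "worker", "LastSeen": "2025-01-15T10:30:00Z"}],
    "federated_nodes": [{"Name": "federation-1", "ID": "def456",
                         "TunnelAddress": "192.168.1.20:9090",
                         "ServiceID": "federated",
                         "LastSeen": "2025-01-15T10:30:05Z"}],
}
print(nodes_by_service(example))  # {'worker': ['worker-1'], 'federated': ['federation-1']}
```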
---

### Get P2P token

- **Method:** `GET`
- **Endpoint:** `/api/p2p/token`

Returns the P2P network token used for node authentication.

#### Usage

```bash
curl http://localhost:8080/api/p2p/token
```

#### Response

Returns the token as a plain text string.

---

### List worker nodes

- **Method:** `GET`
- **Endpoint:** `/api/p2p/workers`

Returns worker nodes with online status.

#### Response

| Field                   | Type     | Description                          |
|-------------------------|----------|--------------------------------------|
| `nodes`                 | `array`  | List of worker nodes                 |
| `nodes[].name`          | `string` | Node name                            |
| `nodes[].id`            | `string` | Unique node identifier               |
| `nodes[].tunnelAddress` | `string` | Network tunnel address               |
| `nodes[].serviceID`     | `string` | Service identifier                   |
| `nodes[].lastSeen`      | `string` | Last heartbeat timestamp             |
| `nodes[].isOnline`      | `bool`   | Whether the node is currently online |

A node is considered online if it was last seen within the past 40 seconds.

#### Usage

```bash
curl http://localhost:8080/api/p2p/workers
```
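The 40-second liveness window can be reproduced client-side, e.g. to flag stale nodes from raw `lastSeen` timestamps. A minimal sketch (the `is_online` helper is hypothetical; the cutoff mirrors the rule stated above):

```python
from datetime import datetime, timedelta, timezone

ONLINE_WINDOW = timedelta(seconds=40)  # matches the documented rule

def is_online(last_seen: str, now: datetime) -> bool:
    """Return True if an ISO 8601 lastSeen timestamp is within 40 s of now."""
    seen = datetime.fromisoformat(last_seen.replace("Z", "+00:00"))
    return now - seen <= ONLINE_WINDOW

now = datetime(2025, 1, 15, 10, 30, 30, tzinfo=timezone.utc)
print(is_online("2025-01-15T10:30:00Z", now))  # True  (seen 30 s ago)
print(is_online("2025-01-15T10:29:00Z", now))  # False (seen 90 s ago)
```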
---

### List federated nodes

- **Method:** `GET`
- **Endpoint:** `/api/p2p/federation`

Returns federated nodes with online status. Same response format as `/api/p2p/workers`.

#### Usage

```bash
curl http://localhost:8080/api/p2p/federation
```

---

### Get P2P statistics

- **Method:** `GET`
- **Endpoint:** `/api/p2p/stats`

Returns aggregate statistics about the P2P cluster.

#### Response

| Field              | Type  | Description                      |
|--------------------|-------|----------------------------------|
| `workers.online`   | `int` | Number of online worker nodes    |
| `workers.total`    | `int` | Total worker nodes               |
| `federated.online` | `int` | Number of online federated nodes |
| `federated.total`  | `int` | Total federated nodes            |

#### Usage

```bash
curl http://localhost:8080/api/p2p/stats
```

#### Example response

```json
{
  "workers": {
    "online": 3,
    "total": 5
  },
  "federated": {
    "online": 2,
    "total": 2
  }
}
```
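A monitoring script could alert when part of the cluster is offline by diffing `total` against `online` per group. A minimal sketch over the stats response above (the `offline_counts` helper is hypothetical):

```python
def offline_counts(stats: dict) -> dict:
    """Compute offline node counts per group from an /api/p2p/stats response."""
    return {group: vals["total"] - vals["online"]
            for group, vals in stats.items()}

example = {"workers": {"online": 3, "total": 5},
           "federated": {"online": 2, "total": 2}}
print(offline_counts(example))  # {'workers': 2, 'federated': 0}
```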
## Error Responses

| Status Code | Description                                   |
|-------------|-----------------------------------------------|
| 500         | P2P subsystem not available or internal error |
