cursortab.nvim

A Neovim plugin that provides edit completions and cursor predictions.

Note

Help improve completions by contributing anonymous usage data to our open dataset. Set contribute_data = true in your config to opt in. No code content or file paths are collected — see the schema.

Requirements
Installation
Configuration
Usage
- Commands
Development
FAQ
Contributing
License

Requirements

Go 1.25.0+ (for building the server component)
Neovim 0.8+ (for the plugin)

Installation

Recommended starting points:

Best hosted: Mercury API
Best local next-edit: Zeta-2
Fastest local: Qwen3.5-0.8B with the inline provider, or Sweep 1.5B/0.5B with the sweep provider

Pick a provider below, then use the matching setup() call in your plugin config. See Providers for all available options.

Mercury API (hosted, no local GPU needed)

Get an API key from Inception Labs

Set the environment variable:

export MERCURY_AI_TOKEN="your-api-key-here"

Zeta-2 (local next-edit prediction)

Run llama.cpp:

llama-server -hf bartowski/zed-industries_zeta-2-GGUF:Q8_0 --port 8000

Qwen3.5-0.8B/Sweep (fastest local)

Run llama.cpp:

llama-server -hf unsloth/Qwen3.5-0.8B-GGUF:Q8_0 --port 8000
# llama-server -hf sweepai/sweep-next-edit-0.5b --port 8000
# llama-server -hf sweepai/sweep-next-edit-1.5b --port 8000

Using lazy.nvim

{
  "cursortab/cursortab.nvim",
  -- version = "*",  -- Use latest tagged version for more stability
  lazy = false,      -- The server is already lazy loaded
  build = "cd server && go build",
  config = function()
    require("cursortab").setup({
      provider = {
        -- Mercury API (hosted)
        type = "mercuryapi",
        api_key_env = "MERCURY_AI_TOKEN",

        -- Zeta-2 (best local)
        -- type = "zeta-2",
        -- url = "http://localhost:8000",

        -- Qwen3.5-0.8B (fastest local, defaults to "inline")
        -- url = "http://localhost:8000",

        -- sweep-next-edit-0.5B/1.5B (fastest local)
        -- type = "sweep",
        -- url = "http://localhost:8000",
      },
    })
  end,
}

Using packer.nvim

use {
  "cursortab/cursortab.nvim",
  -- tag = "*",  -- Use latest tagged version for more stability
  run = "cd server && go build",
  config = function()
    require("cursortab").setup({
      provider = {
        type = "mercuryapi",
        api_key_env = "MERCURY_AI_TOKEN",
      },
    })
  end
}

Configuration

Full config

require("cursortab").setup({
  enabled = true,
  log_level = "info",  -- "trace", "debug", "info", "warn", "error"
  state_dir = vim.fn.stdpath("state") .. "/cursortab",  -- Directory for runtime files (log, socket, pid)
  contribute_data = false,  -- Opt-in: send anonymous metrics to train a better gating model

  keymaps = {
    accept = "<Tab>",           -- Keymap to accept completion, or false to disable
    partial_accept = "<S-Tab>", -- Keymap to partially accept, or false to disable
    trigger = false,            -- Keymap to manually trigger completion, or false to disable
  },

  ui = {
    completions = {
      addition_style = "dimmed",  -- "dimmed" or "highlight"
      fg_opacity = 0.6,           -- opacity for completion overlays (0=invisible, 1=fully visible)
    },
    jump = {
      symbol = "",              -- Symbol shown for jump points
      text = " TAB ",            -- Text displayed after jump symbol
      show_distance = true,      -- Show line distance for off-screen jumps
    },
  },

  behavior = {
    idle_completion_delay = 50,  -- Delay in ms after idle to trigger completion (-1 to disable)
    text_change_debounce = 50,   -- Debounce in ms after text change to trigger completion (-1 to disable)
    max_visible_lines = 12,      -- Max visible lines per completion (0 to disable)
    disabled_in = {},                         -- Tree-sitter scopes to suppress completions (e.g., { "comment", "string" })
    enabled_modes = { "insert", "normal" },  -- Modes where completions are active
    cursor_prediction = {
      enabled = true,            -- Show jump indicators after completions
      auto_advance = true,       -- When no changes, show cursor jump to last line
      proximity_threshold = 2,   -- Min lines apart to show cursor jump (0 to disable)
    },
    ignore_paths = {             -- Glob patterns for files to skip completions
      "*.min.js",
      "*.min.css",
      "*.map",
      "*-lock.json",
      "*.lock",
      "*.sum",
      "*.csv",
      "*.tsv",
      "*.parquet",
      "*.zip",
      "*.tar",
      "*.gz",
      "*.pem",
      "*.key",
      ".env",
      ".env.*",
      "*.log",
    },
    ignore_filetypes = { "", "terminal" }, -- Filetypes to skip completions
    ignore_gitignored = true,    -- Skip files matched by .gitignore
  },

  provider = {
    type = "inline",                      -- Provider: "inline", "fim", "sweep", "sweepapi", "zeta", "zeta-2", "copilot", or "mercuryapi"
    url = "http://localhost:8000",        -- URL of the provider server
    api_key_env = "",                     -- Env var name for API key (e.g., "OPENAI_API_KEY")
    model = "",                           -- Model name
    temperature = 0.0,                    -- Sampling temperature
    context_size = 0,                     -- Max input context in tokens (0 = use max_tokens; inline/fim default: 1024)
    max_tokens = 512,                     -- Max tokens to generate (inline default: 64, fim default: 128)
    top_k = 50,                           -- Top-k sampling
    completion_timeout = 5000,            -- Timeout in ms for completion requests
    max_diff_history_tokens = 512,        -- Max tokens for diff history (0 = no limit)
    completion_path = "/v1/completions",  -- API endpoint path
    fim_tokens = {                        -- FIM tokens (for FIM provider)
      prefix = "<|fim_prefix|>",
      suffix = "<|fim_suffix|>",
      middle = "<|fim_middle|>",
      repo_name = "",                     -- Optional: "<|repo_name|>" enables cross-file context (auto-detected for Qwen models)
      file_sep = "",                      -- Optional: "<|file_sep|>" enables cross-file context (auto-detected for Qwen models)
    },
    privacy_mode = true,                  -- Don't send telemetry to provider
  },

  blink = {
    enabled = false,    -- Enable blink source
    ghost_text = true,  -- Show native ghost text alongside blink menu
  },

  debug = {
    immediate_shutdown = false,  -- Shutdown daemon immediately when no clients
  },
})

You can also run :help cursortab-config to see the configuration.

Highlight Groups

The plugin defines the following highlight groups with default = true, so you can override them in your colorscheme or config:

Group	Default	Purpose
`CursorTabDeletion`	`bg = "#4f2f2f"`	Background for deleted text
`CursorTabAddition`	`bg = "#394f2f"`	Background for added text
`CursorTabModification`	`bg = "#282e38"`	Background for modified text
`CursorTabCompletion`	`fg = "#80899c"`	Foreground for completion text
`CursorTabJumpSymbol`	`fg = "#373b45"`	Jump indicator symbol
`CursorTabJumpText`	`bg = "#373b45"`, `fg = "#bac1d1"`	Jump indicator text

To customize, set the highlight before or after calling setup():

vim.api.nvim_set_hl(0, "CursorTabAddition", { bg = "#1a3a1a" })

Providers

The plugin supports eight AI provider backends: Inline, FIM, Sweep, Sweep API, Zeta-2, Zeta (legacy), Copilot, and Mercury API.

Provider	Hosted	Multi-line	Multi-edit	Cursor Prediction	Streaming	Model
`inline`						Any base model
`fim`		✓			✓	Any FIM-capable
`sweep`		✓	✓	✓	✓	Sweep Next-Edit family
`sweepapi`	✓	✓	✓	✓	✓	`sweep-next-edit-7b`
`zeta-2`		✓	✓	✓	✓	`zeta-2` (SeedCoder-8B)
`zeta`		✓	✓	✓	✓	`zeta` (Qwen2.5-Coder)
`copilot`	✓	✓	✓	✓		GitHub Copilot
`mercuryapi`	✓	✓	✓	✓		`mercury-edit`

Context Per Provider:

Context	inline	fim	sweep	zeta-2	zeta	sweepapi	mercuryapi
Buffer content	✓	✓	✓	✓	✓	✓	✓
Edit history		✓°	✓	✓	✓	✓	✓
Previous file state			✓			✓
LSP diagnostics		✓°	✓	✓	✓	✓	✓
Treesitter context		✓°	✓	✓	✓	✓	✓
Git diff context		✓°	✓	✓	✓	✓	✓
Recent files		✓°	✓	✓	✓	✓	✓
User actions			✓			✓

° FIM cross-file context requires repo-level tokens (repo_name, file_sep). Auto-detected for Qwen models; set manually for other models that support them.

Benchmarks

Measured on 50 scenarios (25 quality + 25 suppress) using the eval harness. Sorted by Score (higher = better):

Score — deltaChrF × gateScore / 100 where gateScore = 2 × showRate × quietRate / (showRate + quietRate). Combines edit quality with gating behavior into a single metric.
deltaChrF — edit quality when shown (character n-gram F-score on the diff region)
Show rate — fraction of quality scenarios where a completion was shown
Quiet rate — fraction of suppress scenarios where the provider correctly produced nothing

Target	Type	Score	deltaChrF	Show rate	Quiet rate	p50 (ms)	p90 (ms)
mercuryapi	mercuryapi	0.58	64.4	100%	81%	565	739
zeta-2	zeta-2	0.56	61.5	88%	96%	551	833
zeta	zeta	0.55	60.9	88%	92%	413	661
qwen3.5-27B	fim	0.23	32.2	76%	68%	131	647
sweep-next-edit-7B	sweep	0.22	45.2	64%	40%	237	474
sweep-next-edit-1.5B	sweep	0.20	41.9	68%	36%	155	258
qwen3.5-4B	fim	0.18	27.1	76%	60%	1254	1339
qwen3.5-0.8B	fim	0.18	31.4	84%	44%	49	509
qwen3.5-2B	fim	0.17	29.4	84%	44%	89	735
copilot	copilot	0.13	22.3	40%	100%	351	915
sweep-next-edit-0.5B	sweep	0.10	23.0	52%	40%	126	201
sweepapi	sweepapi	0.08	16.4	32%	100%	156	300

Inline Provider (Default)

Details

Single-line completion using any OpenAI-compatible /v1/completions endpoint.

require("cursortab").setup({
  provider = {
    type = "inline",
    url = "http://localhost:8000",
  },
})

llama-server -hf unsloth/Qwen3.5-0.8B-GGUF:Q8_0 --port 8000

FIM Provider

Details

Fill-in-the-Middle multi-line completion. Compatible with Qwen, DeepSeek-Coder, and similar FIM-capable models.

From experimentation, FIM models need to be >7B models to have consistent results.

require("cursortab").setup({
  provider = {
    type = "fim",
    url = "http://localhost:8000",
    max_tokens = 256,
  },
})

llama-server -hf unsloth/Qwen3.5-0.8B-GGUF:Q8_0 --port 8000

Sweep Provider

Details

Local next-edit prediction using Sweep models: sweep-next-edit-v2-7B, sweep-next-edit-1.5B, sweep-next-edit-0.5B.

require("cursortab").setup({
  provider = {
    type = "sweep",
    url = "http://localhost:8000",
  },
})

llama-server -hf sweepai/sweep-next-edit-1.5b --port 8000

Sweep API Provider

Details

Hosted Sweep API — no local model required. Get a token from sweep.dev.

export SWEEPAPI_TOKEN="your-api-token-here"

require("cursortab").setup({
  provider = {
    type = "sweepapi",
    api_key_env = "SWEEPAPI_TOKEN",
  },
})

Zeta-2 Provider

Details

Zed's Zeta-2 (SeedCoder-8B). Run it locally with llama.cpp.

require("cursortab").setup({
  provider = {
    type = "zeta-2",
    url = "http://localhost:8000",
  },
})

llama-server -hf bartowski/zed-industries_zeta-2-GGUF:Q8_0 --port 8000

Zeta Provider (legacy)

Details

Zed's original Zeta model (Qwen2.5-Coder-7B). Superseded by Zeta-2.

require("cursortab").setup({
  provider = {
    type = "zeta",
    url = "http://localhost:8000",
  },
})

llama-server -hf bartowski/zed-industries_zeta-GGUF:Q8_0 --port 8000

Copilot Provider

Details

GitHub Copilot via copilot-language-server. Requires a Copilot subscription and vim.lsp.enable.

require("cursortab").setup({
  provider = {
    type = "copilot",
  },
})

Mercury API Provider

Details

Hosted Mercury next-edit model by Inception Labs.

export MERCURY_AI_TOKEN="your-api-key-here"

require("cursortab").setup({
  provider = {
    type = "mercuryapi",
    api_key_env = "MERCURY_AI_TOKEN",
  },
})

blink.cmp Integration

Details

This integration exposes a minimal blink source that only consumes append_chars (end-of-line ghost text). Complex diffs (multi-line edits, replacements, deletions, cursor prediction UI) still render via the native UI.

require("cursortab").setup({
  keymaps = {
    accept = false, -- Let blink manage <Tab>
  },
  blink = {
    enabled = true,
    ghost_text = false,  -- Disable native ghost text
  },
})

require("blink.cmp").setup({
  sources = {
    providers = {
      cursortab = {
        module = "cursortab.blink",
        name = "cursortab",
        async = true,
        -- Should match provider.completion_timeout in cursortab config
        timeout_ms = 5000,
        score_offset = 50, -- Higher priority among suggestions
      },
    },
  },
})

Usage

Tab Key: Navigate to cursor predictions or accept completions
Shift-Tab Key: Partially accept completions (word-by-word for inline, line-by-line for multi-line)
Esc Key: Reject current completions
The plugin automatically shows jump indicators for predicted cursor positions
Visual indicators appear for additions, deletions, and completions
Off-screen jump targets show directional arrows with distance information

Commands

:CursortabToggle: Toggle the plugin on/off
:CursortabShowLog: Show the cursortab log file in a new buffer
:CursortabClearLog: Clear the cursortab log file
:CursortabStatus: Show detailed status information about the plugin and daemon
:CursortabRestart: Restart the cursortab daemon process

Development

# Build the server
cd server && go build

# Run all tests
cd server && go test ./...

For E2E tests, eval harness, and detailed development instructions, see CONTRIBUTING.md.

FAQ

Why are completions slow?

Use a smaller or more heavily quantized model (e.g., Q4 instead of Q8)
Decrease provider.max_tokens to reduce output length
Decrease provider.context_size to reduce input context sent to the model

The copilot provider is known to be slower than the rest.

Why are completions not working?

Update to the latest version, restart Neovim and restart the daemon with :CursortabRestart
Increase provider.completion_timeout (default: 5000ms) to 10000 or more if your model is slow
Increase provider.context_size to give the model more surrounding context (trade-off: slower completions)

How do I update the plugin?

Use your Neovim plugin manager to pull the latest changes, then restart Neovim. The daemon will automatically restart if the configuration has changed. You can also run :CursortabRestart to force a restart.

Why isn't my API key or environment variable being picked up?

The plugin runs a background daemon that persists after Neovim closes. Environment variables are only loaded when the daemon starts. If you add or change an environment variable (e.g., SWEEPAPI_TOKEN in your .zshrc), run :CursortabRestart to restart the daemon with the new environment variables.

Note: If you change plugin configuration (e.g., switch providers), the daemon will automatically restart on the next setup() call.

Contributing

Contributions are welcome! See CONTRIBUTING.md for build, test, and eval instructions.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 291 Commits
.claude/skills/config		.claude/skills/config
.github/workflows		.github/workflows
assets		assets
doc		doc
lua/cursortab		lua/cursortab
scripts		scripts
server		server
.gitattributes		.gitattributes
.gitignore		.gitignore
.luarc.json		.luarc.json
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
RELEASE.md		RELEASE.md
justfile		justfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cursortab.nvim

Requirements

Installation

Mercury API (hosted, no local GPU needed)

Zeta-2 (local next-edit prediction)

Qwen3.5-0.8B/Sweep (fastest local)

Using lazy.nvim

Using packer.nvim

Configuration

Highlight Groups

Providers

Benchmarks

Inline Provider (Default)

FIM Provider

Sweep Provider

Sweep API Provider

Zeta-2 Provider

Zeta Provider (legacy)

Copilot Provider

Mercury API Provider

blink.cmp Integration

Usage

Commands

Development

FAQ

Contributing

License

About

Uh oh!

Releases 42

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

cursortab.nvim

Requirements

Installation

Mercury API (hosted, no local GPU needed)

Zeta-2 (local next-edit prediction)

Qwen3.5-0.8B/Sweep (fastest local)

Using lazy.nvim

Using packer.nvim

Configuration

Highlight Groups

Providers

Benchmarks

Inline Provider (Default)

FIM Provider

Sweep Provider

Sweep API Provider

Zeta-2 Provider

Zeta Provider (legacy)

Copilot Provider

Mercury API Provider

blink.cmp Integration

Usage

Commands

Development

FAQ

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 42

Contributors

Uh oh!

Languages