task: Performance & memory on large repositories #593

Description

@DeusData

Scope

Umbrella tracker for performance & memory on large repositories: indexing that never finishes, times out, or grows memory without bound on big codebases. This is distinct from #390 (crashes/segfaults/memory-safety) and #391 (silent corruption). Here indexing is too slow or too memory-hungry, rather than crashing or falsely reporting success.

Sub-issues

Related

Acceptance

  1. Indexing of a large public repo completes within a bounded time and memory envelope (capture a baseline corpus).
  2. Worker count and memory budget honor container (cgroup) limits and the explicit override.
  3. No unbounded growth over long-running sessions — reproduce the leak in a soak test before fixing.
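
To make acceptance item 2 concrete, here is a minimal sketch of cgroup-aware worker sizing. It is not the project's implementation. It assumes cgroup v2, where `cpu.max` holds `<quota> <period>` or `max <period>` and `memory.max` holds a byte count or `max`. The function names, the `per_worker_bytes` estimate, and the rule that an explicit override wins over detected limits are all assumptions made for illustration.

```python
import math
import os
from pathlib import Path
from typing import Optional


def parse_cpu_max(text: str) -> Optional[float]:
    """Parse cgroup v2 cpu.max into a CPU count, or None if no quota is set."""
    fields = text.split()
    if len(fields) != 2 or fields[0] == "max":
        return None
    quota, period = int(fields[0]), int(fields[1])
    return quota / period if period > 0 else None


def parse_memory_max(text: str) -> Optional[int]:
    """Parse cgroup v2 memory.max into bytes, or None if unlimited."""
    value = text.strip()
    if not value or value == "max":
        return None
    return int(value)


def pick_worker_count(
    host_cpus: int,
    cpu_limit: Optional[float],
    mem_limit: Optional[int],
    per_worker_bytes: int,
    override: Optional[int] = None,
) -> int:
    """Choose a worker count; an explicit override wins (assumed policy)."""
    if override is not None:
        return max(1, override)
    workers = host_cpus
    if cpu_limit is not None:
        # Round a fractional quota down, but never below one worker.
        workers = min(workers, max(1, math.floor(cpu_limit)))
    if mem_limit is not None:
        # Cap workers so their estimated footprint fits the memory limit.
        workers = min(workers, max(1, mem_limit // per_worker_bytes))
    return max(1, workers)


def read_text(path: Path) -> Optional[str]:
    """Return file contents, or None if the file is missing or unreadable."""
    try:
        return path.read_text()
    except OSError:
        return None


def detect_worker_count(
    cgroup_root: Path = Path("/sys/fs/cgroup"),
    per_worker_bytes: int = 512 * 1024 * 1024,  # hypothetical per-worker estimate
    override: Optional[int] = None,
) -> int:
    """Read cgroup v2 limits under cgroup_root and size the worker pool."""
    cpu_text = read_text(cgroup_root / "cpu.max")
    mem_text = read_text(cgroup_root / "memory.max")
    cpu_limit = parse_cpu_max(cpu_text) if cpu_text is not None else None
    mem_limit = parse_memory_max(mem_text) if mem_text is not None else None
    host_cpus = os.cpu_count() or 1
    return pick_worker_count(host_cpus, cpu_limit, mem_limit, per_worker_bytes, override)
```

On a host without cgroup v2 files, `detect_worker_count()` falls back to the host CPU count. A real implementation would probably also need the cgroup v1 paths and a measured per-worker memory figure instead of a fixed guess.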

Why one task

These share the indexing throughput + memory-budget infrastructure (worker pool sizing, GLR merge, dump phase, long-running watcher). Grouping them keeps the large-repo experience coherent.
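
For the long-running watcher case (acceptance item 3), a soak test needs some way to tell steady growth from a one-time warm-up plateau. The sketch below is one possible check, not an agreed design. It samples resident set size from `/proc/<pid>/status` (Linux only) and fits a least-squares slope over the samples taken after warm-up. The function names and the `warmup` / `max_slope_kib` thresholds are hypothetical.

```python
from pathlib import Path
from typing import Optional, Sequence


def read_rss_kib(pid: int) -> Optional[int]:
    """Return VmRSS in KiB from /proc/<pid>/status, or None if unavailable."""
    try:
        status = Path(f"/proc/{pid}/status").read_text()
    except OSError:
        return None
    for line in status.splitlines():
        if line.startswith("VmRSS:"):
            return int(line.split()[1])
    return None


def slope_per_sample(samples: Sequence[float]) -> float:
    """Least-squares slope of the samples against their index (0, 1, 2, ...)."""
    n = len(samples)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(samples) / n
    cov = sum((i - mean_x) * (y - mean_y) for i, y in enumerate(samples))
    var = sum((i - mean_x) ** 2 for i in range(n))
    return cov / var


def looks_unbounded(
    samples: Sequence[float],
    warmup: int = 0,
    max_slope_kib: float = 1.0,
) -> bool:
    """True if memory still grows faster than max_slope_kib per sample after warm-up."""
    steady = samples[warmup:]
    return slope_per_sample(steady) > max_slope_kib
```

A soak harness would call `read_rss_kib` on the indexer process at a fixed interval while the watcher runs, then fail the run if `looks_unbounded` returns true. The slope test is deliberately simple. It will miss slow leaks below the threshold and can be fooled by periodic spikes, so the thresholds would need tuning against the baseline corpus.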

Labels: bug (Something isn't working), stability/performance (Server crashes, OOM, hangs, high CPU/memory), task (Umbrella task grouping multiple related issues)
