feat(parser): migrate deepseek tui provider by mariusvniekerk · Pull Request #758 · kenn-io/agentsview

mariusvniekerk · 2026-06-19T18:51:43Z

DeepSeek TUI has a shallow one-file-per-session JSON layout, so moving it next keeps the provider migration incremental while exercising the JSON source helper with non-JSONL extensions.

The provider preserves legacy discovery filters for latest and offline queue files, raw/full ID lookup, changed-path classification, fingerprint propagation, and parse normalization without changing runtime sync dispatch.

roborev-ci · 2026-06-19T18:56:08Z

roborev: Combined Review (`b601036`)

Summary verdict: One medium-severity issue remains; no high or critical findings were reported.

Medium

internal/parser/deepseek_tui_provider.go:111
The new source set omits Hash: true, so provider.Fingerprint will never populate SourceFingerprint.Hash. Parse only copies req.Fingerprint.Hash, which means the migrated DeepSeek provider will stop persisting the file_hash that the existing DeepSeek sync path computes.
Fix: Enable hashing in JSONLSourceSetOptions for DeepSeek TUI, and update the provider test to call Fingerprint and verify the parsed session receives that hash.
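To illustrate the finding above, here is a minimal sketch of an option-gated fingerprint. The types `sourceSetOptions` and `fingerprint` and the helpers `fingerprintFile` and `hashesFor` are hypothetical stand-ins, not the project's actual `JSONLSourceSetOptions` or `SourceFingerprint`, and the choice of SHA-256 is an assumption. It only shows why an unset `Hash` option leaves the fingerprint hash empty.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"os"
	"path/filepath"
)

// sourceSetOptions is a hypothetical stand-in for JSONLSourceSetOptions.
type sourceSetOptions struct {
	Hash bool // when false, fingerprints carry no content hash
}

// fingerprint is a hypothetical stand-in for SourceFingerprint.
type fingerprint struct {
	Size int64
	Hash string
}

// fingerprintFile reads the file and records its size. It adds a hex-encoded
// SHA-256 of the content only when opts.Hash is set (SHA-256 is an assumption).
func fingerprintFile(path string, opts sourceSetOptions) (fingerprint, error) {
	data, err := os.ReadFile(path)
	if err != nil {
		return fingerprint{}, err
	}
	fp := fingerprint{Size: int64(len(data))}
	if opts.Hash {
		sum := sha256.Sum256(data)
		fp.Hash = hex.EncodeToString(sum[:])
	}
	return fp, nil
}

// hashesFor writes content to a temporary session file and returns the
// fingerprint hash with hashing disabled and then enabled.
func hashesFor(content string) (off, on string, err error) {
	dir, err := os.MkdirTemp("", "fingerprint-sketch")
	if err != nil {
		return "", "", err
	}
	defer os.RemoveAll(dir)

	path := filepath.Join(dir, "session.json")
	if err := os.WriteFile(path, []byte(content), 0o600); err != nil {
		return "", "", err
	}
	without, err := fingerprintFile(path, sourceSetOptions{})
	if err != nil {
		return "", "", err
	}
	with, err := fingerprintFile(path, sourceSetOptions{Hash: true})
	if err != nil {
		return "", "", err
	}
	return without.Hash, with.Hash, nil
}

func main() {
	off, on, err := hashesFor(`{"id":"session_123"}`)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("Hash=false -> %q\n", off)
	fmt.Printf("Hash=true  -> %d hex chars\n", len(on))
}
```

A parse step that copies only the fingerprint's hash would therefore persist an empty value unless the option is enabled.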

Panel: ci_default_security | Synthesis: codex, 6s | Members: codex_default (codex/default, done, 4m0s), codex_security (codex/security, done, 1m19s) | Total: 5m25s

roborev-ci · 2026-06-20T01:53:10Z

roborev: Combined Review (`808b17a`)

No issues found.

Panel: ci_default_security | Synthesis: codex | Members: codex_default (claude-code/default, done, 4m5s), codex_security (claude-code/security, done, 41s) | Total: 4m46s

roborev-ci · 2026-06-21T00:55:44Z

roborev: Combined Review (`4b11d8c`)

Summary verdict: One medium issue remains; no critical or high findings were reported.

Medium

internal/parser/deepseek_tui_provider.go:111: The new DeepSeek TUI provider never requests a content hash, but Parse only preserves sess.File.Hash when req.Fingerprint.Hash is populated. The legacy DeepSeek path computes file_hash plus inode/device metadata, so ProviderMigrationShadowCompare can report mismatches for successful parses and would drop that metadata if this provider becomes authoritative.
- Fix: Enable hashing for the DeepSeek source set and carry file identity through the provider fingerprint/parse path, or otherwise make the provider output match the legacy processDeepSeekTUI session metadata. Add a caller-level shadow comparison test that exercises the real legacy process result.
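The mismatch described above can be sketched as a field-by-field comparison. The `fileMeta` type and `diffFileMeta` function are hypothetical and only model the three fields named in the review; they are not the project's shadow comparison code.

```go
package main

import "fmt"

// fileMeta is a hypothetical subset of the session file metadata named in
// the review: content hash plus inode/device identity.
type fileMeta struct {
	Hash   string
	Inode  uint64
	Device uint64
}

// diffFileMeta returns the names of the fields that differ between the
// legacy result and the provider result, in a fixed order.
func diffFileMeta(legacy, provider fileMeta) []string {
	var diffs []string
	if legacy.Hash != provider.Hash {
		diffs = append(diffs, "Hash")
	}
	if legacy.Inode != provider.Inode {
		diffs = append(diffs, "Inode")
	}
	if legacy.Device != provider.Device {
		diffs = append(diffs, "Device")
	}
	return diffs
}

func main() {
	legacy := fileMeta{Hash: "abc123", Inode: 42, Device: 7}
	provider := fileMeta{} // the provider populated none of these fields

	fmt.Println(diffFileMeta(legacy, provider))    // prints [Hash Inode Device]
	fmt.Println(len(diffFileMeta(legacy, legacy))) // prints 0
}
```

Under this model, a parse that succeeds on both paths still reports three differences until the provider carries the same metadata.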

Panel: ci_default_security | Synthesis: codex, 8s | Members: codex_default (codex/default, done, 6m31s), codex_security (codex/security, done, 1m9s) | Total: 7m48s

mariusvniekerk · 2026-06-21T01:36:59Z

This change is part of the following stack:

Design parser provider facade layer #748
- Add parser provider facade core #751
  - Add JSONL source set helper #752
    - feat(parser): add directory JSONL source helper #756
      - feat(parser): migrate commandcode and iflow providers #757
        
        - Migrate gptme to parser provider facade #753
        - feat(parser): migrate deepseek tui provider #758 ◀
        - feat(parser): migrate amp and zencoder providers #759
        - feat(parser): migrate pi provider #760
        - feat(parser): migrate qwen provider #761
        - feat(parser): migrate workbuddy provider #762
        - feat(parser): migrate cortex provider #763
        - feat(parser): migrate kimi provider #764
        - feat(parser): migrate claw providers #766
        - feat(parser): migrate qwenpaw provider #767
        - feat(parser): migrate openhands provider #768
        - feat(parser): migrate cursor provider #769
        - feat(parser): migrate vibe provider #770
        - feat(parser): migrate hermes provider #771
        - feat(parser): migrate claude provider #772
        - feat(parser): migrate cowork provider #773
        - feat(parser): migrate opencode-family providers #774
        - feat(parser): migrate codex provider #775
        - feat(parser): migrate gemini copilot providers #776
        - feat(parser): migrate copilot ide providers #778
        - feat(parser): migrate positron provider #779
        - feat(parser): migrate zed shelley providers #780
        - feat(parser): migrate kiro providers #781
        - feat(parser): migrate antigravity providers #782
        - feat(parser): migrate db-backed providers #783
        - fix(parser): require explicit provider factories #784

Change managed by git-spice.

roborev-ci · 2026-06-21T01:41:44Z

roborev: Combined Review (`282fb83`)

Summary verdict: one medium issue remains; no high or critical findings were reported.

Medium

internal/parser/deepseek_tui_provider.go:95 - The new DeepSeek TUI provider does not populate File.Hash, File.Inode, or File.Device the way the legacy processDeepSeekTUI path does. newDeepSeekTUISourceSet also does not enable hashing, so shadow compare will report session mismatches for valid DeepSeek files when this mode is enabled. Fix by computing/copying the same file hash and identity metadata as the legacy path, and add an engine-level shadow comparison test that asserts no mismatches for DeepSeek TUI.

Panel: ci_default_security | Synthesis: codex, 13s | Members: codex_default (codex/default, done, 5m23s), codex_security (codex/security, done, 1m41s) | Total: 7m17s

roborev-ci · 2026-06-21T01:55:50Z

roborev: Combined Review (`adb7150`)

Summary verdict: one medium issue remains; no high or critical findings were reported.

Medium

internal/parser/deepseek_tui_provider.go:95
The new provider does not preserve DeepSeek TUI file metadata parity with the legacy sync path. The legacy processDeepSeekTUI populates Session.File.Hash, Inode, and Device, but real provider parses only copy req.Fingerprint.Hash, and the source set is not configured to compute that hash. In shadow mode this can report mismatches for otherwise equivalent sessions, and provider-authoritative mode would stop persisting those fields.

Fix: Populate the provider fingerprint/parse result with the same hash and file identity fields as the legacy path, and add a shadow comparison test that compares against the full legacy process result rather than only ParseDeepSeekTUISession.
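For reference, file identity of the kind mentioned above (inode and device) can be read from the standard library on Unix-like systems. This is a sketch only: `fileIdentity` and `stableIdentity` are hypothetical helpers, the code compiles only where `syscall.Stat_t` exists, and it is not a claim about how the legacy path obtains these values.

```go
package main

import (
	"fmt"
	"os"
	"syscall"
)

// fileIdentity returns the inode and device numbers for path on Unix-like
// systems. ok is false when the platform does not expose *syscall.Stat_t.
func fileIdentity(path string) (inode, device uint64, ok bool, err error) {
	info, err := os.Stat(path)
	if err != nil {
		return 0, 0, false, err
	}
	st, isStat := info.Sys().(*syscall.Stat_t)
	if !isStat {
		return 0, 0, false, nil
	}
	return uint64(st.Ino), uint64(st.Dev), true, nil
}

// stableIdentity creates a temporary file and reports whether two consecutive
// lookups return the same inode/device pair.
func stableIdentity() (bool, error) {
	f, err := os.CreateTemp("", "identity-sketch-*.json")
	if err != nil {
		return false, err
	}
	defer os.Remove(f.Name())
	defer f.Close()

	i1, d1, ok1, err := fileIdentity(f.Name())
	if err != nil {
		return false, err
	}
	i2, d2, ok2, err := fileIdentity(f.Name())
	if err != nil {
		return false, err
	}
	return ok1 && ok2 && i1 == i2 && d1 == d2, nil
}

func main() {
	same, err := stableIdentity()
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("identity stable across lookups:", same)
}
```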

Panel: ci_default_security | Synthesis: codex, 7s | Members: codex_default (codex/default, done, 5m13s), codex_security (codex/security, done, 45s) | Total: 6m5s

roborev-ci · 2026-06-24T01:07:57Z

roborev: Combined Review (`990379a`)

Synthesis unavailable. Showing individual review outputs.

claude-code — default (done)

Review Findings

Severity: Medium
Location: internal/parser/deepseek_tui_provider.go, newDeepSeekTUISourceSet (JSONLSourceSetOptions block)
Problem: The migration drops the persisted per-session content hash. The legacy processDeepSeekTUI always computed ComputeFileHash(file.Path) and stored it in sess.File.Hash. In the new provider, Parse only sets sess.File.Hash = req.Fingerprint.Hash when that value is non-empty, and the fingerprint's Hash is populated only when the source set enables Hash: true (see JSONLSourceSet.Fingerprint). newDeepSeekTUISourceSet does not set Hash: true, so Fingerprint.Hash is always "" in the real engine path and DeepSeek TUI sessions are now persisted with an empty file_hash. This diverges from the sibling JSONL providers migrated in the same stack: newIflowSourceSet and newGptmeSourceSet both set Hash: true, with iflow even commenting that it "mirrors the legacy sync path, which set the session file hash from a full content hash of the transcript." The TestDeepSeekTUIProviderParse test masks this because it injects Fingerprint{Hash: "abc123"} manually rather than going through provider.Fingerprint. Practical impact is limited (DeepSeek skip/change-detection is mtime+size based, not hash based, so updates are not missed), but it silently stops populating a DB column that was previously written and is still written for other agents, and will cause a one-time PG re-push churn for existing DeepSeek sessions as their fingerprint flips to the empty hash.
Fix: Add Hash: true to the JSONLSourceSetOptions in newDeepSeekTUISourceSet, matching iflow/gptme; optionally extend TestDeepSeekTUIProviderParse to assert provider.Fingerprint(...) returns a non-empty hash so the regression is caught without manually supplying one.

Summary

Migrates the DeepSeek TUI agent from legacy discover/find/parse functions to the Provider facade backed by JSONLSourceSet (adding a FollowSymlinkFiles option), but omits Hash: true, dropping the persisted session content hash that the legacy path and sibling provider migrations preserve.

claude-code — security (done)

I've reviewed the diff and traced the security-relevant control flow into the implementations not shown in the diff (JSONLSourceSet.FindSource, sourceForPath, and the symlink-resolution helpers).

This commit migrates the DeepSeek TUI parser from standalone discovery/find/parse functions to the Provider/JSONLSourceSet facade. I focused on the two security-relevant surfaces:

Path-traversal on session-ID lookup (the one attacker-influenceable input path). The legacy FindDeepSeekTUISourceFile guarded against traversal with IsValidSessionID(rawID) before filepath.Join(root, rawID+".json"). The replacement (JSONLSourceSet.FindSource, lines 175–193) is strictly stronger: it still validates RawSessionID with IsValidSessionID, and rather than joining the raw ID into a path, it only equality-matches the ID against already-discovered sources under configured roots. StoredFilePath/FingerprintKey lookups go through sourceForPath, which constrains the path with pathAllowedByRoot (must be under a configured root) and pathIncluded. So even an authenticated remote-mode caller passing a crafted session ID cannot escape the configured roots. The retained ../session_123 → not-found test confirms parity. Protection preserved.
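The lookup shape described above can be sketched as follows. Everything here is illustrative: `validID` is a hypothetical approximation of `IsValidSessionID` (its real rules are not shown in this thread), and `findSource` and `underRoot` are simplified stand-ins for `JSONLSourceSet.FindSource` and `pathAllowedByRoot`.

```go
package main

import (
	"fmt"
	"path/filepath"
	"regexp"
	"strings"
)

// validID is a hypothetical approximation of IsValidSessionID: letters,
// digits, underscore and hyphen only, so "../" can never pass.
var validID = regexp.MustCompile(`^[A-Za-z0-9_-]+$`)

// source is a hypothetical discovered session file.
type source struct {
	ID   string
	Path string
}

// findSource validates rawID and then equality-matches it against sources
// that were already discovered. It never builds a path from rawID.
func findSource(discovered []source, rawID string) (source, bool) {
	if !validID.MatchString(rawID) {
		return source{}, false
	}
	for _, s := range discovered {
		if s.ID == rawID {
			return s, true
		}
	}
	return source{}, false
}

// underRoot reports whether path is root itself or lies beneath it.
func underRoot(root, path string) bool {
	rel, err := filepath.Rel(root, path)
	if err != nil {
		return false
	}
	return rel != ".." && !strings.HasPrefix(rel, ".."+string(filepath.Separator))
}

func main() {
	root := "/home/u/.deepseek/sessions"
	srcs := []source{{ID: "session_123", Path: root + "/session_123.json"}}

	_, ok := findSource(srcs, "../session_123")
	fmt.Println(ok) // prints false

	s, ok := findSource(srcs, "session_123")
	fmt.Println(ok, underRoot(root, s.Path)) // prints true true

	fmt.Println(underRoot(root, "/etc/passwd")) // prints false
}
```

Matching against discovered sources means a crafted ID can at worst fail to match; it cannot name a file outside the configured roots.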

New FollowSymlinkFiles option. sourceFileInfo/sourcePathInfo resolve symlinks-to-files via os.Stat, so a symlink under the sessions dir (e.g. evil.json -> /etc/passwd) would be treated as a source and read by parseSession. This matches legacy behavior: the old DiscoverDeepSeekTUISessions used entry.IsDir() (which reports false for symlinks) and then os.ReadFile followed the link at parse time. Planting such a symlink requires write access to the user-owned ~/.deepseek/sessions directory, which already implies same-user privileges; the remote API exposes no way to create it. Per the project threat model (local same-user access is not an attacker boundary; TOCTOU/symlinks on user-owned local paths are out of scope), and because the behavior is unchanged from the legacy path, this is not a new boundary crossing.
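The standard-library behavior this paragraph relies on can be checked with a small program. `linkView` and `inspectSymlink` are hypothetical names for this sketch; it returns an error on hosts that deny symlink creation.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// linkView records how a symlink to a regular file appears to a directory
// listing versus os.Stat.
type linkView struct {
	EntryIsDir     bool // DirEntry.IsDir for the link itself
	EntryIsSymlink bool // DirEntry.Type has os.ModeSymlink set
	StatIsRegular  bool // os.Stat follows the link to its target
}

// inspectSymlink creates real.json and linked.json -> real.json in a
// temporary directory and reports how the link is classified.
func inspectSymlink() (linkView, error) {
	var v linkView
	dir, err := os.MkdirTemp("", "symlink-sketch")
	if err != nil {
		return v, err
	}
	defer os.RemoveAll(dir)

	target := filepath.Join(dir, "real.json")
	if err := os.WriteFile(target, []byte(`{}`), 0o600); err != nil {
		return v, err
	}
	link := filepath.Join(dir, "linked.json")
	if err := os.Symlink(target, link); err != nil {
		return v, err // some hosts deny symlink creation
	}

	entries, err := os.ReadDir(dir)
	if err != nil {
		return v, err
	}
	for _, e := range entries {
		if e.Name() == "linked.json" {
			v.EntryIsDir = e.IsDir()
			v.EntryIsSymlink = e.Type()&os.ModeSymlink != 0
		}
	}

	info, err := os.Stat(link)
	if err != nil {
		return v, err
	}
	v.StatIsRegular = info.Mode().IsRegular()
	return v, nil
}

func main() {
	v, err := inspectSymlink()
	if err != nil {
		fmt.Println("symlinks unavailable:", err)
		return
	}
	fmt.Printf("%+v\n", v)
}
```

A filter that skips only entries where `IsDir` is true therefore keeps symlinked files, and any later read through `os.Stat` or `os.ReadFile` follows the link.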

No SQL/command/template sinks, secrets, auth decisions, or privileged operations are touched; parsing is gjson-based with no eval, and IsValidSessionID still gates the stored ID prefix.

No issues found.

DeepSeek TUI has a shallow one-file-per-session JSON layout, so moving it next keeps the provider migration incremental while exercising the JSON source helper with non-JSONL extensions.

The provider preserves legacy discovery filters for latest and offline queue files, raw/full ID lookup, changed-path classification, fingerprint propagation, and parse normalization without changing runtime sync dispatch.

fix(parser): preserve deepseek tui symlink files

DeepSeek TUI legacy lookup and parsing followed direct symlinks to session JSON files, so the facade provider needs an explicit way to preserve that source shape instead of silently dropping linked archives. The JSONL source helper keeps symlink-file following opt-in, DeepSeek TUI enables it, and the branch manifest opts the concrete provider into shadow comparison so the migration is exercised rather than additive.

Validation: go test -tags "fts5" ./internal/parser -run 'Test(DeepSeekTUIProvider|JSONLSourceSet|ProviderMigrationModes)' -count=1; go test -tags "fts5" ./internal/parser -count=1; go vet ./...; git diff --check

test(parser): skip deepseek symlink test when unsupported

Some test environments deny symlink creation even though the provider behavior is valid when links are available. The regression should skip in that environment instead of failing for host permissions.

Validation: go test -tags "fts5" ./internal/parser -run 'Test(DeepSeekTUIProvider|ProviderMigrationModes)' -count=1; go test -tags "fts5" ./internal/parser -count=1; go vet ./...; git diff --check

test(sync): compare deepseek tui shadow parity

DeepSeek TUI is shadow-compared on this branch, so add the shared source-level proof that provider observation matches the existing ParseDeepSeekTUISession output. This keeps the branch review focused on an actual migration surface rather than only provider-local parser tests.

Validation: go fmt ./...; go test -tags "fts5" ./internal/parser ./internal/sync -count=1; go vet ./...; git diff --check; ./custom-gcl run --config .golangci.nilaway.yml ./internal/parser/... ./internal/sync/...

refactor(parser): fold deepseek tui into provider

DeepSeek TUI should have one maintained parser shape on this branch. Leaving exported discover, lookup, and parse functions beside the concrete provider kept the migration additive and forced sync to preserve a second dispatch path. Make the concrete provider authoritative, move parsing onto the provider, remove the AgentDef legacy hooks and engine dispatch, and replace shadow-baseline tests with provider API coverage plus a guard that the old symbols stay gone.

Validation: go test -tags "fts5" ./internal/parser ./internal/sync ./cmd/agentsview -count=1; go vet ./...; git diff --check

fix(parser): preserve deepseek tui file hash

DeepSeek TUI legacy sync stored the transcript content hash, but the migrated provider source set did not request hashing, so provider-authoritative Parse left Session.File.Hash empty when using the real Fingerprint path.

Enable source hashing for DeepSeek TUI and make the provider parse test use Fingerprint -> Parse to assert the persisted file_hash value comes from the session JSON content.

Validation: go test -tags "fts5" ./internal/parser -run TestDeepSeekTUIProvider -count=1; go test -tags "fts5" ./internal/parser -count=1; go vet ./...; git diff --check

roborev-ci · 2026-06-25T06:10:23Z

roborev: Combined Review (`49d1447`)

Medium issue found: DeepSeek TUI parse-diff coverage regressed; no security findings.

Medium

internal/parser/types.go:365 and internal/sync/parsediff.go:217
DeepSeek TUI no longer has a DiscoverFunc, but parse-diff still only discovers and accepts file-based agents with DiscoverFunc. As a result, --agent deepseek-tui will be rejected as having no on-disk source, and unrestricted parse-diff will skip existing DeepSeek sessions as “not discovered” instead of comparing them.
Fix: Teach parse-diff to include provider-authoritative agents and build provider-backed work items, or keep the legacy discovery hook until parse-diff is provider-aware. Add parse-diff coverage for DeepSeek TUI/provider-authoritative sources.
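The eligibility gap described above can be sketched with two predicates. The `agentDef` shape, the `HasProvider` flag, and both function names are hypothetical; the sketch only contrasts a DiscoverFunc-only filter with one that also accepts provider-authoritative agents.

```go
package main

import "fmt"

// agentDef is a hypothetical, trimmed-down agent definition.
type agentDef struct {
	Name         string
	DiscoverFunc func(root string) []string // legacy discovery hook; nil after migration
	HasProvider  bool                       // hypothetical flag: a provider is authoritative
}

// legacyEligible mirrors the behavior described in the finding: only agents
// with a DiscoverFunc count as having an on-disk source.
func legacyEligible(a agentDef) bool {
	return a.DiscoverFunc != nil
}

// providerAwareEligible also accepts provider-authoritative agents.
func providerAwareEligible(a agentDef) bool {
	return a.DiscoverFunc != nil || a.HasProvider
}

func main() {
	deepseek := agentDef{Name: "deepseek-tui", HasProvider: true}

	fmt.Println(legacyEligible(deepseek))        // prints false
	fmt.Println(providerAwareEligible(deepseek)) // prints true
}
```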

Panel: ci_default_security | Synthesis: codex, 7s | Members: codex_default (codex/default, done, 6m18s), codex_security (codex/security, done, 1m17s) | Total: 7m42s

This was referenced Jun 20, 2026

fix(parser): require explicit provider factories #784

Draft

feat(parser): migrate db-backed providers #783

Draft

feat(parser): migrate antigravity providers #782

Draft

mariusvniekerk force-pushed the provider-gptme branch from e9558b6 to ede515d Compare June 20, 2026 01:41

mariusvniekerk force-pushed the provider-deepseek-tui branch from b601036 to 808b17a Compare June 20, 2026 01:41

mariusvniekerk force-pushed the provider-gptme branch from ede515d to 27cbdac Compare June 21, 2026 00:40

mariusvniekerk force-pushed the provider-deepseek-tui branch from 808b17a to 4b11d8c Compare June 21, 2026 00:40

mariusvniekerk force-pushed the provider-gptme branch from 27cbdac to 569d8db Compare June 21, 2026 01:35

mariusvniekerk force-pushed the provider-deepseek-tui branch from 4b11d8c to 282fb83 Compare June 21, 2026 01:35

mariusvniekerk force-pushed the provider-gptme branch from 569d8db to 95fe1f2 Compare June 21, 2026 01:46

mariusvniekerk force-pushed the provider-deepseek-tui branch from 282fb83 to adb7150 Compare June 21, 2026 01:46

mariusvniekerk force-pushed the provider-gptme branch from 95fe1f2 to d08a3b3 Compare June 23, 2026 23:55

mariusvniekerk force-pushed the provider-deepseek-tui branch from adb7150 to 990379a Compare June 23, 2026 23:55

mariusvniekerk force-pushed the provider-gptme branch from d08a3b3 to b80e47c Compare June 25, 2026 05:47

mariusvniekerk force-pushed the provider-deepseek-tui branch from 990379a to 49d1447 Compare June 25, 2026 05:47

mariusvniekerk wants to merge 1 commit into provider-gptme from provider-deepseek-tui
