You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Following up on my comment in #95, after reading how the trust surface developed
this week (#140 cryptographic provenance, #92/#94 SOURCES + inline citation, the
new SPEC §12 trust-and-safety in #58). Those cover three distinct senses of
"trust" — and there's a fourth still open, which is the one we've had to solve in
production.
A bundle can be perfectly signed (#140), injection-safe (#58), and have every
claim cited (#94) — and an agent still can't tell a claim verified against the
live system from one that merely restates vendor documentation (often stale or
aspirational) from one that's inferred and unverified. A citation proves a claim
is grounded; it doesn't grade the ground. For a retrieval agent that's tolerable.
For an agent that acts on a real system, it's where damage happens. (This is
the dimension #95 explicitly scoped out.)
What we run in production — single-domain agent KB, enterprise IT service
management, ~250 atomic notes over ~140 build cycles, agents acting on live
customer systems:
a graded confidence value per concept in frontmatter — HIGH | MEDIUM | LOW | UNVERIFIED — orthogonal to "is it cited";
a small evidence-kind vocabulary on claims (observed-on-system / documentation / partner-confirmed / inferred) — this types the kind of
evidence, not just its URL, so it's a different sense of "provenance" than Provenance / verification: an additive layer for OKF bundles #140's origin provenance;
an explicit trust ordering when two cited sources disagree
(observed > partner-confirmed > observed+doc > doc > inferred) and a conflict rule: document both sides, never silently pick.
The taxonomy is domain-specific; the axis is not. Every producer whose agents act reinvents some version of "how sure are we", and without a shared optional
slot none of it survives the cross-org hop OKF exists for.
The ask — minimal, optional, already spec-permitted ("producers MAY include
additional keys"):
an optional confidence: frontmatter key with a small recommended enum, and
Not a mandate; minimal producers ignore it. I'm not attached to the names — happy
to align with #140/#92 so we don't end up with three meanings of "provenance".
Questions:
Is graded epistemic confidence in scope as an optional OKF convention, or
deliberately producer-side forever?
Would a worked example bundle (an OKF-shaped slice of our corpus with the
confidence layer applied) help the discussion?
Thanks for the pace and openness here — the trust model is clearly being built
right now, which is why I wanted to get this axis on the table before it settles.
Following up on my comment in #95, after reading how the trust surface developed
this week (#140 cryptographic provenance, #92/#94 SOURCES + inline citation, the
new SPEC §12 trust-and-safety in #58). Those cover three distinct senses of
"trust" — and there's a fourth still open, which is the one we've had to solve in
production.
Four axes, so we don't talk past each other:
A bundle can be perfectly signed (#140), injection-safe (#58), and have every
claim cited (#94) — and an agent still can't tell a claim verified against the
live system from one that merely restates vendor documentation (often stale or
aspirational) from one that's inferred and unverified. A citation proves a claim
is grounded; it doesn't grade the ground. For a retrieval agent that's tolerable.
For an agent that acts on a real system, it's where damage happens. (This is
the dimension #95 explicitly scoped out.)
What we run in production — single-domain agent KB, enterprise IT service
management, ~250 atomic notes over ~140 build cycles, agents acting on live
customer systems:
confidencevalue per concept in frontmatter —HIGH | MEDIUM | LOW | UNVERIFIED— orthogonal to "is it cited";observed-on-system/documentation/partner-confirmed/inferred) — this types the kind ofevidence, not just its URL, so it's a different sense of "provenance" than
Provenance / verification: an additive layer for OKF bundles #140's
origin provenance;
(
observed > partner-confirmed > observed+doc > doc > inferred) and aconflict rule: document both sides, never silently pick.
The taxonomy is domain-specific; the axis is not. Every producer whose agents
act reinvents some version of "how sure are we", and without a shared optional
slot none of it survives the cross-org hop OKF exists for.
The ask — minimal, optional, already spec-permitted ("producers MAY include
additional keys"):
confidence:frontmatter key with a small recommended enum, andProposal: Inline citation as default convention for coupling claims to evidence #94,
complementary to Provenance / verification: an additive layer for OKF bundles #140
— different sense of "provenance").
Not a mandate; minimal producers ignore it. I'm not attached to the names — happy
to align with #140/#92 so we don't end up with three meanings of "provenance".
Questions:
deliberately producer-side forever?
Proposal: SOURCES body section for agent navigation and provenance #92 SOURCES /
Provenance / verification: an additive layer for OKF bundles #140 provenance work?
confidence layer applied) help the discussion?
Thanks for the pace and openness here — the trust model is clearly being built
right now, which is why I wanted to get this axis on the table before it settles.