When to use
- Your app already calls the LLM and gets an answer
- You want independent governance without rearchitecting
- You need the lowest latency (~150ms p95)
Example
What Verify checks
- Evidence consistency — does the answer match
tool_result? - Security — canary words, secret patterns, PII leaks
- Mismatch type — contradiction / numeric / insufficient evidence
- Confidence — high / medium / low
What Verify does NOT do
- Does NOT call an LLM (no latency, no token cost)
- Does NOT ground answers against external sources (you supply them)
- Does NOT rewrite unless Security policy triggers
/v1/chat instead.