Question 1

Why can't GPT or Claude do this themselves?

Accepted Answer

A language model can describe what an audit should check, but it cannot install a tool, intercept its TLS traffic, and prove what it actually sent. That needs execution and capture — an external observation layer the model doesn't have.

Question 2

What is evidence coverage?

Accepted Answer

The share of an audit's claims that are backed by independently captured evidence rather than assertion. Five claims, one proven → 20% coverage.

Question 3

What is an unsupported claim?

Accepted Answer

An assertion with no verifiable evidence behind it — the audit says it, but nothing observed confirms it. Canary flags these and holds a verdict that rests on them.

Question 4

How is the Integrity Score calculated?

Accepted Answer

From real, checkable attributes: a passed capture self-test, intercepted traffic behind each claim, an adversarial disclosure check, a tamper-evident signature, and an exact version pin — normalised to 0–100. See the Integrity Score page.

Question 5

Can a high-confidence audit still have low integrity?

Accepted Answer

Yes — that is the central failure mode. Confidence is generated from text; integrity is computed from evidence. Confidence is not evidence.

Question 6

Does Canary audit the auditor?

Accepted Answer

Yes. It scores the audit itself on evidence, and signs its own verdicts so its scoring is auditable in turn.

Questions