Hardproof quality report (release QA)

This page is a public summary of the first recorded Hardproof release QA run against non-fixture targets. It is intended to be reproducible: every row links to the raw scan outputs and the exact manifest used.

Evidence bundle

Scan summary

Hardproof makes score truth explicit. A partial scan produces a numeric score that is not publishable (score_truth_status=partial); its overall_score is still computed as the effective score, so it matches partial_score. A full scan publishes overall_score once the weighted evidence gates (typically Trust) are satisfied.

Note: this evidence bundle was recorded with 0.4.0-beta.2, which used overall_score=null for partial scans. The table below shows the effective score (the value that now appears as overall_score in partial scans).

| target | status | score_truth_status | score_mode | overall_score | partial_score | evidence |
| --- | --- | --- | --- | --- | --- | --- |
| stdio x07lang-mcp | fail | partial | partial | 78 | 78 | scan.json |
| streamable_http postgres-mcp demo (partial) | warn | partial | partial | 96 | 96 | scan.json |
| streamable_http postgres-mcp demo (full) | warn | publishable | full | 93 | 93 | scan.json |
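
The effective-score rule described above can be sketched in a few lines of Python. This is only an illustration, not Hardproof's own code: it assumes a scan.json that exposes overall_score, partial_score, and score_truth_status as top-level keys, which this page does not confirm, and the helper names effective_score and is_publishable are hypothetical.

```python
from typing import Any, Optional


def effective_score(scan: dict[str, Any]) -> Optional[float]:
    """Return the score to display for a scan record.

    Assumption: `scan` is the parsed scan.json with top-level
    `overall_score` and `partial_score` keys (layout not confirmed).
    """
    overall = scan.get("overall_score")
    if overall is not None:
        return overall
    # Bundles recorded with 0.4.0-beta.2 stored overall_score=null for
    # partial scans, so fall back to partial_score in that case.
    return scan.get("partial_score")


def is_publishable(scan: dict[str, Any]) -> bool:
    """A score is publishable only when score_truth_status says so."""
    return scan.get("score_truth_status") == "publishable"


# Records shaped like the table rows (values taken from the table above).
old_partial = {"score_truth_status": "partial", "overall_score": None, "partial_score": 78}
full = {"score_truth_status": "publishable", "overall_score": 93, "partial_score": 93}

print(effective_score(old_partial), is_publishable(old_partial))
print(effective_score(full), is_publishable(full))
```

Keeping the fallback in one helper means bundles recorded before and after the overall_score change can be read the same way, while the publishable check stays separate from the numeric value.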

Corpus flow (public-sample pipeline)

The release QA run also records a minimal corpus run + render to prove the public-sample pipeline works end-to-end.

How to interpret this page