These scores weren't handed down by us. They came from a panel we built to argue with our own work — and we've put its limits on the page next to its verdicts, because a scorecard you can't check is just marketing wearing a lab coat.
The panel is your buyers, not invented critics. Four expert personas, built from Covington Place's own ICP research: the transformation-ready CEO/COO, the CFO watching AI spend accumulate outside their visibility, the compliance-cornered CISO, and the CTO trying to separate real velocity from vibe-coding. These are the exact people who sign — or kill — engagements like this one. We didn't imagine an audience; we modeled the one already in the pipeline.
The rubric was fixed before any verdict. One test — sourced evidence in view, instrument-grade over styled, restraint, operator-to-operator credibility, and the tell that a site expects to be challenged rather than believed. It's the same test that ruled out a motion-video hero, scored the current live site, scored all three redesign directions, and scored every competitor. Nobody moved the goalposts between the thing being judged and the score. The test punishes advertised-but-unbacked rigor hardest — the failure mode this whole page is built to avoid.
Every score cites something you can see. Each judgment points at an artifact actually on the rendered page — a sourced-figure row, an honest verb, a self-quoted testimonial. Competitors were scored from their live homepages as they ship today, not from reputation or what we assume they're like.
It says where we lose, out loud. The same panel flagged Covington Place's own exposure as loudly as its wins: our proof is self-provenanced — our own receipts and packets — and we surface none of our ten real clients as a named logo wall, the one thing a skeptic scans for in five seconds. All four buyers named it. A scorecard that only flatters is worthless; this one told us where we're beatable.
And it corrects itself. When one competitor's homepage lazy-loaded and rendered dark on first capture, we re-captured it against a full render and re-scored it before trusting the number — no capture artifact was allowed to distort the table.
The honest limit — and it's the point. This is a modeled expert-persona panel, not a survey of live humans. It models how these specific buyers, grounded in real buyer research and a documented rubric, read a site — a decision aid, not a market survey. We say that plainly, the same way none of these sites should ever say "certified" without the paper to back it. The candor is the trust signal: if we'll tell you what this scorecard can't do, you can believe what it says it can.