Methodology
How Proprietor rates outlets, clusters stories, and detects blindspots
A transparent account of how this site works — and where it falls short
Proprietor aggregates articles from UK news outlets, groups them into story clusters, and measures how coverage differs by ownership and editorial stance. Each part of that process involves choices. This page makes them explicit.
How outlets are rated
Each tracked outlet carries editorial ratings across four axes. These are human judgments, not algorithmic outputs. They draw on MBFC, the Reuters Institute Digital News Report, and academic media studies literature. Every rating is stored with a confidence level (high / medium / low) and a source tag. All are openly contestable — see the transparency section below.
All bias dimensions use a scale from −5 to +5, where 0 is genuine neutrality. Most outlets sit between −3 and +3. The poles are intentionally descriptive, not pejorative.
Economic axis
Social axis
Establishment axis
EU / Sovereignty axis
Credibility score
Separately from bias, each outlet carries a credibility score from 0–100, drawn primarily from MBFC with Reuters Institute data for established broadcast outlets. This score affects blindspot detection: outlets rated below 40 are weighted at 0.3 rather than 1.0, so a tabloid pile-on does not trigger the same signal as genuine cross-spectrum coverage. GB News (30), The Canary (45), and The Express (35) are weighted down. The Economist (90), The Conversation UK (88), and the BBC (85) carry full weight.
Ownership data
Ownership records are drawn from three primary sources:
- Media Reform Coalition — Who Owns the Media? Annual report tracking UK print and digital ownership concentration. The most comprehensive public audit of UK media ownership.
- RSF Media Ownership Monitor Reporters Without Borders' international monitoring of media ownership transparency, used for ultimate beneficial owner data where public records are thin.
- Companies House for corporate structure and beneficial ownership verification, where publicly registered UK entities are involved.
Each outlet record carries: ownership group, ultimate beneficial owner where known, funding model (advertising / subscription / public / membership / mixed), and whether editorial staff are union-recognised. Funding model and union data come from public reporting and are updated periodically.
Story clustering
Every 10 minutes, freshly ingested articles are converted into numerical vectors using a multilingual sentence model. These vectors capture the semantic meaning of each article — two articles about the same event will produce similar vectors even if their headlines use different words.
A new article is compared against recent clusters. If it is close enough to an existing cluster — above a similarity threshold of 0.78 — it joins that cluster. If it is far from all recent clusters, it starts a new one. A narrow grey zone between 0.62 and 0.78 is arbitrated by a language model asked a simple question: are these covering the same news event?
Two safeguards prevent false groupings. First, both articles must share at least one named entity — a person, organisation, or place. A story about Keir Starmer and a story about interest rates will not cluster together even if they have similar economic vocabulary. Second, the grey-zone language model check adds a second opinion on ambiguous cases.
Wire service content — PA Media stories republished verbatim across regional outlets — is detected using SimHash fingerprinting and shown once rather than once per republishing outlet. The originating outlet gets credit for coverage; the regional republications are recorded but do not inflate the coverage count.
Blindspot detection
A cluster qualifies as a blindspot when at least three distinct outlets have covered a story and one side of the economic axis has zero representation — all coverage comes from either pro-labour or pro-market outlets, with none from the other side.
This is a structural measure, not an editorial judgment. Proprietor does not decide whether a story deserves more coverage. It only observes that the outlets which did cover it are grouped on one side of the economic divide.
Blindspot types currently detected:
- Market press blindspot. A story covered only by pro-market outlets — no left-of-centre voices present.
- Establishment blindspot. A story covered only by pro-establishment outlets — no critical or anti-establishment voices.
- Brexit lens. An EU or sovereignty story with no pro-labour outlet coverage.
- Tabloid amplification. A story whose coverage is dominated by low-credibility outlets, suggesting amplification without quality corroboration.
Known limitations
Proprietor is a work in progress. Current limitations worth naming explicitly:
- Ratings are editorial judgments. They reflect the people who made them and can be wrong. They are not produced by an algorithm and should not be treated as objective.
- Proprietor currently tracks 30 UK outlets. Hundreds of significant titles — regional papers, specialist publications, community media — are absent. Blindspot detection is only as good as the outlets in the sample.
- RSS-based ingestion cannot retrieve paywalled articles. The Times, The Economist, and The Spectator contribute headline-level signals only.
- Clustering uses headline and summary text only. Full article bodies are not yet indexed. Two articles with identical headlines but opposite editorial takes may cluster together.
- Wire duplication detection catches near-verbatim republication but not rewrites. A regional paper that rewrites a PA Media story in its own words will be counted as original coverage.
- The blindspot threshold of three outlets favours widely-covered national stories. Niche specialist coverage — a trade outlet covering a sector story no national picks up — is not measured.
- The economic axis is the primary blindspot axis. Social, establishment, and EU/sovereignty dimensions are tracked but do not yet trigger blindspot flags.
About Proprietor
Proprietor is open-source software, built in public, with no advertising and no investor funding. It was created to increase transparency about who owns and shapes UK news — a structurally-aware alternative to Ground News, focused on the UK media landscape.
No outlet pays for listing or for its ratings. No outlet can pay to be removed. The outlet database includes publications across the full political spectrum, including outlets whose editorial positions we find objectionable.
Ratings can be challenged. If you believe a score is wrong, open a GitHub issue with your reasoning and evidence. We take corrections seriously and will update ratings when the argument is sound. Ownership data corrections are especially welcome — this is an area where public records are often incomplete.