Methodology · v2.2 · updated 2026-05-11
How Mindber scores a product
Every product gets one headline composite - the Mindber Score - computed from three sub-scores: Mindber Innovation Index, Mindber Functionality Score, and Mindber Activity Score. Each is on a 0-10 scale with 1 decimal. The math is reproducible and visible - click any score chip to see the per-input contributions.
Version history
| Version | Date | Change | Why |
|---|---|---|---|
| v2.6 | 2026-05-11 | Mindber Score weights changed to 40/50/10 for Innovation, Functionality, and Activity. | Make the composite more buyer-quality-led while keeping public activity as a freshness signal. |
| v2.5 | 2026-05-06 | Scoring methodology consolidated into the canonical methodology pages. | Keep public methodology, legal methodology, and score popovers easier to audit. |
| v2.1 | 2026-04-27 | Published confidence states and missing-input fallback weights. | Make score confidence auditable when not all sub-scores are available. |
| v2.0 | 2026-03-15 | Split product scoring into named Mindber sub-scores. | Separate novelty, capability, and public activity evidence. |
| v1.0 | 2026-01-30 | Published first public product scoring bands. | Give buyers a consistent starting point for product comparisons. |
Mindber Score (composite)
Weighted average of the three sub-scores. No confidence multiplier. No hidden penalty. The popover on the score chip shows the exact math.
mindberScore = round( innovationIndex x 0.40 + functionalityScore x 0.50 + activityScore x 0.10 , 1)
When one sub-score is missing, the missing input's weight redistributes proportionally across the remaining two. Two of three inputs labels confidence as medium; all three as high. With fewer than two scored inputs, no Mindber Score is shown (insufficient) and the page renders “Score pending” instead.
Fallback weights when one input is missing
- No Activity - Innovation 44% · Functionality 56%
- No Innovation - Functionality 83% · Activity 17%
- No Functionality - Innovation 80% · Activity 20%
Why these weights?
Why functionality is weighted highest
Functionality is weighted highest because buyers first need to know whether a product appears capable of solving the job it claims. Novelty matters, but a distinctive concept without enough usable surface area should not outrank a more complete product that covers the core workflow.
Activity receives a smaller share because public momentum is a freshness signal, not a quality verdict. The composite is designed to reward products that combine clear differentiation, practical capability, and recent evidence without letting any single family dominate the score.
Confidence states
- High - all three sub-scores present. Default 40/50/10 weights apply.
- Medium - two of three sub-scores present. Fallback weights as above.
- Insufficient - fewer than two present. No Mindber Score is shown.
Score band interpretation
| Band | Range | Reading |
|---|---|---|
| Weak | 0.0 - 3.0 | Thin evidence or weak signal across most inputs. |
| Average | 3.1 - 6.0 | Mid-pack - fits some buyer profiles, not yet differentiated. |
| Strong | 6.1 - 8.0 | Confident recommendation in the right buyer fit. |
| Exceptional | 8.1 - 10.0 | Category leader signals across all inputs. |
What the Mindber Score is NOT
- Not investment advice
- Not a guarantee of business performance
- Not a measure of company financial health
- Not a recommendation to buy or avoid
- Not a substitute for due diligence
Bands are absolute - they do not shift with the category. The category percentile label (below) tells you how the product ranks against its peers separately.
How to verify a score
- All score inputs are derived from public sources. public sources page.
- Disputes start from the correction widget on the product page. product page correction widget.
- Score changes within 7 days surface in the product page audit log.
- Click any score chip to see per-input contributions.
Mindber Innovation Index™
Average of three sub-scores - Novelty, Differentiation, Concept clarity - each 0-10.
innovationIndex = round((novelty + differentiation + conceptClarity) / 3, 1)
- Novelty - how distinct the core concept is from the category baseline. LLM-inferred, grounded in the category's top 3 tools and common features (see below).
- Differentiation - count of features present in this product but missing from >=50% of category peers. Scaled to 0-10:
min(10, unique x 1.5). LLM-inferred, verified against the category common-features list. - Concept clarity - how unambiguous the value proposition reads from the tagline and description alone. LLM-inferred, grounded in the source copy.
Mindber Functionality Score™
Confidence-weighted average of four sub-scores. Feature breadth and Performance reliability are deterministic - computed from source data with no LLM involvement. The other two are LLM-inferred.
functionalityScore = sum(score x baseWeight x confidence) / sum(baseWeight x confidence)
Base weights:breadth 0.25 · depth 0.30 · integrations 0.20 · performance 0.25. Confidence weighting means a sub-score with thin evidence pulls less weight than one backed by real data.
- Feature breadth (deterministic). Count of features from collected source records. Score =
min(10, round(log2(count + 1) x 2)). 5 features - 5; 15 - 8; 30+ - 10. - Feature depth (LLM). Quality of the top features vs the category leader. Cites specific named features.
- Integration surface (LLM with caveat). Public API + webhook + native integration evidence. Where no integration evidence exists, the score defaults to neutral (5) with confidence no higher than 0.4.
- Performance reliability (deterministic). Computed from the product's website engagement score and homepage response time:
0.7 x (livenessScore / 10) + 0.3 x responseTimeScore. Response-time score is 10 for no more than 500 ms, linearly degrading to 1 at 5000 ms or more.
Category percentile
A score in isolation isn't meaningful - a 7.3 in a crowded category is different from a 7.3 in a deep one. We compute per-category distributions (median, p25, p50, p75, p90) and bucket each final score into one of five labels:
- p90 or highertop 10%
- p75 or highertop 25%
- p50 or higherabove median
- p25 or higherbelow median
- below p25bottom 25%
A baseline is only published once a category has at least 10 scored products; below that minimum, percentile is shown as unranked.
Update cadence
- Full baselines (median/p25/p75/p90) refresh weekly. A drift alert fires when a category median moves by more than 1.0 point.
- Percentile labels refresh daily. The underlying raw scores stay fixed until the product is regenerated - only the bucket label updates.
- Product scores are regenerated whenever the source inputs change (name, tagline, description, categories, candidate alternatives) or when the scoring methodology is upgraded.
Guardrails
- Every LLM-written sentence must cite a specific feature, use case, or baseline data point. Marketing adjectives are banned.
- A banned-word safety filter runs on every rationale; offending text is blanked out rather than retried.
- Any sub-score below 4/10, composite confidence below 0.6, or a safety-filter replacement flags the record for admin review before publication.
- Scores compare within category, never absolute. “Below median” is factual; “bad” is not a word we use.
Frequently asked
How is the Mindber Score computed?
Mindber Score is a weighted average of three sub-scores on a 0-10 scale: Mindber Innovation Index 40%, Mindber Functionality Score 50%, Mindber Activity Score 10%. The result is rounded to one decimal. There is no confidence multiplier or hidden penalty - what you see in the popover is the literal math.
What does the confidence label mean?
Confidence reflects how many sub-scores fed the calculation. 'High' = all three (Innovation, Functionality, Activity). 'Medium' = two of three with the missing input's weight redistributed. 'Insufficient' = fewer than two sub-scores; no Mindber Score is shown.
How should I interpret the 0-10 score?
Use the band: 0-3 weak signal, 3-6 average, 6-8 strong, 8+ exceptional. Bands are absolute - they don't shift with the category. The category percentile label tells you how the product ranks against peers in its niche.
How often does the Mindber Score update?
It updates automatically whenever the underlying intelligence record is approved or auto-published. Mindber Activity Score reflects current website engagement signals on every page render. There is no fixed weekly cadence.
Why does my product show a different score than the popover math suggests?
If the displayed final score differs from the per-input contributions by more than 0.2, that's a stored-snapshot drift - the audit job flags it for human review when it persists for 7+ days. Re-approving the intelligence record reconciles it.
Can a product have a Mindber Innovation Index but no Mindber Score?
Yes - when fewer than two sub-scores are available the Mindber Score returns null with confidence 'insufficient'. The page shows 'Score pending - additional data required' instead of a numeric score.
How Mindber Score differs from other ratings
| Source | Type | Update freq | No-update label shown |
|---|---|---|---|
| Mindber | Behavioral aggregation | Continuous | Yes |
| G2 | User reviews | On demand | No |
| Product Hunt | Launch votes | Day-of-launch | No |
| GitHub stars | Popularity proxy | Continuous | No |