Researchby Consensus★ 4.3

Consensus

AI-powered scientific search engine. Returns answer with consensus meter across studies — useful for distilling messy literature.

Try it Also in Tools: AI Research →

Safety & Trust

Delv Safety Grade: B

Score 72/100 · assessed 2026-04-18

Maintainer65

Permissions85

Supply chain60

Transparency55

Incidents100

Consensus is a web-based scientific search engine operated by Consensus Inc., a venture-backed startup. It queries academic databases and returns aggregated answers with consensus meters. The service is narrow in scope (read-only academic search) with no filesystem, shell, or desktop access. Permissions are limited to outbound network calls to their API and reading academic literature databases. The main supply chain risk is that it's a closed-source SaaS with no public repository, so you cannot audit what happens server-side. Transparency is moderate: the company is identifiable and the product is well-documented, but there's no open codebase or detailed security disclosures. No known security incidents. The freemium model means some data may be retained for service improvement. Overall, it's a low-risk tool for literature review, but you're trusting a single vendor's infrastructure and data handling practices.

Green flags

Read-only academic search with no write or execute permissions
Narrow scope limited to scientific literature aggregation
No known security incidents or credential leaks
Identifiable company with professional web presence
No filesystem, shell, or desktop control required

Red flags

Closed source with no public repository for audit
Freemium model may retain queries and results for training or analytics
Single vendor dependency with no self-hosted alternative
Unclear data retention and privacy policies for academic queries

Permissions requested

Outbound networkExternal LLM call

Assessed by Delv Editorial using public metadata. Grades are advisory and update as the ecosystem changes. They do not replace your own review of permissions and code before granting an agent access to sensitive systems.

Pricing

FREEMIUM

Platforms

web

Review

Consensus isn't strictly autonomous in the agent sense - it doesn't iterate or plan multi-step workflows. It's a search engine with a clever trick: it scans academic papers and gives you a consensus meter showing whether studies agree, disagree, or are split on a question. You ask a plain-English question, it returns a summary plus a percentage breakdown. The autonomy is minimal - one query, one answer - but the value is in aggregation speed. I've used it mostly for health claims that show up in newsletters or podcasts. Someone says "intermittent fasting improves insulin sensitivity" and I want to know if that's backed by actual trials or just one cherry-picked study. Consensus returns a grid of papers with yes/no/maybe classifications and a consensus score. It's faster than PubMed because it pre-digests the abstracts and highlights the relevant finding. The summaries are usually accurate, though I spot-check the original papers when stakes are high. Where it shines: quick sanity checks before you write something, or when you need to deflate a confident claim from someone who clearly hasn't read the literature. It's also useful for scoping a topic before a deeper dive - if consensus is 80% one way, you know where the weight of evidence sits. The free tier gives you a handful of searches; the paid version unlocks unlimited queries and better filtering. Failure modes: it's only as good as the papers indexed, and coverage outside biomedicine and social science is patchy. I've asked niche engineering questions and got thin results. The consensus meter can also mislead if sample size is small - three papers agreeing doesn't mean much if there are thirty others it missed. And it won't catch nuance like "works in mice, fails in humans" unless you drill into individual studies. Nearest competitor is probably Elicit, which does similar literature synthesis but leans harder into extracting structured data from papers. Consensus is faster for yes/no questions; Elicit is better when you need tables or comparisons across studies. I'd pick Consensus for quick fact-checks, Elicit for actual research projects.

Verdict

Pay for it if you regularly verify claims against academic literature - writers, fact-checkers, researchers in early-stage scoping. Skip it if you need deep domain coverage outside health and social science, or if you're after structured data extraction rather than consensus summaries.

Good at

Fast consensus view across multiple studies beats manual PubMed trawling
Plain-English queries work well - no Boolean operators needed
Useful for deflating pop-science claims with actual evidence base
Free tier sufficient for occasional use
Summaries generally accurate when spot-checked against source papers

Watch out

Not truly autonomous - single-query tool, no iteration or multi-step reasoning
Coverage weak outside biomedicine and social science
Consensus meter misleading when sample size is small
Won't surface nuance like species-specific results without manual drilling
Misses papers outside its index - not a complete literature view

Use cases

Quick "what do studies actually say?" answers
Health-claim fact-checking
Pre-writing background research
Dispelling pop-science myths