Delv
Researchby Consensus4.3

Consensus

AI-powered scientific search engine. Returns answer with consensus meter across studies — useful for distilling messy literature.

B
Safety & Trust

Delv Safety Grade: B

Score 72/100 · assessed 2026-04-18

Maintainer65
Permissions85
Supply chain60
Transparency55
Incidents100

Consensus is a web-based scientific search engine operated by Consensus Inc., a venture-backed startup. It queries academic databases and returns aggregated answers with consensus meters. The service is narrow in scope (read-only academic search) with no filesystem, shell, or desktop access. Permissions are limited to outbound network calls to their API and reading academic literature databases. The main supply chain risk is that it's a closed-source SaaS with no public repository, so you cannot audit what happens server-side. Transparency is moderate: the company is identifiable and the product is well-documented, but there's no open codebase or detailed security disclosures. No known security incidents. The freemium model means some data may be retained for service improvement. Overall, it's a low-risk tool for literature review, but you're trusting a single vendor's infrastructure and data handling practices.

Green flags

  • Read-only academic search with no write or execute permissions
  • Narrow scope limited to scientific literature aggregation
  • No known security incidents or credential leaks
  • Identifiable company with professional web presence
  • No filesystem, shell, or desktop control required

Red flags

  • Closed source with no public repository for audit
  • Freemium model may retain queries and results for training or analytics
  • Single vendor dependency with no self-hosted alternative
  • Unclear data retention and privacy policies for academic queries

Permissions requested

Outbound networkExternal LLM call
Assessed by Delv Editorial using public metadata. Grades are advisory and update as the ecosystem changes. They do not replace your own review of permissions and code before granting an agent access to sensitive systems.

Pricing

FREEMIUM

Platforms

web

Review

Consensus isn't strictly autonomous in the agent sense - it doesn't iterate or plan multi-step workflows. It's a search engine with a clever trick: it scans academic papers and gives you a consensus meter showing whether studies agree, disagree, or are split on a question. You ask a plain-English question, it returns a summary plus a percentage breakdown. The autonomy is minimal - one query, one answer - but the value is in aggregation speed. I've used it mostly for health claims that show up in newsletters or podcasts. Someone says "intermittent fasting improves insulin sensitivity" and I want to know if that's backed by actual trials or just one cherry-picked study. Consensus returns a grid of papers with yes/no/maybe classifications and a consensus score. It's faster than PubMed because it pre-digests the abstracts and highlights the relevant finding. The summaries are usually accurate, though I spot-check the original papers when stakes are high. Where it shines: quick sanity checks before you write something, or when you need to deflate a confident claim from someone who clearly hasn't read the literature. It's also useful for scoping a topic before a deeper dive - if consensus is 80% one way, you know where the weight of evidence sits. The free tier gives you a handful of searches; the paid version unlocks unlimited queries and better filtering. Failure modes: it's only as good as the papers indexed, and coverage outside biomedicine and social science is patchy. I've asked niche engineering questions and got thin results. The consensus meter can also mislead if sample size is small - three papers agreeing doesn't mean much if there are thirty others it missed. And it won't catch nuance like "works in mice, fails in humans" unless you drill into individual studies. Nearest competitor is probably Elicit, which does similar literature synthesis but leans harder into extracting structured data from papers. Consensus is faster for yes/no questions; Elicit is better when you need tables or comparisons across studies. I'd pick Consensus for quick fact-checks, Elicit for actual research projects.
Verdict

Pay for it if you regularly verify claims against academic literature - writers, fact-checkers, researchers in early-stage scoping. Skip it if you need deep domain coverage outside health and social science, or if you're after structured data extraction rather than consensus summaries.

Good at

  • Fast consensus view across multiple studies beats manual PubMed trawling
  • Plain-English queries work well - no Boolean operators needed
  • Useful for deflating pop-science claims with actual evidence base
  • Free tier sufficient for occasional use
  • Summaries generally accurate when spot-checked against source papers

Watch out

  • Not truly autonomous - single-query tool, no iteration or multi-step reasoning
  • Coverage weak outside biomedicine and social science
  • Consensus meter misleading when sample size is small
  • Won't surface nuance like species-specific results without manual drilling
  • Misses papers outside its index - not a complete literature view

Use cases

  • Quick "what do studies actually say?" answers
  • Health-claim fact-checking
  • Pre-writing background research
  • Dispelling pop-science myths