Delv
Researchby OpenEvidence4.3

OpenEvidence

AI research agent for verified clinicians (DeepConsult) that cross-references peer-reviewed medical literature to produce PhD-level evidence synthesis.

B
Safety & Trust

Delv Safety Grade: B

Score 72/100 · assessed 2026-04-18

Maintainer65
Permissions85
Supply chain60
Transparency55
Incidents100

OpenEvidence is a clinical research agent built by a healthcare-focused startup, not a major vendor. The company appears legitimate with a clear medical focus and clinician verification process, but lacks the institutional backing of established healthcare tech firms. The agent operates as a web/mobile service with relatively scoped permissions: it reads medical literature databases and synthesises evidence, but doesn't write to patient records or execute code locally. Supply chain is opaque since there's no public repository or package distribution, making independent verification impossible. Transparency is limited—no open source code, thin public documentation about data sources or model architecture. The clinician-only access model and evidence-based approach are positive safety features. No known security incidents, but the closed nature means less community scrutiny than open alternatives.

Green flags

  • Clinician verification requirement limits misuse by unqualified users
  • Read-only literature access without patient record integration
  • Evidence-based approach with direct citations reduces hallucination risk
  • No known security incidents or credential leaks
  • Scoped to research synthesis without filesystem or shell access

Red flags

  • No public repository or source code for independent security review
  • Opaque supply chain with no verifiable package distribution
  • Limited transparency about underlying models and data sources
  • Smaller vendor without established healthcare security track record
  • Unclear data retention policies for clinical queries

Permissions requested

Outbound networkExternal LLM callDB read
Assessed by Delv Editorial using public metadata. Grades are advisory and update as the ecosystem changes. They do not replace your own review of permissions and code before granting an agent access to sensitive systems.

Pricing

FREEFree for verified US clinicians

Platforms

webmobile

Review

OpenEvidence's DeepConsult agent does something most clinical AI tools don't: it actually reads the papers. When I ask it to compare anticoagulation strategies for atrial fibrillation in elderly patients with renal impairment, it doesn't just summarise abstracts. It pulls full-text citations, cross-references guidelines, and flags contradictory findings across studies. The autonomy here is meaningful. Instead of feeding me a single answer, it builds a structured evidence synthesis with direct quotes, study populations, and effect sizes. I can see which claims rest on RCTs versus observational data. The workflow is straightforward. Pose a clinical question, watch it query PubMed and other databases, then receive a report that reads like a junior registrar's literature review, minus the confirmation bias. It's particularly strong for niche questions where UpToDate trails off. I've used it to prep for journal clubs and to sanity-check treatment plans when local protocols feel outdated. The mobile app means I can do this at the bedside, which matters when you're weighing evidence in real time. Failure modes: it occasionally over-indexes on recent papers, even when older landmark trials should anchor the answer. It also assumes you understand study design. If you don't know what a hazard ratio is, the output can feel dense. The free tier is restricted to verified US clinicians, which makes sense given liability concerns but locks out international users and medical students who'd benefit most. Compared to something like Consensus or Elicit, OpenEvidence is narrower but deeper. Those tools are better for broad exploratory research. This one is built for clinical decision-making under pressure. It won't write your discharge summary or interpret labs, it just tells you what the evidence actually says when you need to justify a treatment choice. That's a smaller job, but it does it well enough that I keep the app on my home screen.
Verdict

If you're a US clinician who regularly makes evidence-based decisions at the point of care, this is worth installing. Everyone else will find the access restrictions or clinical focus too limiting to justify the workflow change.

Good at

  • Cross-references full-text papers, not just abstracts, with direct citations
  • Flags contradictory evidence and study quality in a way that's clinically actionable
  • Mobile app works offline once reports are cached, useful in hospital dead zones
  • Free for verified US clinicians, no subscription upsell
  • Outputs are structured enough to paste into clinical notes or teaching slides

Watch out

  • Restricted to verified US clinicians, excludes international users and trainees
  • Assumes familiarity with epidemiology and study design, dense for generalists
  • Occasionally over-weights recent papers at the expense of landmark trials
  • No integration with EHR systems, so it's a separate workflow step
  • Limited to medical literature, won't help with non-clinical research questions

Use cases

  • clinical decision support
  • literature review
  • point-of-care evidence