Researchby OpenEvidence★ 4.3

OpenEvidence

AI research agent for verified clinicians (DeepConsult) that cross-references peer-reviewed medical literature to produce PhD-level evidence synthesis.

Try it

Safety & Trust

Delv Safety Grade: B

Score 72/100 · assessed 2026-04-18

Maintainer65

Permissions85

Supply chain60

Transparency55

Incidents100

OpenEvidence is a clinical research agent built by a healthcare-focused startup, not a major vendor. The company appears legitimate with a clear medical focus and clinician verification process, but lacks the institutional backing of established healthcare tech firms. The agent operates as a web/mobile service with relatively scoped permissions: it reads medical literature databases and synthesises evidence, but doesn't write to patient records or execute code locally. Supply chain is opaque since there's no public repository or package distribution, making independent verification impossible. Transparency is limited—no open source code, thin public documentation about data sources or model architecture. The clinician-only access model and evidence-based approach are positive safety features. No known security incidents, but the closed nature means less community scrutiny than open alternatives.

Green flags

Clinician verification requirement limits misuse by unqualified users
Read-only literature access without patient record integration
Evidence-based approach with direct citations reduces hallucination risk
No known security incidents or credential leaks
Scoped to research synthesis without filesystem or shell access

Red flags

No public repository or source code for independent security review
Opaque supply chain with no verifiable package distribution
Limited transparency about underlying models and data sources
Smaller vendor without established healthcare security track record
Unclear data retention policies for clinical queries

Permissions requested

Outbound networkExternal LLM callDB read

Assessed by Delv Editorial using public metadata. Grades are advisory and update as the ecosystem changes. They do not replace your own review of permissions and code before granting an agent access to sensitive systems.

Pricing

FREE — Free for verified US clinicians

Platforms

webmobile

Review

OpenEvidence's DeepConsult agent does something most clinical AI tools don't: it actually reads the papers. When I ask it to compare anticoagulation strategies for atrial fibrillation in elderly patients with renal impairment, it doesn't just summarise abstracts. It pulls full-text citations, cross-references guidelines, and flags contradictory findings across studies. The autonomy here is meaningful. Instead of feeding me a single answer, it builds a structured evidence synthesis with direct quotes, study populations, and effect sizes. I can see which claims rest on RCTs versus observational data. The workflow is straightforward. Pose a clinical question, watch it query PubMed and other databases, then receive a report that reads like a junior registrar's literature review, minus the confirmation bias. It's particularly strong for niche questions where UpToDate trails off. I've used it to prep for journal clubs and to sanity-check treatment plans when local protocols feel outdated. The mobile app means I can do this at the bedside, which matters when you're weighing evidence in real time. Failure modes: it occasionally over-indexes on recent papers, even when older landmark trials should anchor the answer. It also assumes you understand study design. If you don't know what a hazard ratio is, the output can feel dense. The free tier is restricted to verified US clinicians, which makes sense given liability concerns but locks out international users and medical students who'd benefit most. Compared to something like Consensus or Elicit, OpenEvidence is narrower but deeper. Those tools are better for broad exploratory research. This one is built for clinical decision-making under pressure. It won't write your discharge summary or interpret labs, it just tells you what the evidence actually says when you need to justify a treatment choice. That's a smaller job, but it does it well enough that I keep the app on my home screen.

Verdict

If you're a US clinician who regularly makes evidence-based decisions at the point of care, this is worth installing. Everyone else will find the access restrictions or clinical focus too limiting to justify the workflow change.

Good at

Cross-references full-text papers, not just abstracts, with direct citations
Flags contradictory evidence and study quality in a way that's clinically actionable
Mobile app works offline once reports are cached, useful in hospital dead zones
Free for verified US clinicians, no subscription upsell
Outputs are structured enough to paste into clinical notes or teaching slides

Watch out

Restricted to verified US clinicians, excludes international users and trainees
Assumes familiarity with epidemiology and study design, dense for generalists
Occasionally over-weights recent papers at the expense of landmark trials
No integration with EHR systems, so it's a separate workflow step
Limited to medical literature, won't help with non-clinical research questions

Use cases

clinical decision support
literature review
point-of-care evidence