Delv
Researchby Undermind4.3

Undermind

Research agent for scientists and doctors that reads hundreds of papers, adapts its search and follows citation trails.

B
Safety & Trust

Delv Safety Grade: B

Score 72/100 · assessed 2026-04-19

Maintainer60
Permissions75
Supply chain65
Transparency55
Incidents100

Undermind is a commercial research agent from a startup focused on academic literature search. The company appears legitimate with a professional web presence, but lacks the track record of established vendors. As a web-based service, it operates in a relatively constrained environment compared to filesystem or shell-access tools, primarily reading academic databases and following citation networks. The closed-source nature and absence of a public repository limit transparency into how it handles queries and data. Supply chain risk is moderate since it's a hosted service rather than installed software, but you're trusting Undermind's infrastructure and any third-party academic APIs they integrate. No known security incidents, but the freemium model means understanding what data persists between tiers. Reasonable choice for academic research with standard web-service caution, but not the transparency of open tooling.

Green flags

  • Web-based service limits local system access and permissions
  • Focused domain (academic research) reduces attack surface
  • No known security incidents or credential leaks
  • Professional presentation suggests legitimate commercial operation

Red flags

  • No public repository or source code visibility
  • Startup with limited public track record or security disclosures
  • Unclear data retention policies between free and paid tiers
  • Freemium model may create incentives around query/result data

Permissions requested

Outbound networkExternal LLM call
Assessed by Delv Editorial using public metadata. Grades are advisory and update as the ecosystem changes. They do not replace your own review of permissions and code before granting an agent access to sensitive systems.

Pricing

FREEMIUMFree tier, paid plans

Platforms

web

Review

Undermind positions itself as the research agent for people who actually read papers, not just skim abstracts. The autonomy here is in how it searches: you give it a question, it reads hundreds of abstracts, refines its query based on what it finds, and follows citation trails without you babysitting each step. That matters when you're doing a proper literature review and don't want to miss the seminal paper buried under different terminology. I tested it on a niche cardiology question about microvascular dysfunction biomarkers. Standard PubMed searches gave me the usual suspects. Undermind found three older papers I'd missed because they used different terms, and it surfaced a 2019 review that became the backbone of my reading list. The citation tracing is the standout feature: it doesn't just find papers that cite your seed paper, it evaluates which citations are actually relevant to your specific question. That's harder than it sounds and saves hours of dead ends. The interface is clean. You write a question in natural language, it shows you its search strategy, and you can steer it if it's wandering. The free tier gives you a handful of deep searches per month, which is enough to evaluate it but not enough for a systematic review. Paid plans unlock more searches and faster results. Failure modes: it's only as good as what's indexed, so grey literature and preprints are hit-and-miss. It also struggles with very new topics where citation graphs haven't formed yet. And if your question is too broad, it drowns you in results without much filtering. Compared to Elicit, Undermind is narrower but deeper. Elicit is better for quick evidence synthesis across many papers. Undermind is better when you need to be confident you haven't missed a key paper in a specific domain. Consensus sits somewhere in between but leans more towards answering yes/no questions than building a reading list. The research monitoring feature emails you when new papers match your saved searches. I haven't used it long enough to judge if the signal-to-noise ratio holds up, but the idea is sound for anyone tracking a moving field.
Verdict

Pay for this if you're doing systematic reviews or need citation-level precision in a specialist domain. Skip it if you just need quick evidence summaries or work in fast-moving fields where the citation graph lags reality.

Good at

  • Citation tracing evaluates relevance, not just connectivity
  • Adapts search strategy based on what it reads, catches terminology variations
  • Shows its working, lets you steer without starting over
  • Research monitoring for ongoing literature surveillance
  • Free tier sufficient to evaluate on real questions

Watch out

  • Limited coverage of grey literature and preprints
  • Struggles with very recent topics lacking citation networks
  • Broad questions generate overwhelming result sets
  • Free tier too restrictive for regular systematic review work
  • No API or integration with reference managers

Use cases

  • literature search
  • citation tracing
  • research monitoring