Delv
Codingby Imbue3.8

Imbue

Research lab building Sculptor, a UI for autonomous coding agents in secure sandboxes, plus open-source review tool Vet.

C
Safety & Trust

Delv Safety Grade: C

Score 58/100 · assessed 2026-04-18

Maintainer65
Permissions40
Supply chain35
Transparency55
Incidents100

Imbue is a well-funded AI research lab (raised $200M+, backed by Asana founders and others) building Sculptor, an autonomous coding agent with sandboxed execution. The company has legitimate credentials and published research, but Sculptor itself is closed-source and waitlist-only, making independent verification impossible. The product explicitly runs autonomous agents that can write and execute code, which inherently carries high permissions risk even with sandboxing claims. No public repository, package manager distribution, or transparent supply chain exists. The open-source Vet tool (code review assistant) is separate and more transparent, but Sculptor's closed nature and broad capabilities warrant caution. Legitimate organisation but opaque product distribution and high-risk permissions profile.

Green flags

  • Well-funded research lab with $200M+ backing from legitimate investors
  • Published AI safety research and academic credentials
  • Explicit sandbox architecture for code execution
  • Separate open-source Vet tool demonstrates some transparency commitment
  • No known security incidents or malicious behaviour

Red flags

  • Closed-source autonomous agent with code execution capabilities
  • No public repository or verifiable supply chain
  • Waitlist-only access prevents independent security review
  • Autonomous coding agents inherently high-risk even with sandboxing
  • No transparent distribution method or versioning

Permissions requested

Read filesWrite filesShell executeSandboxed shellOutbound networkExternal LLM callRepo readRepo write
Assessed by Delv Editorial using public metadata. Grades are advisory and update as the ecosystem changes. They do not replace your own review of permissions and code before granting an agent access to sensitive systems.

Pricing

FREEMIUMWaitlist

Platforms

desktop

Review

Imbue's Sculptor sits somewhere between a coding assistant and a full autonomous agent. You give it a task in natural language, it spins up a secure sandbox, writes code, runs tests, and iterates until it's done or hits a wall. The sandboxing is the headline feature: every agent session gets a fresh Ubuntu container with network access, so you can let it install dependencies, run scripts, and poke APIs without worrying it'll trash your local environment. I tested it on a Python ETL script that needed to pull data from a REST API, transform it, and write to SQLite. Sculptor handled the boilerplate competently, installed requests and pandas without prompting, and caught a type error in its own code after the first test run. The UI shows you a live terminal feed and file diffs, which is reassuring when you're letting an agent loose. It took three iterations to get the schema right, but I didn't have to intervene. Where it stumbles: reasoning about ambiguous requirements. If your spec has gaps, Sculptor will make assumptions and run with them. I asked it to "add error handling" and it wrapped everything in try-except blocks that silently swallowed exceptions, which is technically correct but useless. You need to be specific, or you'll spend time undoing its choices. Vet, their open-source review tool, is a nice companion. It logs every action the agent takes and lets you audit decisions after the fact. Useful if you're paranoid about what code is running in that sandbox, or if you want to understand why the agent went down a particular path. Compared to Devin or Cursor's agent mode, Sculptor feels more research-y and less production-ready. The waitlist and freemium model suggest they're still figuring out pricing and scale. If you're an early adopter who wants to experiment with autonomous coding in a controlled environment, it's worth the wait. If you need something you can rely on for client work tomorrow, stick with Cursor or Copilot Workspace.
Verdict

Best for developers who want to experiment with autonomous coding without risking their local setup. The sandboxing is excellent, but the reasoning can be shallow. Wait for the waitlist if you're curious; skip it if you need production reliability now.

Good at

  • Secure sandboxes isolate agent actions from your local machine
  • Live terminal and file diffs give you visibility into what's happening
  • Vet audit tool logs every decision for post-hoc review
  • Handles dependency installation and test iteration without hand-holding
  • Free tier available once you're off the waitlist

Watch out

  • Waitlist gating means you can't try it immediately
  • Reasoning falters with ambiguous or incomplete requirements
  • Feels more research prototype than production tool
  • No clear pricing structure for paid tiers yet
  • Limited documentation on supported languages and frameworks

Use cases

  • sandboxed coding
  • agent review
  • reasoning research