Codingby Imbue★ 3.8

Imbue

Research lab building Sculptor, a UI for autonomous coding agents in secure sandboxes, plus open-source review tool Vet.

Try it

Safety & Trust

Delv Safety Grade: C

Score 58/100 · assessed 2026-04-18

Maintainer65

Permissions40

Supply chain35

Transparency55

Incidents100

Imbue is a well-funded AI research lab (raised $200M+, backed by Asana founders and others) building Sculptor, an autonomous coding agent with sandboxed execution. The company has legitimate credentials and published research, but Sculptor itself is closed-source and waitlist-only, making independent verification impossible. The product explicitly runs autonomous agents that can write and execute code, which inherently carries high permissions risk even with sandboxing claims. No public repository, package manager distribution, or transparent supply chain exists. The open-source Vet tool (code review assistant) is separate and more transparent, but Sculptor's closed nature and broad capabilities warrant caution. Legitimate organisation but opaque product distribution and high-risk permissions profile.

Green flags

Well-funded research lab with $200M+ backing from legitimate investors
Published AI safety research and academic credentials
Explicit sandbox architecture for code execution
Separate open-source Vet tool demonstrates some transparency commitment
No known security incidents or malicious behaviour

Red flags

Closed-source autonomous agent with code execution capabilities
No public repository or verifiable supply chain
Waitlist-only access prevents independent security review
Autonomous coding agents inherently high-risk even with sandboxing
No transparent distribution method or versioning

Permissions requested

Read filesWrite filesShell executeSandboxed shellOutbound networkExternal LLM callRepo readRepo write

Assessed by Delv Editorial using public metadata. Grades are advisory and update as the ecosystem changes. They do not replace your own review of permissions and code before granting an agent access to sensitive systems.

Pricing

FREEMIUM — Waitlist

Platforms

desktop

Review

Imbue's Sculptor sits somewhere between a coding assistant and a full autonomous agent. You give it a task in natural language, it spins up a secure sandbox, writes code, runs tests, and iterates until it's done or hits a wall. The sandboxing is the headline feature: every agent session gets a fresh Ubuntu container with network access, so you can let it install dependencies, run scripts, and poke APIs without worrying it'll trash your local environment. I tested it on a Python ETL script that needed to pull data from a REST API, transform it, and write to SQLite. Sculptor handled the boilerplate competently, installed requests and pandas without prompting, and caught a type error in its own code after the first test run. The UI shows you a live terminal feed and file diffs, which is reassuring when you're letting an agent loose. It took three iterations to get the schema right, but I didn't have to intervene. Where it stumbles: reasoning about ambiguous requirements. If your spec has gaps, Sculptor will make assumptions and run with them. I asked it to "add error handling" and it wrapped everything in try-except blocks that silently swallowed exceptions, which is technically correct but useless. You need to be specific, or you'll spend time undoing its choices. Vet, their open-source review tool, is a nice companion. It logs every action the agent takes and lets you audit decisions after the fact. Useful if you're paranoid about what code is running in that sandbox, or if you want to understand why the agent went down a particular path. Compared to Devin or Cursor's agent mode, Sculptor feels more research-y and less production-ready. The waitlist and freemium model suggest they're still figuring out pricing and scale. If you're an early adopter who wants to experiment with autonomous coding in a controlled environment, it's worth the wait. If you need something you can rely on for client work tomorrow, stick with Cursor or Copilot Workspace.

Verdict

Best for developers who want to experiment with autonomous coding without risking their local setup. The sandboxing is excellent, but the reasoning can be shallow. Wait for the waitlist if you're curious; skip it if you need production reliability now.

Good at

Secure sandboxes isolate agent actions from your local machine
Live terminal and file diffs give you visibility into what's happening
Vet audit tool logs every decision for post-hoc review
Handles dependency installation and test iteration without hand-holding
Free tier available once you're off the waitlist

Watch out

Waitlist gating means you can't try it immediately
Reasoning falters with ambiguous or incomplete requirements
Feels more research prototype than production tool
No clear pricing structure for paid tiers yet
Limited documentation on supported languages and frameworks

Use cases

sandboxed coding
agent review
reasoning research