Jules
Async coding agent by Google that clones your repo into a cloud VM, plans tasks, runs tests and opens PRs powered by Gemini.
Delv Safety Grade: B
Score 72/100 · assessed 2026-04-18
Jules is Google's async coding agent powered by Gemini, offering strong maintainer credentials as a major tech vendor product. However, it presents significant permission concerns: cloning repositories into Google's cloud VMs, executing arbitrary code, running tests, and opening PRs requires extensive access to your codebase and GitHub account. The lack of a public repository severely limits transparency - users cannot inspect the agent's code or verify its behaviour. Supply chain is moderately strong given Google's infrastructure, but the closed-source nature and broad permissions create meaningful trust dependencies. The freemium model with Google AI plans provides accessibility, but the extensive automated capabilities (code execution, git operations, PR creation) demand careful consideration of what repositories you grant access to. No known security incidents, but the opacity and scope warrant caution.
Green flags
- Maintained by Google - major vendor with strong security practices
- Async operation reduces local resource requirements
- Integrated with Google AI infrastructure and Gemini models
- No known security incidents or breaches
Red flags
- No public repository - completely closed source, cannot inspect code
- Clones entire repo to Google cloud VM - full codebase exposure
- Executes arbitrary code and tests in Google's infrastructure
- Requires GitHub write access to open PRs automatically
- Limited transparency into data retention and processing policies
Permissions requested
Pricing
Platforms
Review
Pay for this if you maintain multiple repos with solid test coverage and a backlog of grunt work. Skip it if your codebase is under-tested or your tasks require nuanced judgment calls.
Good at
- Genuinely autonomous: runs end-to-end without supervision
- Excellent at dependency bumps and test-driven refactors
- Gemini handles stack traces and retries intelligently
- Free tier is usable for small teams on Google AI plans
- CLI and GitHub integration both work without friction
Watch out
- Struggles with vague or ambiguous requirements
- Assumes your test suite is comprehensive and reliable
- Rate limits on free tier hit fast for larger repos
- Audio changelog feature feels like a gimmick
- Not suitable for tasks requiring architectural judgment
Use cases
- async code changes
- dependency bumps
- audio changelogs