AutoGPT
Pioneering open-source autonomous agent platform with Forge for agent creation, AGBenchmark evaluation and a user-friendly UI.
Delv Safety Grade: C
Score 58/100 · assessed 2026-04-18
AutoGPT is a well-known open-source autonomous agent framework maintained by Significant Gravitas, with strong transparency through active GitHub development and comprehensive documentation. However, as an autonomous agent platform designed for task automation, it inherently requires extensive permissions, including filesystem access, shell execution, network operations, and potentially desktop control. The project has faced security concerns in its history, including credential exposure risks and the inherent dangers of autonomous code execution. Whilst the maintainer is established in the AI agent space, the project is backed by a single organisation rather than a major vendor. The supply chain is reasonable, with distribution via standard package managers, but the autonomous nature and broad permission scope create a significant attack surface. Suitable for experienced users who understand the risks of autonomous agents.
Green flags
- Fully open source with active GitHub repository and community
- Comprehensive documentation and AGBenchmark evaluation framework
- Established project with significant community adoption since 2023
- Standard package distribution via pip and Docker
- Transparent development with public issue tracking and changelog
Red flags
- Autonomous agents can execute arbitrary code with minimal human oversight
- Historical security concerns around credential handling and API key exposure
- Broad filesystem and shell access required for task automation features
- Single-organisation maintainer creates bus-factor risk
- Desktop control capabilities enable significant system-level access
Permissions requested
Pricing
Platforms
Review
Pay for the cloud version if you're running agents in production and need uptime. Stick with the open-source version if you're learning or building custom agents. Skip it entirely if you just want to automate a single workflow; there are simpler tools for that.
Good at
- Forge framework makes custom agent development less painful
- AGBenchmark provides rare, objective evaluation metrics
- Strong open-source foundation with active development
- Handles multi-step research tasks better than most alternatives
- Freemium model lets you experiment before committing
Watch out
- Token usage spirals quickly on complex tasks
- Gets stuck in loops when goals are vague or contradictory
- Self-hosting requires non-trivial setup and maintenance
- UI feels underbaked compared to CLI and framework
- Autonomy often slower than just doing the task yourself
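The first two items above (runaway token usage and repeated loops) are commonly mitigated with a guard wrapped around the agent's step loop. The sketch below is a generic illustration only, not AutoGPT's own API: `RunGuard`, `record`, and the default limits are hypothetical names and values chosen for this example, and it assumes your agent loop can report each action and its token cost.

```python
from collections import Counter
from dataclasses import dataclass, field


class BudgetExceeded(RuntimeError):
    """Raised when a run uses more tokens than the configured cap."""


class LoopDetected(RuntimeError):
    """Raised when the same action is repeated too many times."""


@dataclass
class RunGuard:
    # Hypothetical defaults; tune them for your own workload.
    max_tokens: int = 50_000
    max_repeats: int = 3
    tokens_used: int = 0
    _seen: Counter = field(default_factory=Counter)

    def record(self, action: str, tokens: int) -> None:
        """Register one agent step; raise if a limit is crossed."""
        self.tokens_used += tokens
        if self.tokens_used > self.max_tokens:
            raise BudgetExceeded(
                f"used {self.tokens_used} tokens, cap is {self.max_tokens}"
            )
        # Normalise case and whitespace so trivial variations of the
        # same action still count as repeats.
        key = " ".join(action.lower().split())
        self._seen[key] += 1
        if self._seen[key] > self.max_repeats:
            raise LoopDetected(
                f"action repeated more than {self.max_repeats} times: {key!r}"
            )
```

In use, you would call `guard.record(action, tokens)` once per step and stop the run when either exception is raised. Exact-match repeat detection is a deliberately simple choice; an agent that rephrases the same failing step each time would slip past it.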
Use cases
- autonomous task execution
- agent experimentation
- workflow building