Cosine
Cosine's Genie agent for software engineering — purpose-trained on real PR data. Strong on multi-step bug-fix tasks.
Delv Safety Grade: C
Score 58/100 · assessed 2026-04-18
Cosine Genie is a commercial autonomous coding agent from a startup (Cosine) with no public repository or open-source transparency. The agent executes multi-step workflows including writing code, running tests, and opening pull requests, which requires broad repository write access, shell execution for tests, and filesystem write permissions. The closed-source nature means you cannot audit what the agent does with your codebase or credentials. Supply chain is opaque: no public package, no verifiable build process, likely SaaS-only delivery. The company appears legitimate (professional website, paid product) but is small with unknown bus factor. No known security incidents, but the combination of autonomous operation, broad permissions, and zero code transparency creates meaningful risk for production codebases. Best suited for non-critical projects where you can tolerate potential data exposure.
Green flags
- Purpose-trained on real PR data for practical software engineering tasks
- Professional paid product with clear commercial backing
- No known security incidents or credential leaks
- Specific use case (bug fixes, refactors) with demonstrated value
Red flags
- No public repository or source code available for audit
- Autonomous agent with repo write and shell execute permissions
- Closed-source SaaS with opaque data handling and security practices
- Small vendor with unknown team size and bus factor
- No verifiable supply chain or package distribution method
Permissions requested
Pricing
Platforms
Review
Best for teams with solid test coverage who need to clear backlogs of well-defined bugs and refactors. Skip it if your work is exploratory or your repo is a mess—Genie needs structure to shine.
Good at
- Trained on real PR data, writes idiomatic code that fits your repo's style
- Genuine multi-step autonomy: runs tests, iterates on failures, opens PRs without hand-holding
- Async workflow lets you queue tasks and review later, good for solo founders
- Strong at tedious but scoped work like API migrations or flaky test fixes
- CLI-first design fits into existing dev workflows
Watch out
- Expensive for individual developers compared to standard AI tools
- Struggles with vague or exploratory tasks, needs clear scope
- Requires decent test coverage to verify its own work effectively
- Won't make architectural decisions or handle novel integrations
- Web interface feels underbaked compared to the CLI
Use cases
- End-to-end bug-fix tasks
- Long-running refactor plans
- Engineering reviews of agent suggestions
- Async coding work for solo founders