AI Security Engineer

Inspiration

Security is always an afterthought for a software engineer. But it doesn't have to be. With an AI security engineer, you've a teammate who is constantly looking for vulnerabilities and also creating patches to remediate them.

What it does

It performs different kinds of scanners on the code repository, e.g. static, sca or secrets. Once it identifies the vulnerabilities, it gives the user to auto-fix the vulnerability. That autofix will create a pull request which remediates the vulnerability.

How we built it

We used the Claude 4.5 API to build the auto-fix functionality. Other than that, we have integrated a few open-source security scanners to find vulnerabilities. We have built safeguards while building this functionality:

We run the scanners after applying the patch to ensure we have remediated the vulnerability and have not added new vulnerabilities
We also build and test the repository to ensure that we are not introducing bugs.
We use LLM as a judge (Claude Opus 4.1) to get a verdict on the quality of PR.

Challenges we ran into

Sometimes the code given by LLMs does not work, so we have built retries into it.
There are many failure modes in the whole process, so currently we log all the failures so that we can analyse them.
Developers do not understand security issues, so it requires explaining the issues to them.

Accomplishments that we're proud of

We were able to find vulnerabilities and raise Pull Requests in almost all trending GitHub repositories. In total, we raised > 100 PRs.
A lot of PRs got merged as well, and they thanked us for the contribution.

What we learned

Creating an automated PR is still risky, as there can be unintended consequences of the code, so we have to continuously build guardrails.
Developer community has not yet warmed to AI-created PRs, so sometimes they shrug it off even though the PR is genuine.
Developers do not create about a security vulnerability until you show them how it can be exploited; otherwise, it is a nice-to-have feature.

What's next for an AI Security Engineer

The space of code security does not have very good public benchmarks/evals. We would like to build a benchmark and contribute it to the open source.
Take AI security engineer to early-mid stage startups to see how they can use it in their workflow.

Built With

Updates

Anupam Mediratta started this project — Oct 05, 2025 10:25 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.