Software2 months

Security Red-Teaming for Code Gen Assistant

Client

DevTools Startup

Overview

A startup building a coding assistant needed to ensure their tool did not introduce security vulnerabilities into user codebases.

The Challenge

The model was susceptible to prompt injection, often revealing system instructions and generating SQL injection vulnerabilities when asked.

Our Solution

We performed aggressive red-teaming, creating a library of 500+ attack vectors. We then fine-tuned the model on "refusal" datasets and implemented output filtering.

Tech Stack

Azure OpenAILangChainPythonNemo GuardrailsPinecone

Key Results

security Score

Improved from C- to A+

vulnerability Block

100% of known injection attacks blocked

trust

Successfully passed SOC2 Type II AI controls

Security Score Improved from C- to A+