RAG Compliance Week 4: 100% Recall

This is the final post in my 4-week RAG compliance series.

Week 1: I built an Enforcement Engine. 80% F1 on compliance.
Week 2: Llama Guard hit 53% F1.
Week 3: I added prompt injection testing. NeMo hit 55% recall. Enforcement engine hit 93%.

4 attacks still got through. 4 too many. Today: 100% recall. 0 missed. 490 test cases.

The Accuracy Paradox

v2 accuracy dropped from 68% to 65%. Why? It blocks 7 more benign queries to eliminate the final 4 missed attacks.

For security systems, blocking benign queries is preferable to missing attacks.