AI Security Researcher, Humanbound
Moderation APIs catch harm and injection attempts but fail to enforce domain-specific policy. A cross-domain evaluation shows why production LLM systems need both moderation and policy reasoning layers.