Artificial Intelligence

Meta Releases Llama AI Open Source Protection Tools

Meta has released new Llama protection tools to help the open source AI community build more secure applications.

| April 30, 2025 (7:03 AM ET)

Facebook parent company Meta on Tuesday announced the release of new open source Llama AI protection tools, along with new AI-enabled solutions for security operations.

The new tools available now for the open source AI community include Llama Guard 4, LlamaFirewall, and Llama Prompt Guard 2.

Intended as a unified safeguard across modalities and providing support for text and image understanding protections, Llama Guard 4 is also available on a new Llama API, which was released in preview.

LlamaFirewall is a fresh security tool for orchestration across guard models that can detect and prevent prompt injections, insecure code, and risky plug-in interactions. It supports existing Meta protection tools, to help developers build secure AI systems.

The updated Llama Prompt Guard classifier model brings improved jailbreak and prompt injection detection, and is accompanied by Prompt Guard 2 22M, a lightweight version for reduced latency and compute costs.

To help organizations improve the efficacy of AI systems in security operations, the internet giant is making AI-enabled tools available for them and is also launching a Llama Defenders Program for select partners.

Advertisement. Scroll to continue reading.

On Tuesday, Meta introduced CyberSOC Eval and AutoPatchBench, two new tools for assessing AI system defenses, both available as part of CyberSec Eval 4, its updated open source cybersecurity benchmark suite.

CyberSOC Eval measures the efficacy of AI systems in security operation centers, while AutoPatchBench evaluates AI systems’ ability to automatically patch vulnerabilities in native code.

The Llama Defenders Program, Meta says, provides organizations and developers with access to various open, early-access, and closed solutions, such as the Automated Sensitive Doc Classification Tool for applying security classification labels to internal documents, and Llama Generated Audio Detector & Llama Audio Watermark Detector, for identifying AI-generated threats, including scams and phishing.

Additionally, Meta is previewing Private Processing, new technology leveraging AI to summarize unread messages or refine them for WhatsApp users. Messages, the company says, remain private, as neither Meta, nor WhatsApp can access them.

“We’re working with the security community to audit and improve our architecture and will continue to build and strengthen Private Processing in the open, in collaboration with researchers, before we launch it in product,” Meta notes.

Written By Ionut Arghire

Ionut Arghire is an international correspondent for SecurityWeek.

SECURITYWEEK NETWORK:

ICS:

SecurityWeek

Artificial Intelligence

Meta Releases Llama AI Open Source Protection Tools

More from Ionut Arghire

Latest News

Trending

Webinar: Closing the Exploitation Gap

Virtual Event: CodeSecCon 2026

People on the Move

Expert Insights

Is Patching Dead? Vulnerability Management in the Post-Mythos Era

When Identity Verification Fails: Lessons from a Real-World SIM Swap and Near Account Takeover

Legacy Systems, Real-World Impacts: The Reality of OT Security

The Shift Toward Business-Aligned Risk Management

How to Conduct a Successful Audit of AI-Driven Software Development

SECURITYWEEK NETWORK:

ICS:

Daily Briefing Newsletter

More from Ionut Arghire

Latest News

Trending

Daily Briefing Newsletter

Webinar: Closing the Exploitation Gap

Virtual Event: CodeSecCon 2026

People on the Move

Expert Insights

Is Patching Dead? Vulnerability Management in the Post-Mythos Era

When Identity Verification Fails: Lessons from a Real-World SIM Swap and Near Account Takeover

Legacy Systems, Real-World Impacts: The Reality of OT Security

The Shift Toward Business-Aligned Risk Management

How to Conduct a Successful Audit of AI-Driven Software Development

Daily Briefing Newsletter