Artificial Intelligence

Claude Code, Gemini CLI, GitHub Copilot Agents Vulnerable to Prompt Injection via Comments

A researcher has disclosed the details of the AI attack method he has named ‘Comment and Control’.

| April 16, 2026 (4:33 AM ET)

Updated: April 21, 2026 (2:00 AM ET)

A researcher has disclosed the details of a prompt injection attack method named ‘Comment and Control’, which has been found to work against several popular AI code security and automation tools.

The attack method was discovered by security engineer and vulnerability researcher Aonan Guan, with assistance from Johns Hopkins University researchers Zhengyu Liu and Gavin Zhong.

In a blog post published on Wednesday, Guan said the attack has been confirmed to work against several widely used AI agents: Anthropic’s Claude Code Security Review, Google’s Gemini CLI Action, and GitHub Copilot Agent.

The researchers found that AI agents associated with these tools on GitHub Actions can be hijacked using specially crafted GitHub comments, including PR titles, comments, and issue bodies.

In the case of Claude Code Security Review, designed for automated security reviews, the researchers showed how an attacker could use a specially crafted PR title to trick the AI agent into executing arbitrary commands, extracting credentials, and revealing them as a security finding or an entry in the GitHub Actions log.

For Gemini CLI Action, which acts as an autonomous agent for routine coding tasks, the researchers used an issue comment with a prompt-injection title, along with specially crafted issue comments, to bypass guardrails and obtain a full API key.

Advertisement. Scroll to continue reading.

In the Comment and Control attack aimed at GitHub Copilot Agent, the experts leveraged an HTML comment, which hides the payload, to bypass environment filtering, scan for secrets, and bypass the network firewall.

The Comment and Control attack can pose a serious threat, as the attacker’s malicious prompt is automatically triggered by GitHub Actions workflows, without any action from the victim — except in the case of Copilot, where the attacker’s issue must be manually assigned to Copilot by the victim.

“The pattern likely applies to any AI agent that ingests untrusted GitHub data and has access to execution tools in the same runtime as production secrets — and beyond GitHub Actions, to any agent that processes untrusted input with access to tools and secrets: Slack bots, Jira agents, email agents, deployment automation. The injection surface changes, but the pattern is the same,” Guan explained.

The findings have been reported to Anthropic, Google, and GitHub, and all have confirmed them. Anthropic classified the issue as ‘critical’ and implemented some mitigations, awarding a $100 bug bounty to the researchers. Google paid out a $1,337 bug bounty.

GitHub awarded the researchers $500, saying that their work “sparked some great internal discussions”, but classified the security issue as a known architectural limitation.

“This is the first public cross-vendor demonstration of a single prompt injection pattern across three major AI agents. All three vulnerabilities follow the same pattern: untrusted GitHub data → AI agent processes it → agent executes commands → credentials exfiltrated through GitHub itself,” Guan said.

“The deeper issue is architectural: these AI agents are given powerful tools (bash execution, git push, API calls) and secrets (API keys, tokens) in the same runtime that processes untrusted user input. Even when multiple layers of defense exist — model-level, prompt-level, and GitHub’s additional three runtime layers — they can all be bypassed because the prompt injection here is not a bug; it is context that the agent is designed to process,” he added.

UPDATE: Guan clarified for SecurityWeek that Google addressed the issue by adding new ‘guardrail prompts’ to the system prompt. However, this does not change the underlying threat model or attack scenario, because the capabilities (tools) available to the Gemini agent remain the same.

For Anthropic, the attack method still works in principle, though the specific payload would need to be updated. The company’s remediation was to disallow one tool (‘ps’) rather than adopting a least privilege approach by granting only the tools needed for the security review. The researcher did not attempt a full bypass after the company’s fix, but the underlying issue — overly broad tool access for a “security review agent” — remains unaddressed.

Written By Eduard Kovacs

Eduard Kovacs (@EduardKovacs) is senior managing editor at SecurityWeek. He worked as a high school IT teacher before starting a career in journalism in 2011. Eduard holds a bachelor’s degree in industrial informatics and a master’s degree in computer techniques applied in electrical engineering.

Latest News

Virtual Event: Threat Detection and Incident Response Summit

On-Demand

Delve into big-picture strategies to reduce attack surfaces, improve patch management, conduct post-incident forensics, and tools and tricks needed in a modern organization.

Webinar: Third-Party Risk in Practice

June 4, 2026

Organizations are investing heavily in third-party risk management, but breaches, delays, and blind spots continue to persist. Join this live webinar as we examine the gap between how organizations think their third-party risk programs are performing and what’s actually happening in practice.

SECURITYWEEK NETWORK:

ICS:

SecurityWeek

Artificial Intelligence

Claude Code, Gemini CLI, GitHub Copilot Agents Vulnerable to Prompt Injection via Comments

More from Eduard Kovacs

Latest News

Trending

Virtual Event: Threat Detection and Incident Response Summit

Webinar: Third-Party Risk in Practice

People on the Move

Expert Insights

Raising the Cybersecurity Stakes: Ante up for the Agentic Era

Caught Off Guard: Securing AI After It Hits Production

Cyber Resilience is the New Business Continuity Plan

Enhancing Data Center Security Without Sacrificing Performance

Is the SOC Obsolete, and We Just Haven’t Admitted It Yet?

SECURITYWEEK NETWORK:

ICS:

Daily Briefing Newsletter

More from Eduard Kovacs

Latest News

Trending

Daily Briefing Newsletter

Virtual Event: Threat Detection and Incident Response Summit

Webinar: Third-Party Risk in Practice

People on the Move

Expert Insights

Raising the Cybersecurity Stakes: Ante up for the Agentic Era

Caught Off Guard: Securing AI After It Hits Production

Cyber Resilience is the New Business Continuity Plan

Enhancing Data Center Security Without Sacrificing Performance

Is the SOC Obsolete, and We Just Haven’t Admitted It Yet?

Daily Briefing Newsletter