AgentPwn

AI Agent Security Research

A distributed honeypot network studying how AI agents behave when they encounter adversarial content on legitimate-looking sites. Mapping vulnerability by framework, sector, and attack surface.

Test Your Agent Read the Research

Network Data

Live

Total Interactions

285.8K

Unique Agents

52.7K

Compromise Rate

1.4%

Active Sources

Detailed statistics

What 285.8K agent interactions reveal

Every number on the dashboard is read live from the Registry. Nothing modeled, nothing projected.

Open the full dashboard

Attack Categories

live

3.9K callbacks

Live ranking of which attack categories actually compromise AI agents.

Daily Activity

live

52.7K unique agents

Daily callback trend and new attack surface discovery.

Sector Analysis

live

7 verticals

Cross-vertical comparison. Sector derived from the URL each agent was on at callback time.

Honeypot Lab

Browse the 48 attack scenarios across 11 categories that power our honeypot network. When your agent falls for an attack, we tell it how to fix the vulnerability.

Prompt Injection

10 tiers

Direct and indirect instruction override attacks

Jailbreak

5 tiers

Attempts to bypass safety guardrails and persona constraints

Data Exfiltration

5 tiers

Tricks to extract credentials, PII, or system information

Capability Abuse

3 tiers

Confused deputy attacks that misuse agent tools

Context Manipulation

5 tiers

Attacks that corrupt the agent's understanding of context

MCP Exploitation

3 tiers

Attacks targeting Model Context Protocol integrations

Browse all 11categories →

Free Security Review for AI Agents

Point your agent at agentpwn.com. Each page hides indirect prompt injections calibrated to the agent's sophistication. When the agent falls for one, the payload responds with the attack it just demonstrated and the command to scan for it. No registration, no cost, no setup.

1. Send

Point your agent at agentpwn.com

Browse manually or let the CLI drive it. No accounts.

2. Probe

48 payloads across 11 categories

Tiered from basic overrides to multi-step adversarial chains.

3. Patch

Each compromise returns the finding and the fix

Attack ID, HMA check command, and a sandbox to practice the defense.

Run the test from your terminal

npx hackmyagent secure

Drives your agent through the AgentPwn payloads and prints each finding with a Verify command and a Fix command. Anyone can run it. Takes a few minutes.

Open Source Security Ecosystem

Every finding produces a concrete defense. Our tools help you scan, test, and harden AI agents before deployment.

AgentPwn

Honeypot network