2026-04-28 05:56:57 网络安全文章来源：ZONE.CI 全球网 0 阅读模式

文章总结： PentestAIAgents项目将ClaudeCode改造为拥有28个专业子智能体的红队作战平台，通过三层架构实现渗透测试全流程自动化。项目采用Swarm编排体系将工作分为规划、侦察、验证、利用等6个阶段，利用SQLite数据库解决上下文限制问题。该方案通过物理隔离和索引传递机制提升AI协作效率，为渗透测试工程师提供可落地的自动化解决方案。 综合评分： 87 文章分类： 渗透测试,红队,安全工具,AI安全,安全运营

cover_image

AI 渗透进入“下半场”：为什么逻辑协同比大模型本身更重要？

原创

APT-101 APT-101

APT-101

2026年4月27日 18:41 陕西

在小说阅读器读本章

去阅读

作为渗透测试工程师的今天，如果你的渗透测试还停留在 “喂一行 Prompt，等一行结果” 的原始阶段，那你可能正在错过一场效率革命。

GitHub项目：Pentest AI Agents。它不是一个简单的脚本库，它是为 Claude Code 注入灵魂的 28 个专业子智能体（Subagents）集合。Pentest AI Agents。这不仅是一套脚本，更是一套成熟的、可落地的 AI 渗透测试操作系统。

一、项目定位：从“对话框”走向“作战室”

一句话定位： 该项目是专为 Claude Code 设计的 28 个专业子智能体（Subagents）集合。

它将 Claude 从一个通用的语言模型，改造为一个拥有多专业背景、具备文件系统读写权限、且自带安全护栏的红队作战平台。它的目标不是替代专家，而是通过自动化繁琐的侦察、分析与报告环节，让专家回归到最高价值的“逻辑博弈”中。

二、架构概览：三层联动体系

项目的整体架构可以概括为：指令层、逻辑层、数据层。

指令层 (.claude/agents/)：28 个 .md 格式的专家定义。
逻辑层 (Swarm Orchestration)：基于任务路由的编排逻辑，负责在不同阶段调用最合适的专家。
数据层 (db/)：基于 Bash 驱动的 SQLite 发现数据库。这是项目的“长期记忆”。

项目根目录/├──&nbsp;.claude/agents/&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;28&nbsp;个专业&nbsp;Agent&nbsp;定义（.md 文件）│&nbsp; &nbsp;├──&nbsp;swarm-orchestrator &nbsp;&nbsp;←&nbsp;总指挥（Swarm&nbsp;编排器）│&nbsp; &nbsp;├──&nbsp;engagement-planner &nbsp;&nbsp;←&nbsp;渗透测试规划师│&nbsp; &nbsp;├──&nbsp;recon-advisor &nbsp; &nbsp; &nbsp; &nbsp;←&nbsp;侦察与枚举顾问│&nbsp; &nbsp;├──&nbsp;web-hunter &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;Web&nbsp;渗透猎人│&nbsp; &nbsp;├──&nbsp;api-security &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;API&nbsp;安全专家│&nbsp; &nbsp;├──&nbsp;vuln-scanner &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;漏洞扫描专家│&nbsp; &nbsp;├──&nbsp;poc-validator &nbsp; &nbsp; &nbsp; &nbsp;←&nbsp;PoC&nbsp;验证专家│&nbsp; &nbsp;├──&nbsp;attack-planner &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;攻击链策划师│&nbsp; &nbsp;├──&nbsp;exploit-chainer &nbsp; &nbsp; &nbsp;←&nbsp;漏洞链式利用专家│&nbsp; &nbsp;├──&nbsp;ad-attacker &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;←&nbsp;Active&nbsp;Directory&nbsp;攻击专家│&nbsp; &nbsp;├──&nbsp;credential-tester &nbsp; &nbsp;←&nbsp;凭证破解专家│&nbsp; &nbsp;├──&nbsp;exploit-guide &nbsp; &nbsp; &nbsp; &nbsp;←&nbsp;漏洞利用指南│&nbsp; &nbsp;├──&nbsp;bizlogic-hunter &nbsp; &nbsp; &nbsp;←&nbsp;业务逻辑漏洞猎人│&nbsp; &nbsp;├──&nbsp;bug-bounty &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;赏金猎人│&nbsp; &nbsp;├──&nbsp;mobile-pentester &nbsp; &nbsp;&nbsp;←&nbsp;移动端渗透专家│&nbsp; &nbsp;├──&nbsp;cloud-security &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;云安全专家（AWS/Azure/GCP）│&nbsp; &nbsp;├──&nbsp;cicd-redteam &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;CI/CD&nbsp;持续红队│&nbsp; &nbsp;├──&nbsp;osint-collector &nbsp; &nbsp; &nbsp;←&nbsp;OSINT&nbsp;情报收集│&nbsp; &nbsp;├──&nbsp;social-engineer &nbsp; &nbsp; &nbsp;←&nbsp;社会工程学专家│&nbsp; &nbsp;├──&nbsp;wireless-pentester &nbsp;&nbsp;←&nbsp;无线网络渗透│&nbsp; &nbsp;├──&nbsp;forensics-analyst &nbsp; &nbsp;←&nbsp;数字取证分析师│&nbsp; &nbsp;├──&nbsp;malware-analyst &nbsp; &nbsp; &nbsp;←&nbsp;恶意软件分析师│&nbsp; &nbsp;├──&nbsp;threat-modeler &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;威胁建模专家│&nbsp; &nbsp;├──&nbsp;detection-engineer &nbsp;&nbsp;←&nbsp;检测规则工程师（蓝队视角）│&nbsp; &nbsp;├──&nbsp;report-generator &nbsp; &nbsp;&nbsp;←&nbsp;报告生成器│&nbsp; &nbsp;├──&nbsp;privesc-advisor &nbsp; &nbsp; &nbsp;←&nbsp;提权顾问│&nbsp; &nbsp;├──&nbsp;stig-analyst &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;STIG&nbsp;合规分析师│&nbsp; &nbsp;├──&nbsp;ctf-solver &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;CTF&nbsp;解题助手│&nbsp; &nbsp;└──&nbsp;_scope-guard&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;←&nbsp;范围守卫（安全护栏）├──&nbsp;.claude/commands/&nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;斜杠命令├──&nbsp;db/&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;Bash&nbsp;驱动的&nbsp;SQLite&nbsp;发现数据库│&nbsp; &nbsp;├──&nbsp;findings.sh &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;←&nbsp;工程会话管理&nbsp;CLI│&nbsp; &nbsp;├──&nbsp;handoff.sh &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;Agent&nbsp;间交接脚本│&nbsp; &nbsp;├──&nbsp;schema.sql &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;7&nbsp;张表的数据库设计│&nbsp; &nbsp;└──&nbsp;lib/&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;←&nbsp;公共库└──&nbsp;examples/&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;←&nbsp;示例输出（报告片段、检测规则等）

三、核心架构：Swarm 编排体系

该项目模仿了现实中红队小队的协作模式，将渗透测试拆解为 6 个 Phase（阶段）：

规划（Planner）：委派 engagement-planner 生成 MITRE ATT&CK 映射计划与 RoE（交战规则）。
侦察（Recon）：调用 recon-advisor 与 osint-collector 建模，结果自动写入 SQLite。
验证（Validator）：poc-validator 负责对漏洞扫描结果进行二阶段验证，过滤 AI 误报。
利用（Exploit）：exploit-chainer 关联多个低危发现，构建端到端攻击路径。

Phase&nbsp;1: 规划阶段┌─────────────────────────────────────────────┐│ swarm-orchestrator (总指挥) &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 engagement-planner &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││ &nbsp; → 制定渗透测试计划、MITRE ATT&CK 映射 &nbsp; &nbsp; &nbsp; ││ &nbsp; → 生成 RoE（交战规则）模板 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;│└─────────────────────────────────────────────┘Phase&nbsp;2: 侦察阶段┌─────────────────────────────────────────────┐│ swarm-orchestrator &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 recon-advisor &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 osint-collector &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 web-hunter &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││ &nbsp; → 收集攻击面情报 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → 端口扫描、子域名枚举、OSINT &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;│└─────────────────────────────────────────────┘Phase&nbsp;3: 漏洞分析与验证┌─────────────────────────────────────────────┐│ swarm-orchestrator &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 vuln-scanner &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 poc-validator &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → 跑 Nuclei/Nessus/OpenVAS &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → 生成 PoC 验证每个发现 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││ &nbsp; → 过滤误报 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; │└─────────────────────────────────────────────┘Phase&nbsp;4: 攻击链构建┌─────────────────────────────────────────────┐│ swarm-orchestrator &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 attack-planner &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││ &nbsp; ↓ 委派给 exploit-chainer &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → 关联多工具发现 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → 构建端到端攻击路径 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → 优先级排序 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; │└─────────────────────────────────────────────┘Phase&nbsp;5: 漏洞利用┌─────────────────────────────────────────────┐│ swarm-orchestrator &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 exploit-guide &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 ad-attacker &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 credential-tester &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 privesc-advisor &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → 执行利用 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││ &nbsp; → 凭证破解 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; → AD 攻击 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││ &nbsp; → 权限提升 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; │└─────────────────────────────────────────────┘Phase&nbsp;6: 报告┌─────────────────────────────────────────────┐│ swarm-orchestrator &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ││ &nbsp; ↓ 委派给 report-generator &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;││ &nbsp; → 生成专业渗透测试报告 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; │└─────────────────────────────────────────────┘

四、 Agent 详细分类：28 位“数字专家”

为了保证专业深度，项目将 Agent 细分为七大类别，涵盖了从移动端、云安全到工业合规（STIG）的全场景覆盖。

指挥协调类：swarm-orchestrator（总指挥）、engagement-planner（规划师）。
侦察枚举类：recon-advisor（扫描）、osint-collector（情报）、web-hunter（Web 猎人）。
深度渗透类：ad-attacker（AD 域控）、api-security（接口安全）、cloud-security（云安全）。
利用验证类：poc-validator（PoC 验证）、exploit-chainer（漏洞链构建）。
特定领域类：mobile-pentester（移动端）、wireless-pentester（无线）、cicd-redteam（持续红队）。
防御与取证类：detection-engineer（检测规则）、forensics-analyst（数字取证）。
辅助产出类：report-generator（报告生成）、stig-analyst（合规分析）。

五、安装与使用手册

要驱动这支“数字红队”，你需要按照以下步骤操作：

1. 环境准备

确保已安装 Claude Code。如果未安装，请执行： npm install -g @anthropic-ai/claude-code

2. 项目克隆与安装

# 克隆项目git&nbsp;clone&nbsp;https://github.com/0xSteph/pentest-ai-agents.gitcd&nbsp;pentest-ai-agents
# 给予安装脚本权限chmod&nbsp;+x install.sh
# 执行安装。建议选 --global 全局安装，方便在任何项目下唤起专家./install.sh --global

3. 快速启动

在你的渗透测试目标目录下开启 claude 终端，直接通过斜杠命令调用：

/swarm：启动总指挥模式，自动开始规划和侦察。
/web-hunter：专门针对 Web 目标开启深度挖掘。
/report-generator：根据 db/ 下记录的漏洞数据，自动生成 Markdown 报告。

六、核心痛点讨论：AI Agent 的工程化“深水区”

在实际使用过程中，有两个 AI 领域的底层问题是任何开发者都无法回避的。Pentest AI Agents 给出了一套极具工程美学的参考答案：

1. 上下文大小（Context Window）的极限博弈

痛点： 渗透测试产生的 Nmap 扫描结果、HTTP 流量包、目录爆破日志动辄数万行。直接塞进对话，会导致 AI 变得“健忘”甚至产生逻辑崩溃。

项目的解法： 物理隔离 + “外挂”索引。它利用子智能体（Subagent）机制，让每一个任务都在独立的 Session 中运行。主进程只通过 SQLite 数据库传递资产的 ID 索引。

例子： 主 Agent 只告诉 web-hunter：“去分析 ID 为 7 的主机。” 子 Agent 随后精准读取数据库中该主机的服务信息。这种 “按需按量读取” 的逻辑，绕过了上下文窗口的物理限制。

# 智能体能力配置文件定义subagents:&nbsp; - name:&nbsp;"ad-attacker"&nbsp; &nbsp; allowed_tools:&nbsp;&nbsp; &nbsp; &nbsp; -&nbsp;"read_file"&nbsp;# 允许读取扫描结果&nbsp; &nbsp; &nbsp; -&nbsp;"write_file"&nbsp;# 允许写入攻击路径&nbsp; &nbsp; &nbsp; -&nbsp;"run_terminal_command"&nbsp; &nbsp;&nbsp;# 强制预读指令&nbsp; &nbsp; pre_run_commands:&nbsp;&nbsp; &nbsp; &nbsp; -&nbsp;"cat ./recon/active_hosts.txt"&nbsp;# 启动即同步状态

2. 结果协同中的“逻辑断层”问题

痛点： 当 Agent A 仅将“扫描结果”传给 Agent B 时，往往丢失了关键的“思考过程”。这可能导致 Agent B 无法理解漏洞的利用环境，从而导致任务失败。

项目的解法： 带推理证据的交接（Handoff with Reasoning）。在 handoff.sh 和各 Agent 的 Prompt 设计中，强制要求输出包含：

Reasoning（逻辑链）：我为什么得出这个结论？排除了哪些干扰项？
Evidence（证据路径）：支撑结论的原始日志在哪一行？通过这种方式，Agent 之间传递的不仅是数据，更是结构化的知识。即使后续 Agent 遇到死路，也能通过回溯推理链快速重定位。

// 核心调度逻辑：决定何时切换 Agentasync&nbsp;function&nbsp;orchestrator(userInput) {&nbsp;&nbsp;// 1. 调用 Planner Agent 生成结构化计划&nbsp;&nbsp;const&nbsp;plan =&nbsp;await&nbsp;callSubagent("engagement-planner", userInput);
&nbsp;&nbsp;// 2. 解析计划中的任务阶段&nbsp;&nbsp;const&nbsp;tasks =&nbsp;parseMarkdownPlan(plan);
&nbsp;&nbsp;for&nbsp;(let&nbsp;task&nbsp;of&nbsp;tasks) {&nbsp; &nbsp;&nbsp;// 3. 根据任务标签动态选择子 Agent&nbsp; &nbsp;&nbsp;const&nbsp;agentName =&nbsp;routeToAgent(task.category);&nbsp;// e.g., "web" -> "web-hunter"
&nbsp; &nbsp;&nbsp;// 4. 传递“上下文简报”而非全量历史&nbsp; &nbsp;&nbsp;const&nbsp;brief = {&nbsp; &nbsp; &nbsp;&nbsp;objective: task.description,&nbsp; &nbsp; &nbsp;&nbsp;prior_findings:&nbsp;await&nbsp;readLocalFile("findings_summary.md"),&nbsp;// 读取共享黑板&nbsp; &nbsp; &nbsp;&nbsp;evidence_path:&nbsp;"./logs/last_scan.json"&nbsp; &nbsp; };
&nbsp; &nbsp;&nbsp;const&nbsp;result =&nbsp;await&nbsp;callSubagent(agentName, brief);
&nbsp; &nbsp;&nbsp;// 5. 强制执行“结果持久化”&nbsp; &nbsp;&nbsp;await&nbsp;updateFindingsSummary(result);&nbsp; }}

七、价值升华：管理复杂性的艺术

网络安全本质上是信息熵与成本的竞赛。

Pentest AI Agents 的价值在于，它将专家的经验（Prompt）与机器的算力（Tooling）进行了完美的解耦与封装。它让我们看到，AI 不仅仅能写代码，它更能像人类一样进行分层思考、协同作战、并遵守严格的安全规范。

项目地址：https://github.com/0xSteph/pentest-ai-agents

免责声明：

本文所载程序、技术方法仅面向合法合规的安全研究与教学场景，旨在提升网络安全防护能力，具有明确的技术研究属性。

任何单位或个人未经授权，将本文内容用于攻击、破坏等非法用途的，由此引发的全部法律责任、民事赔偿及连带责任，均由行为人独立承担，本站不承担任何连带责任。

本站内容均为技术交流与知识分享目的发布，若存在版权侵权或其他异议，请通过邮件联系处理，具体联系方式可点击页面上方的联系我。

本文转载自：APT-101 APT-101 APT-101《AI 渗透进入“下半场”：为什么逻辑协同比大模型本身更重要？》