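2026-06-02 04:01:00 | Category: Cybersecurity articles | Source: ZONE.CI 全球网

Article summary: This article describes how to combine Open WebUI with custom skills to build a personal AI assistant, covering service startup, tool configuration, local script writing, and browser automation. Code examples demonstrate skill creation, Playwright integration, and calls to an Ollama model, with a complete walkthrough from environment deployment to actual use. Overall score: 71. Categories: AI security, security tools, secure development, solutions, technical standards.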


Building Your Own Personal AI Assistant with Open WebUI

Original

TP微客

技术分享交流 (Technical Sharing and Exchange)

May 25, 2026, 17:09, Fujian


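1 Introduction

Open WebUI (formerly Ollama WebUI) is an open-source, self-hosted web platform for interacting with large language models (LLMs), and it can run fully offline. This article adds skills to it in order to build a personal AI assistant.

2 Setup

(1) Start the service

The previous article covered how to set up open-webui, so here we start the service directly. The command to start open-webui is: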

docker run -d -p 3000:8080 --add-host=host.docker.internal:host-gateway -v open-webui:/app/backend/data --name open-webui --restart always ghcr.io/open-webui/open-webui:main

(2) Open the page

Open open-webui in a browser using the following address and port:

http://x.x.x.x:3000/

Click "Get started".

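(3) Configuration

After logging in, click Workspace.

Click the Tools tab.

Click to create a new tool.

The editor opens with a default tool, whose code you can use as a reference.

Replace the default code with the tool you need: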

import requests
from pydantic import BaseModel, Field


class Tools:
    def __init__(self):
        pass

    def automate_browser_on_local_pc(
        self,
        instruction: str = Field(
            ..., description="Natural-language user instruction, e.g. 'Open Baidu and search for Ollama'"
        ),
    ) -> str:
        """
        Send a browser automation task to the local Windows PC for execution (the local agent is covered in the next step).
        """
        # Replace with the LAN IP of your Windows machine
        LOCAL_AGENT_URL = "http://x.x.x.x:8002/run"
        try:
            resp = requests.post(
                LOCAL_AGENT_URL,
                json={"instruction": instruction},
                timeout=120,  # give Playwright enough time
            )
            if resp.status_code == 200:
                result = resp.json()
                return result.get("message", "Task completed")
            else:
                error = resp.json().get("error", "Unknown error")
                return f"❌ Local agent returned an error: {error}"
        except requests.RequestException as e:
            return f"❌ Could not connect to the local Windows agent ({LOCAL_AGENT_URL}): {e}"

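(4) Configure the local file

Save the following script on your local machine and fill in the IP address of your Ollama server to match your environment: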

import json
import re

import requests
from flask import Flask, request, jsonify
from playwright.sync_api import sync_playwright

app = Flask(__name__)

_BROWSER = None


def get_browser():
    # Lazily start a single shared Chromium instance.
    # Note: run_automation() below launches its own browser per request and does not call this helper.
    global _BROWSER
    if _BROWSER is None:
        p = sync_playwright().start()
        _BROWSER = p.chromium.launch(
            headless=False,
            args=["--start-maximized"],
        )
    return _BROWSER


def execute_action_in_page(action, page):
    skill = action.get("skill")
    args = action.get("args", {})

    # === Field-name compatibility handling ===
    if skill == "fill_input":
        # Accept selector/element and value/text
        selector = args.get("selector") or args.get("element")
        value = args.get("value") or args.get("text")
        if not selector:
            raise ValueError(f"❌ fill_input is missing selector/element: {args}")
        if value is None:
            raise ValueError(f"❌ fill_input is missing value/text: {args}")
        # Execute
        page.fill(selector, str(value))
        return
    elif skill == "click_element":
        selector = args.get("selector") or args.get("element")
        if not selector:
            raise ValueError(f"❌ click_element is missing selector/element: {args}")
        page.click(selector)
        return
    elif skill == "open_url":
        url = args.get("url") or args.get("href") or args.get("link")
        if not url:
            raise ValueError(f"❌ open_url is missing url: {args}")
        if not url.startswith(("http://", "https://")):
            url = "https://" + url
        page.goto(url)
        page.wait_for_load_state("networkidle", timeout=30000)
        return
    elif skill == "wait_seconds":
        seconds = args.get("seconds", 1)
        page.wait_for_timeout(int(seconds) * 1000)
        return
    else:
        raise ValueError(f"❌ Unknown action: {skill}")


def extract_actions_from_text(text: str):
    actions = []
    # First attempt: the whole response is a JSON array of actions.
    try:
        data = json.loads(text.strip())
        if isinstance(data, list):
            return data
    except json.JSONDecodeError:
        pass

    # Fallback: pull out "skill"/"args" pairs with a regular expression.
    pattern = r'"skill"\s*:\s*"([^"]+)"[^}]*?"args"\s*:\s*(\{[^}]*\})'
    matches = re.findall(pattern, text, re.DOTALL)
    for skill, args_str in matches:
        try:
            args = json.loads(args_str)
            actions.append({"skill": skill, "args": args})
        except json.JSONDecodeError:
            # Last resort: recover individual string fields from malformed args.
            args = {}
            for key in ["selector", "value", "url"]:
                match = re.search(f'"{key}"\\s*:\\s*"([^"]*)"', args_str)
                if match:
                    args[key] = match.group(1)
            if args:
                actions.append({"skill": skill, "args": args})
    return actions


@app.route('/run', methods=['POST'])
def run_automation():
    # Tolerate a missing or non-JSON body instead of failing on None.
    data = request.get_json(silent=True) or {}
    instruction = data.get("instruction", "")

    # === Call the remote Ollama server to generate the action plan ===
    OLLAMA_SERVER = "http://x.x.x.x:11434"  # replace with the address of your Ollama server
    MODEL = "frob/qwen3.5-instruct:4b"
    full_prompt = (
        "You are a browser automation assistant and must break the user's instruction down into a precise sequence of atomic actions.\n"
        "Rules:\n"
        "- You must use the following skills: open_url, fill_input, click_element, wait_seconds\n"
        "- The output must be a pure JSON array in the format: [{\"skill\":\"...\",\"args\":{...}}, ...]\n"
        "- No explanations, comments, Markdown, or extra text\n"
        f"User instruction: {instruction}\nOutput:"
    )
    try:
        resp = requests.post(
            f"{OLLAMA_SERVER}/api/generate",
            json={
                "model": MODEL,
                "prompt": full_prompt,
                "format": "json",
                "stream": False,
                "options": {"temperature": 0.1},
            },
            timeout=60,
        )
        resp.raise_for_status()
        ai_response = resp.json()["response"]
    except Exception as e:
        return jsonify({"error": f"Ollama call failed: {e}"}), 500

    try:
        actions = extract_actions_from_text(ai_response)
        if not actions:
            return jsonify({"error": f"No actions extracted: {ai_response[:200]}"}), 400

        with sync_playwright() as p:
            browser = p.chromium.launch(
                headless=False,
                args=["--start-maximized"],
            )
            page = browser.new_page()
            try:
                for act in actions:
                    execute_action_in_page(act, page)
                message = "✅ Browser automation task completed successfully!"
                # Add a delay here, e.g. wait 5 seconds before closing
                page.wait_for_timeout(5000)  # 5000 ms = 5 seconds
            finally:
                browser.close()  # always close the browser
        return jsonify({"status": "success", "message": message})
    except Exception as e:
        return jsonify({"error": f"Execution failed: {str(e)}"}), 500


if __name__ == '__main__':
    app.run(host='0.0.0.0', port=8002, debug=False)
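
The script above executes whatever plan the model returns and only fails once a step raises inside the browser. As a minimal sketch, assuming you want to reject a malformed plan before launching Chromium, the check below mirrors the skill names and argument aliases accepted by execute_action_in_page. The names REQUIRED_ARGS and validate_plan are hypothetical and not part of the original script.

```python
# Skills and argument aliases mirror execute_action_in_page in the script above.
# Each entry lists groups of aliases; at least one alias per group must be present.
REQUIRED_ARGS = {
    "open_url": [("url", "href", "link")],
    "fill_input": [("selector", "element"), ("value", "text")],
    "click_element": [("selector", "element")],
    "wait_seconds": [],  # "seconds" is optional and defaults to 1 in the script
}


def validate_plan(actions):
    """Return a list of problems found in the plan; an empty list means it looks usable."""
    if not isinstance(actions, list):
        return ["plan is not a JSON array"]
    problems = []
    for index, action in enumerate(actions):
        if not isinstance(action, dict):
            problems.append(f"step {index}: not an object")
            continue
        skill = action.get("skill")
        if skill not in REQUIRED_ARGS:
            problems.append(f"step {index}: unknown skill {skill!r}")
            continue
        args = action.get("args") or {}
        if not isinstance(args, dict):
            problems.append(f"step {index}: args is not an object")
            continue
        for aliases in REQUIRED_ARGS[skill]:
            # A group is satisfied when any of its aliases carries a non-empty value.
            if not any(args.get(name) not in (None, "") for name in aliases):
                problems.append(f"step {index}: {skill} needs one of {'/'.join(aliases)}")
    return problems
```

One possible use is to call validate_plan(actions) right after extract_actions_from_text and return the problems in the 400 response when the list is non-empty.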

First, run the script locally:

python remote_ai_agent.py

Here you can see whether it is running properly.
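
Besides watching the console, you can exercise the agent's /run endpoint directly, without going through Open WebUI. The following is a minimal sketch using only the standard library. The function names build_run_request and send_run_request are hypothetical, and the address in the usage comment assumes the agent runs on the same machine on port 8002, as in the script.

```python
import json
from urllib import request as urlrequest
from urllib.error import HTTPError


def build_run_request(agent_base_url, instruction):
    """Build the same POST /run request that the Open WebUI tool sends to the agent."""
    body = json.dumps({"instruction": instruction}).encode("utf-8")
    return urlrequest.Request(
        agent_base_url.rstrip("/") + "/run",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_run_request(req, timeout=120):
    """Send the request and return (status_code, parsed JSON body)."""
    try:
        with urlrequest.urlopen(req, timeout=timeout) as resp:
            return resp.status, json.loads(resp.read().decode("utf-8"))
    except HTTPError as err:
        # The agent script answers 400/500 with a JSON body that contains "error".
        return err.code, json.loads(err.read().decode("utf-8"))


# Usage (assumes the agent is already running locally):
# status, reply = send_run_request(
#     build_run_request("http://127.0.0.1:8002", "Open Baidu and search for Ollama")
# )
# print(status, reply)
```

A 200 reply carrying a "message" field suggests the whole chain (Flask, Ollama, Playwright) worked; a 500 whose "error" mentions Ollama points at the model address rather than the browser.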

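(5) Install playwright-cli

Download the playwright-cli plugin from its Git repository.

Install the plugin locally.

(6) Start using it

Type what you want done directly into the chat.

For example: visit Baidu and search for the latest AI news.

Note that you must select the tool before running the request.

After it finishes, you will see the execution process.

This article only uses opening Baidu and searching as an example; you can explore other skills to suit your own needs.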
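
For the Baidu example, the model is expected to turn the instruction into a JSON array of skill/args steps. As an illustrative sketch, a plan of that shape could look like the one built below. The helper build_search_plan is hypothetical, and the selectors "#kw" and "#su" are assumptions about Baidu's page markup that should be checked against the live page.

```python
import json


def build_search_plan(url, input_selector, query, button_selector):
    """Build a plan in the [{"skill": ..., "args": {...}}] format that the agent script parses."""
    return [
        {"skill": "open_url", "args": {"url": url}},
        {"skill": "fill_input", "args": {"selector": input_selector, "value": query}},
        {"skill": "click_element", "args": {"selector": button_selector}},
        {"skill": "wait_seconds", "args": {"seconds": 2}},
    ]


# "#kw" and "#su" are assumed selectors for Baidu's search box and search button.
plan = build_search_plan("https://www.baidu.com", "#kw", "latest AI news", "#su")

# This string is what a well-behaved model response would contain.
plan_json = json.dumps(plan, ensure_ascii=False)
print(plan_json)
```

In practice the model chooses the selectors itself, so a small model may guess wrong ones; comparing its output against a hand-written plan like this can help when a step fails.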
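
One way to add further skills is to replace the if/elif chain in the agent script with a lookup table. The following is a minimal sketch of that idea, not the author's implementation: the SKILLS registry, the skill decorator, run_action, and the press_key skill are all hypothetical. FakePage is a stand-in that records calls so the sketch runs without a browser; with a real Playwright page, page.keyboard.press and page.wait_for_timeout are the calls being made.

```python
SKILLS = {}


def skill(name):
    """Decorator that registers a handler under the skill name the model should emit."""
    def register(func):
        SKILLS[name] = func
        return func
    return register


@skill("press_key")  # hypothetical skill, not part of the original script
def press_key(page, args):
    # Send a single key press; defaults to Enter, e.g. to submit a search box.
    page.keyboard.press(args.get("key", "Enter"))


@skill("wait_seconds")
def wait_seconds(page, args):
    page.wait_for_timeout(int(args.get("seconds", 1)) * 1000)


def run_action(action, page):
    """Look up the handler for action["skill"] and call it with the action's args."""
    name = action.get("skill")
    handler = SKILLS.get(name)
    if handler is None:
        raise ValueError(f"unknown skill: {name}")
    handler(page, action.get("args", {}))


class FakePage:
    """Stand-in for a Playwright page that only records the calls made on it."""

    def __init__(self):
        self.calls = []
        self.keyboard = self  # lets page.keyboard.press(...) resolve to press() below

    def press(self, key):
        self.calls.append(("press", key))

    def wait_for_timeout(self, ms):
        self.calls.append(("wait", ms))


page = FakePage()
run_action({"skill": "press_key", "args": {"key": "Enter"}}, page)
run_action({"skill": "wait_seconds", "args": {"seconds": 2}}, page)
print(page.calls)
```

Any skill added this way also has to be listed in the prompt sent to Ollama, since the prompt tells the model which skill names it may use.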



Reposted from: 技术分享交流 (TP微客), "Building Your Own Personal AI Assistant with Open WebUI" (结合Open WebUI，搭建专属个人的AI助手)