[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-72307":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":11,"languages":10,"totalLinesOfCode":10,"stars":12,"forks":13,"watchers":14,"openIssues":15,"contributorsCount":16,"subscribersCount":16,"size":16,"stars1d":17,"stars7d":18,"stars30d":19,"stars90d":16,"forks30d":16,"starsTrendScore":20,"compositeScore":21,"rankGlobal":10,"rankLanguage":10,"license":22,"archived":23,"fork":23,"defaultBranch":24,"hasWiki":25,"hasPages":25,"topics":26,"createdAt":10,"pushedAt":10,"updatedAt":33,"readmeContent":34,"aiSummary":35,"trendingCount":16,"starSnapshotCount":16,"syncStatus":36,"lastSyncTime":37,"discoverSource":38},72307,"py-xiaozhi","huangjunsen0406\u002Fpy-xiaozhi","huangjunsen0406","Open-source AI assistant ecosystem with MCP integrations, multimodal workflows, IoT support, and cross-platform voice interaction.","https:\u002F\u002Fhuangjunsen0406.github.io\u002Fpy-xiaozhi\u002F",null,"Python",3350,700,32,5,0,10,20,44,30,30.54,"MIT License",false,"main",true,[27,28,29,30,31,32],"esp32","esp32-s3","mcp","mcp-server","python","xiaozhi-esp32","2026-06-12 02:03:01","\u003Cp align=\"center\" class=\"trendshift\">\n  \u003Ca href=\"https:\u002F\u002Ftrendshift.io\u002Frepositories\u002F14130\" target=\"_blank\">\n    \u003Cimg src=\"https:\u002F\u002Ftrendshift.io\u002Fapi\u002Fbadge\u002Frepositories\u002F14130\" alt=\"Trendshift\" style=\"width: 250px; height: 55px;\" width=\"250\" height=\"55\"\u002F>\n  \u003C\u002Fa>\n\u003C\u002Fp>\n\u003Cp align=\"center\">\n  \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fhuangjunsen0406\u002Fpy-xiaozhi\u002Freleases\u002Flatest\">\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fv\u002Frelease\u002Fhuangjunsen0406\u002Fpy-xiaozhi?style=flat-square&logo=github&color=blue\" alt=\"Release\"\u002F>\n  \u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fopensource.org\u002Flicenses\u002FMIT\">\n    
\u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FLicense-MIT-green.svg?style=flat-square\" alt=\"License: MIT\"\u002F>\n  \u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fhuangjunsen0406\u002Fpy-xiaozhi\u002Fstargazers\">\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuangjunsen0406\u002Fpy-xiaozhi?style=flat-square&logo=github\" alt=\"Stars\"\u002F>\n  \u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fhuangjunsen0406\u002Fpy-xiaozhi\u002Freleases\u002Flatest\">\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fdownloads\u002Fhuangjunsen0406\u002Fpy-xiaozhi\u002Ftotal?style=flat-square&logo=github&color=52c41a1&maxAge=86400\" alt=\"Download\"\u002F>\n  \u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fgitee.com\u002Fhuang-jun-sen\u002Fpy-xiaozhi\">\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FGitee-FF5722?style=flat-square&logo=gitee\" alt=\"Gitee\"\u002F>\n  \u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fhuangjunsen0406.github.io\u002Fpy-xiaozhi\u002Fguide\u002F00_%E6%96%87%E6%A1%A3%E7%9B%AE%E5%BD%95.html\">\n    \u003Cimg alt=\"Documentation\" src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F使用文档-点击查看-blue?labelColor=2d2d2d\" \u002F>\n  \u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fatomgit.com\u002Fhuangjunsen0406\u002Fpy-xiaozhi\">\n    \u003Cimg src=\".\u002Fassets\u002FAtomGit.svg\" alt=\"AtomGit\" height=\"20\"\u002F>\n  \u003C\u002Fa>\n\u003C\u002Fp>\n\n简体中文 | [English](README.en.md)\n\n## Introduction\n\npy-xiaozhi is a Xiaozhi voice client implemented in Python. It is meant for learning from the code and for trying the Xiaozhi AI voice features without dedicated hardware.\nThis repository is a port of [xiaozhi-esp32](https:\u002F\u002Fgithub.com\u002F78\u002Fxiaozhi-esp32).\n\n> **Important**\n>\n> - Read the [project documentation](https:\u002F\u002Fhuangjunsen0406.github.io\u002Fpy-xiaozhi\u002F) first; it contains the startup tutorial and the configuration instructions\n> - main holds the latest code; reinstall the pip dependencies after every update\n> - **If you have already built your own development on top of this project, do not merge the latest code directly**. The architecture of the new version has been heavily refactored, and a forced merge will cause a large number of conflicts. Either keep maintaining your work from the old version, or re-adapt it using the new architecture as a reference\n> - 
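[Getting started with the Xiaozhi client from scratch (video tutorial)](https:\u002F\u002Fwww.bilibili.com\u002Fvideo\u002FBV1dWQhYEEmq\u002F)\n\n## Demo\n\n- [Bilibili demo video](https:\u002F\u002Fwww.bilibili.com\u002Fvideo\u002FBV1HmPjeSED2\u002F)\n\n![System interface](.\u002Fdocuments\u002Fdocs\u002Fguide\u002Fimages\u002F系统界面.png)\n\n## Features\n\n- **AI voice interaction**: speech input and recognition for natural, fluid conversation\n- **Visual multimodality**: image recognition and processing to understand image content\n- **Smart wake-up**: hands-free activation with multiple wake words (configurable)\n- **Auto conversation mode**: continuous dialogue for smoother interaction\n- **MCP tool ecosystem**: music playback, camera, screenshots, application management, weather lookup, volume control\n- **Opus codec**: audio encoding and decoding with real-time resampling\n- **Wake word detection**: offline recognition based on Sherpa-ONNX, with support for multiple wake words and pinyin matching\n- **Multiple interface modes**: GUI (PySide6 + QML) \u002F CLI \u002F GPIO, to suit different environments\n- **System tray & global hotkeys**: runs in the background with quick actions\n- **WebSocket \u002F MQTT**: dual-protocol communication with WSS encrypted transport\n- **Device activation**: v1\u002Fv2 protocols, automatic verification codes and device fingerprints\n- **Cross-platform**: Windows 10+ \u002F macOS 10.15+ \u002F Linux\n\n## Related Projects\n\n- [xiaozhi-desktop](https:\u002F\u002Fxiaozhi.junsen.online): Electron desktop edition with AEC echo cancellation and display modes such as Live2D and a floating window; Windows \u002F macOS installers are provided\n\n## Quick Start\n\n**Requirements**: Python >= 3.10, a microphone and speakers, and a stable network connection\n\n```bash\n# Clone the project\ngit clone https:\u002F\u002Fgithub.com\u002Fhuangjunsen0406\u002Fpy-xiaozhi.git\ncd py-xiaozhi\n\n# Base install (CLI \u002F GPIO modes)\nuv sync                        # recommended\n# or: pip install -e .\n\n# GUI mode (additionally installs PySide6 + qasync)\nuv sync --extra gui            # recommended\n# or: pip install -e '.[gui]'\n\n# Run\npython main.py                 # GUI mode (default)\npython main.py --mode cli      # CLI mode\npython main.py --protocol mqtt # MQTT protocol\n```\n\n## Project Structure\n\n```\npy-xiaozhi\u002F\n├── main.py                     # Application entry point\n├── src\u002F\n│   ├── bootstrap\u002F              # Application bootstrap and dependency injection\n│   ├── core\u002F                   # Core infrastructure (event bus, state management, etc.)\n│   ├── plugins\u002F                # Plugin system (audio, UI, MCP, wake word, hotkeys)\n│   ├── protocols\u002F              # Communication protocols (WebSocket \u002F MQTT)\n│   ├── audio_codecs\u002F           # Audio encoding and decoding\n│   ├── audio_processing\u002F       # Wake word detection\n│   ├── activation\u002F             # Device activation\n│   ├── mcp\u002F                    # MCP tool system\n│   │   └── tools\u002F              # 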
Tool modules (music\u002Fcamera\u002Fscreenshot\u002Fapp\u002Fweather\u002Fvolume)\n│   ├── ui\u002F                     # User interfaces\n│   │   ├── gui\u002F                # PySide6 + QML graphical interface\n│   │   ├── cli\u002F                # Command-line interface\n│   │   └── gpio\u002F               # GPIO embedded interface\n│   └── utils\u002F                  # Utility functions\n├── libs\u002F                       # Third-party native libraries (libopus \u002F webrtc_apm)\n├── models\u002F                     # Voice wake-up models\n├── documents\u002F                  # VitePress documentation site\n└── pyproject.toml              # Project configuration\n```\n\n## State Transitions\n\n```\n                    +----------------+\n                    |                |\n                    v                |\n+------+  wake\u002Fbtn  +------------+  |   +------------+\n| IDLE | ---------> | CONNECTING | -+-> | LISTENING  |\n+------+            +------------+      +------------+\n   ^                                          |\n   |                                          | speech recognition done\n   |        +------------+                    v\n   +------- |  SPEAKING  | \u003C-----------------+\n  play done +------------+\n```\n\n## Contributing\n\nIssues and PRs are welcome. Please make sure that:\n\n1. Code style follows PEP8\n2. The PR includes appropriate tests\n3. 
Related documentation is updated\n\n## Acknowledgements\n\n> Listed in no particular order\n\n[Xiaoxia](https:\u002F\u002Fgithub.com\u002F78)\n[zhh827](https:\u002F\u002Fgithub.com\u002Fzhh827)\n[四博智联-李洪刚](https:\u002F\u002Fgithub.com\u002FSmartArduino)\n[HonestQiao](https:\u002F\u002Fgithub.com\u002FHonestQiao)\n[vonweller](https:\u002F\u002Fgithub.com\u002Fvonweller)\n[孙卫公](https:\u002F\u002Fspace.bilibili.com\u002F416954647)\n[isamu2025](https:\u002F\u002Fgithub.com\u002Fisamu2025)\n[Rain120](https:\u002F\u002Fgithub.com\u002FRain120)\n[kejily](https:\u002F\u002Fgithub.com\u002Fkejily)\n[电波bilibili君](https:\u002F\u002Fspace.bilibili.com\u002F119751)\n[赛搏智能](https:\u002F\u002Fshop115087494.m.taobao.com\u002F?refer=https%3A%2F%2Fm.tb.cn%2F)\n\n## Sponsorship\n\n\u003Cdiv align=\"center\">\n  \u003Cp>Thanks to all sponsors for their support. Whether it is API resources, device compatibility testing, or funding, every contribution helps improve the project\u003C\u002Fp>\n  \u003Ca href=\"https:\u002F\u002Fhuangjunsen0406.github.io\u002Fpy-xiaozhi\u002Fsponsors\u002F\" target=\"_blank\">\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F查看-赞助者名单-brightgreen?style=for-the-badge&logo=github\" alt=\"Sponsor list\">\n  \u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fhuangjunsen0406.github.io\u002Fpy-xiaozhi\u002Fsponsors\u002F\" target=\"_blank\">\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F成为-项目赞助者-orange?style=for-the-badge&logo=heart\" alt=\"Become a sponsor\">\n  \u003C\u002Fa>\n\u003C\u002Fdiv>\n\n## Project Statistics\n\n[![Star History Chart](https:\u002F\u002Fapi.star-history.com\u002Fsvg?repos=huangjunsen0406\u002Fpy-xiaozhi&type=Date)](https:\u002F\u002Fwww.star-history.com\u002F#huangjunsen0406\u002Fpy-xiaozhi&Date)\n\n## License\n\n[MIT License](LICENSE)\n","py-xiaozhi is a Python-based Xiaozhi AI voice client that lets users experience the full set of AI voice features without special hardware. The project provides core features including AI voice interaction, visual multimodal processing, smart wake-up, and an auto conversation mode, and supports an MCP tool ecosystem with tools such as music playback and camera management. It uses the Opus codec for audio processing and communicates securely over the WebSocket and MQTT protocols. It also supports multiple interface modes (GUI\u002FCLI\u002FGPIO), offers a system tray and global hotkeys, and runs on Windows, macOS, and Linux. Whether for personal learning or for trying an AI voice assistant without dedicated hardware, py-xiaozhi is an ideal choice.",2,"2026-06-11 03:41:18","high_star"]