[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-73439":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":11,"languages":10,"totalLinesOfCode":10,"stars":12,"forks":13,"watchers":14,"openIssues":15,"contributorsCount":16,"subscribersCount":16,"size":16,"stars1d":17,"stars7d":18,"stars30d":19,"stars90d":16,"forks30d":16,"starsTrendScore":20,"compositeScore":21,"rankGlobal":10,"rankLanguage":10,"license":22,"archived":23,"fork":23,"defaultBranch":24,"hasWiki":23,"hasPages":23,"topics":25,"createdAt":10,"pushedAt":10,"updatedAt":34,"readmeContent":35,"aiSummary":36,"trendingCount":16,"starSnapshotCount":16,"syncStatus":37,"lastSyncTime":38,"discoverSource":39},73439,"midscene","web-infra-dev\u002Fmidscene","web-infra-dev","AI-powered, vision-driven UI automation for every platform.","https:\u002F\u002Fmidscenejs.com",null,"TypeScript",13659,1038,67,41,0,48,150,683,144,119.05,"MIT License",false,"main",[26,27,28,29,30,31,32,33],"ai","ai-test","browser-use","computer-use","gpt-operator","javascript","phone-use","testing","2026-06-12 04:01:09","\u003Cp align=\"center\">\n  \u003Cimg alt=\"Midscene.js\"  width=\"260\" src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002Ff60de3c1-dd6f-4213-97a1-85bf7c6e79e4\">\n\u003C\u002Fp>\n\n\u003Ch1 align=\"center\">Midscene.js\u003C\u002Fh1>\n\u003Cdiv align=\"center\">\n\nEnglish | [简体中文](.\u002FREADME.zh.md)\n\n\u003Cstrong>Official Website\u003C\u002Fstrong>: \u003Ca href=\"https:\u002F\u002Fmidscenejs.com\u002F\">https:\u002F\u002Fmidscenejs.com\u002F\u003C\u002Fa>\n\n\u003Ca href=\"https:\u002F\u002Ftrendshift.io\u002Frepositories\u002F12524\" target=\"_blank\">\u003Cimg src=\"https:\u002F\u002Ftrendshift.io\u002Fapi\u002Fbadge\u002Frepositories\u002F12524\" alt=\"web-infra-dev%2Fmidscene | Trendshift\" style=\"width: 250px; height: 55px;\" width=\"250\" height=\"55\"\u002F>\u003C\u002Fa>\n\n\u003C\u002Fdiv>\n\n\u003Cp align=\"center\">\n  AI-powered, vision-driven UI automation for every platform.\n\u003C\u002Fp>\n\n\u003Cp align=\"center\">\n  \u003Ca href=\"https:\u002F\u002Fwww.npmjs.com\u002Fpackage\u002F@midscene\u002Fweb\">\u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fnpm\u002Fv\u002F@midscene\u002Fweb?style=flat-square&color=00a8f0\" alt=\"npm version\" \u002F>\u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fhuggingface.co\u002FByteDance-Seed\u002FUI-TARS-1.5-7B\">\u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FUI%20TARS%20Models-yellow\" alt=\"hugging face model\" \u002F>\u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fnpm-compare.com\u002F@midscene\u002Fweb\u002F#timeRange=THREE_YEARS\">\u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fnpm\u002Fdm\u002F@midscene\u002Fweb.svg?style=flat-square&color=00a8f0\" alt=\"downloads\" \u002F>\u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene\u002Fblob\u002Fmain\u002FLICENSE\">\u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FLicense-MIT-blue.svg?style=flat-square&color=00a8f0\" alt=\"License\" \u002F>\n  \u003Ca href=\"https:\u002F\u002Fdiscord.gg\u002F2JyBHxszE4\">\u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fdiscord\u002F1328277792730779648?style=flat-square&color=7289DA&label=Discord&logo=discord&logoColor=white\" alt=\"discord\" \u002F>\u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fx.com\u002Fmidscene_ai\">\u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Ftwitter\u002Ffollow\u002Fmidscene_ai?style=flat-square\" alt=\"twitter\" \u002F>\u003C\u002Fa>\n  \u003Ca href=\"https:\u002F\u002Fdeepwiki.com\u002Fweb-infra-dev\u002Fmidscene\">\n    \u003Cimg alt=\"Ask DeepWiki.com\" src=\"https:\u002F\u002Fdevin.ai\u002Fassets\u002Fdeepwiki-badge.png\" style=\"height: 18px; vertical-align: middle;\" \u002F>\n  \u003C\u002Fa>\n\u003C\u002Fp>\n\n## 📣 Midscene Skills is here!\n\nUse [Midscene Skills](https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene-skills) to control any platform with [OpenClaw](https:\u002F\u002Fgithub.com\u002FOpenClaw\u002FOpenClaw) \n\n## Showcases\n\n* [Web Automation - Automatically register the GitHub form in a web browser and pass all field validations](https:\u002F\u002Fmidscenejs.com\u002Fshowcases#web)\n* [iOS Automation - Meituan coffee order](https:\u002F\u002Fmidscenejs.com\u002Fshowcases#ios)\n* [iOS Automation - Auto-like the first @midscene_ai tweet](https:\u002F\u002Fmidscenejs.com\u002Fshowcases#ios)\n* [Android Automation - DCar: Xiaomi SU7 specs](https:\u002F\u002Fmidscenejs.com\u002Fshowcases#android)\n* [Android Automation - Booking a hotel for Christmas](https:\u002F\u002Fmidscenejs.com\u002Fshowcases#android)\n* [MCP Integration - Midscene MCP UI prepatch release](https:\u002F\u002Fmidscenejs.com\u002Fshowcases#mcp)\n* [robotic arm + vision + voice for in-vehicle testing](https:\u002F\u002Fmidscenejs.com\u002Fshowcases#community-showcases)\n\n## 💡 Features\n\n### Write Automation with Natural Language\n- Describe your goals and steps, and Midscene will plan and operate the user interface for you.\n- Use Javascript SDK or YAML to write your automation script.\n\n### Web & Mobile App & Any Interface\n- **Web Automation**: Either integrate with [Puppeteer](https:\u002F\u002Fmidscenejs.com\u002Fintegrate-with-puppeteer), [Playwright](https:\u002F\u002Fmidscenejs.com\u002Fintegrate-with-playwright) or use [Bridge Mode](https:\u002F\u002Fmidscenejs.com\u002Fbridge-mode) to control your desktop browser.\n- **Android Automation**: Use [Javascript SDK](https:\u002F\u002Fmidscenejs.com\u002Fandroid-getting-started) with adb to control your local Android device.\n- **iOS Automation**: Use [Javascript SDK](https:\u002F\u002Fmidscenejs.com\u002Fios-getting-started) with WebDriverAgent to control your local iOS devices and simulators.\n- **Any Interface Automation**: Use [Javascript SDK](https:\u002F\u002Fmidscenejs.com\u002Fintegrate-with-any-interface) to control your own interface.\n\n### For Developers\n- **Three kinds of APIs**:\n  - [Interaction API](https:\u002F\u002Fmidscenejs.com\u002Fapi#interaction-methods): interact with the user interface.\n  - [Data Extraction API](https:\u002F\u002Fmidscenejs.com\u002Fapi#data-extraction): extract data from the user interface and dom.\n  - [Utility API](https:\u002F\u002Fmidscenejs.com\u002Fapi#more-apis): utility functions like `aiAssert()`, `aiLocate()`, `aiWaitFor()`.\n- **MCP**: Midscene provides MCP services that expose atomic Midscene Agent actions as MCP tools so upper-layer agents can inspect and operate UIs with natural language. [Docs](https:\u002F\u002Fmidscenejs.com\u002Fmcp)\n- [**Caching for Efficiency**](https:\u002F\u002Fmidscenejs.com\u002Fcaching): Replay your script with cache and get the result faster.\n- **Debugging Experience**: Midscene.js offers a visualized replay back report file, a built-in playground, and a Chrome Extension to simplify the debugging process. These are the tools most developers truly need.\n\n\n## 👉 Zero-code Quick Experience\n\n- **[Chrome Extension](https:\u002F\u002Fmidscenejs.com\u002Fquick-experience)**: Start in-browser experience immediately through [the Chrome Extension](https:\u002F\u002Fmidscenejs.com\u002Fquick-experience), without writing any code.\n- **[Android Playground](https:\u002F\u002Fmidscenejs.com\u002Fandroid-getting-started)**: There is also a built-in Android playground to control your local Android device.\n- **[iOS Playground](https:\u002F\u002Fmidscenejs.com\u002Fios-getting-started)**: There is also a built-in iOS playground to control your local iOS device.\n\n## ✨ Driven by Visual Language Model\n\nMidscene.js is all-in on the pure-vision route for UI actions: element localization and interactions are based on screenshots only. It supports visual-language models like `Qwen3-VL`, `Doubao-1.6-vision`, `gemini-3-pro`, and `UI-TARS`. For data extraction and page understanding, you can still opt in to include DOM when needed.\n\n* Pure-vision localization for UI actions; the DOM extraction mode is removed.\n* Works across web, mobile, desktop, and even `\u003Ccanvas>` surfaces.\n* Far fewer tokens by skipping DOM for actions, which cuts cost and speeds up runs.\n* DOM can still be included for data extraction and page understanding when needed.\n* Strong open-source options for self-hosting.\n\nRead more about [Model Strategy](https:\u002F\u002Fmidscenejs.com\u002Fmodel-strategy)\n\n\n\n## 📄 Resources \n\n* Official Website: [https:\u002F\u002Fmidscenejs.com](https:\u002F\u002Fmidscenejs.com\u002F)\n* Documentation: [https:\u002F\u002Fmidscenejs.com](https:\u002F\u002Fmidscenejs.com\u002F)\n* Sample Projects: [https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene-example](https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene-example)\n* API Reference: [https:\u002F\u002Fmidscenejs.com\u002Fapi](https:\u002F\u002Fmidscenejs.com\u002Fapi)\n* GitHub: [https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene](https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene)\n\n## 🤝 Community\n\n* [Discord](https:\u002F\u002Fdiscord.gg\u002F2JyBHxszE4)\n* [Follow us on X](https:\u002F\u002Fx.com\u002Fmidscene_ai)\n* [Lark Group(飞书交流群)](https:\u002F\u002Fapplink.larkoffice.com\u002Fclient\u002Fchat\u002Fchatter\u002Fadd_by_link?link_token=693v0991-a6bb-4b44-b2e1-365ca0d199ba)\n\n## 🌟 Awesome Midscene\n\nCommunity projects that extend Midscene.js capabilities:\n\n* [midscene-ios](https:\u002F\u002Fgithub.com\u002Flhuanyu\u002Fmidscene-ios) - iOS Mirror automation support for Midscene\n* [midscene-pc](https:\u002F\u002Fgithub.com\u002FMofangbao\u002Fmidscene-pc) - PC operation device for Windows, macOS, and Linux\n* [midscene-pc-docker](https:\u002F\u002Fgithub.com\u002FMofangbao\u002Fmidscene-pc-docker) - Docker image with Midscene-PC server pre-installed\n* [Midscene-Python](https:\u002F\u002Fgithub.com\u002FPython51888\u002FMidscene-Python) - Python SDK for Midscene automation\n* [midscene-java](https:\u002F\u002Fgithub.com\u002FMaster-Frank\u002Fmidscene-java) by @Master-Frank - Java SDK for Midscene automation\n* [midscene-java](https:\u002F\u002Fgithub.com\u002Falstafeev\u002Fmidscene-java) by @alstafeev - Java SDK for Midscene automation\n\n\n## 📝 Credits\n\nWe would like to thank the following projects:\n\n- [Rsbuild](https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Frsbuild) and [Rslib](https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Frslib) for the build tool.\n- [UI-TARS](https:\u002F\u002Fgithub.com\u002Fbytedance\u002Fui-tars) for the open-source agent model UI-TARS.\n- [Qwen-VL](https:\u002F\u002Fgithub.com\u002FQwenLM\u002FQwen-VL) for the open-source VL model Qwen-VL.\n- [scrcpy](https:\u002F\u002Fgithub.com\u002FGenymobile\u002Fscrcpy) and [yume-chan](https:\u002F\u002Fgithub.com\u002Fyume-chan) allow us to control Android devices with browser.\n- [appium-adb](https:\u002F\u002Fgithub.com\u002Fappium\u002Fappium-adb) for the javascript bridge of adb.\n- [appium-webdriveragent](https:\u002F\u002Fgithub.com\u002Fappium\u002FWebDriverAgent) for the javascript operate XCTest。\n- [YADB](https:\u002F\u002Fgithub.com\u002Fysbing\u002FYADB) for the yadb tool which improves the performance of text input.\n- [libnut-core](https:\u002F\u002Fgithub.com\u002Fnut-tree\u002Flibnut-core) for the cross-platform native keyboard and mouse control.\n- [Puppeteer](https:\u002F\u002Fgithub.com\u002Fpuppeteer\u002Fpuppeteer) for browser automation and control.\n- [Playwright](https:\u002F\u002Fgithub.com\u002Fmicrosoft\u002Fplaywright) for browser automation and control and testing.\n\n## 📖 Citation\n\nIf you use Midscene.js in your research or project, please cite:\n\n```bibtex\n@software{Midscene.js,\n  author = {Xiao Zhou, Tao Yu, YiBing Lin},\n  title = {Midscene.js: Your AI Operator for Web, Android, iOS, Automation & Testing.},\n  year = {2025},\n  publisher = {GitHub},\n  url = {https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene}\n}\n```\n\n## ✨ Star History\n\n[![Star History Chart](https:\u002F\u002Fapi.star-history.com\u002Fsvg?repos=web-infra-dev\u002Fmidscene&type=Date)](https:\u002F\u002Fwww.star-history.com\u002F#web-infra-dev\u002Fmidscene&Date)\n\n\n## 📝 License\n\nMidscene.js is [MIT licensed](https:\u002F\u002Fgithub.com\u002Fweb-infra-dev\u002Fmidscene\u002Fblob\u002Fmain\u002FLICENSE).\n\n---\n\n\u003Cdiv align=\"center\">\n  If this project helps you or inspires you, please give us a star\n\u003C\u002Fdiv>\n","Midscene.js 是一个基于AI视觉驱动的跨平台UI自动化工具。它利用先进的计算机视觉和自然语言处理技术，能够根据用户提供的描述自动规划并执行UI操作。项目采用TypeScript编写，支持网页、iOS、Android等多种平台，并且集成了OpenClaw以实现更广泛的控制能力。其核心特点包括使用自然语言编写测试脚本以及强大的图像识别功能。适用于需要进行复杂UI交互测试或自动化任务的各种场景，如Web应用注册流程验证、移动应用功能测试等。",2,"2026-06-11 03:45:32","high_star"]