[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-73612":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":11,"languages":10,"totalLinesOfCode":10,"stars":12,"forks":13,"watchers":14,"openIssues":15,"contributorsCount":16,"subscribersCount":16,"size":16,"stars1d":17,"stars7d":17,"stars30d":18,"stars90d":16,"forks30d":16,"starsTrendScore":15,"compositeScore":19,"rankGlobal":10,"rankLanguage":10,"license":20,"archived":21,"fork":22,"defaultBranch":23,"hasWiki":22,"hasPages":22,"topics":24,"createdAt":10,"pushedAt":10,"updatedAt":38,"readmeContent":39,"aiSummary":40,"trendingCount":16,"starSnapshotCount":16,"syncStatus":41,"lastSyncTime":42,"discoverSource":43},73612,"Scraperr","jaypyles\u002FScraperr","jaypyles","Self-hosted webscraper.","https:\u002F\u002Fscraperr-docs.pages.dev\u002F",null,"TypeScript",4896,241,9,3,0,1,5,62.15,"MIT License",true,false,"master",[25,26,27,28,29,30,31,32,33,34,35,36,37],"docker","helm","kubernetes","opensource","playwright","python","scraping","self-hosted","web-scraper","web-scrapers","web-scraping","webscraper","webscraping","2026-06-12 04:01:10","\u003Cdiv align=\"center\">\n  \u003Cimg src=\"https:\u002F\u002Fgithub.com\u002Fjaypyles\u002Fwww-scrape\u002Fblob\u002Fmaster\u002Fdocs\u002Flogo_picture.png\" alt=\"Scraperr Logo\" width=\"250px\">\n  \n  **A powerful self-hosted web scraping solution**\n  \n  \u003Cdiv>\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FMongoDB-%234ea94b.svg?style=for-the-badge&logo=mongodb&logoColor=white\" alt=\"MongoDB\" \u002F>\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FFastAPI-005571?style=for-the-badge&logo=fastapi\" alt=\"FastAPI\" \u002F>\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FNext-black?style=for-the-badge&logo=next.js&logoColor=white\" alt=\"Next JS\" \u002F>\n    \u003Cimg src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002Ftailwindcss-%2338B2AC.svg?style=for-the-badge&logo=tailwind-css&logoColor=white\" alt=\"TailwindCSS\" \u002F>\n  \u003C\u002Fdiv>\n\u003C\u002Fdiv>\n\n## 📋 Overview\n\nScrape websites without writing a single line of code.\n\n> 📚 **[Check out the docs](https:\u002F\u002Fscraperr-docs.pages.dev)** for a comprehensive quickstart guide and detailed information.\n\n\u003Cdiv align=\"center\">\n  \u003Cimg src=\"https:\u002F\u002Fgithub.com\u002Fjaypyles\u002Fwww-scrape\u002Fblob\u002Fmaster\u002Fdocs\u002Fmain_page.png\" alt=\"Scraperr Main Interface\" width=\"800px\">\n\u003C\u002Fdiv>\n\n## ✨ Key Features\n\n- **XPath-Based Extraction**: Precisely target page elements\n- **Queue Management**: Submit and manage multiple scraping jobs\n- **Domain Spidering**: Option to scrape all pages within the same domain\n- **Custom Headers**: Add JSON headers to your scraping requests\n- **Media Downloads**: Automatically download images, videos, and other media\n- **Results Visualization**: View scraped data in a structured table format\n- **Data Export**: Export your results in markdown and csv formats\n- **Notifcation Channels**: Send completion notifcations, through various channels\n\n## 🚀 Getting Started\n\n### Docker\n\n```bash\nmake up\n```\n\n### Helm\n\n> Refer to the docs for helm deployment: https:\u002F\u002Fscraperr-docs.pages.dev\u002Fguides\u002Fhelm-deployment\n\n## ⚖️ Legal and Ethical Guidelines\n\nWhen using Scraperr, please remember to:\n\n1. **Respect `robots.txt`**: Always check a website's `robots.txt` file to verify which pages permit scraping\n2. **Terms of Service**: Adhere to each website's Terms of Service regarding data extraction\n3. **Rate Limiting**: Implement reasonable delays between requests to avoid overloading servers\n\n> **Disclaimer**: Scraperr is intended for use only on websites that explicitly permit scraping. The creator accepts no responsibility for misuse of this tool.\n\n## 💬 Join the Community\n\nGet support, report bugs, and chat with other users and contributors.\n\n👉 [Join the Scraperr Discord](https:\u002F\u002Fdiscord.gg\u002F89q7scsGEK)\n\n## 📄 License\n\nThis project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.\n\n## 👏 Contributions\n\nDevelopment made easier with the [webapp template](https:\u002F\u002Fgithub.com\u002Fjaypyles\u002Fwebapp-template).\n\nTo get started, simply run `make build up-dev`.","Scraperr 是一个强大的自托管网页抓取解决方案。它支持XPath精准定位页面元素、队列管理多个抓取任务、域名爬虫功能以抓取同一域名下的所有页面，以及添加自定义请求头等功能。此外，Scraperr还提供了媒体下载、结果可视化展示和多种格式的数据导出选项，并且可以通过不同渠道发送任务完成通知。此项目使用TypeScript编写，基于Docker、Helm等技术栈部署，适用于需要从网站上自动提取信息而不希望手动编写代码的场景。无论是个人用户还是企业级应用，只要遵守相关网站的服务条款及法律规范，都可以利用Scraperr来简化数据采集流程。",2,"2026-06-11 03:46:24","high_star"]