[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-7935":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":9,"language":10,"languages":9,"totalLinesOfCode":9,"stars":11,"forks":12,"watchers":13,"openIssues":14,"contributorsCount":15,"subscribersCount":15,"size":15,"stars1d":15,"stars7d":16,"stars30d":17,"stars90d":15,"forks30d":15,"starsTrendScore":18,"compositeScore":19,"rankGlobal":9,"rankLanguage":9,"license":9,"archived":20,"fork":20,"defaultBranch":21,"hasWiki":22,"hasPages":20,"topics":23,"createdAt":9,"pushedAt":9,"updatedAt":24,"readmeContent":25,"aiSummary":26,"trendingCount":15,"starSnapshotCount":15,"syncStatus":18,"lastSyncTime":27,"discoverSource":28},7935,"CeWL","digininja\u002FCeWL","digininja","CeWL is a Custom Word List Generator",null,"Ruby",2718,316,39,4,0,9,46,2,29.5,false,"master",true,[],"2026-06-12 02:01:46","# CeWL - Custom Word List generator\n\nCopyright(c) 2026, Robin Wood \u003Crobin@digi.ninja>\n\nBased on a discussion on PaulDotCom (episode 129) about creating custom word lists by spidering a target's website and collecting unique words, I decided to write CeWL, the Custom Word List generator. CeWL is a Ruby app which spiders a given URL to a specified depth, optionally following external links, and returns a list of words which can then be used for password crackers such as John the Ripper.\n\nBy default, CeWL sticks to just the site you have specified and will go to a depth of 2 links; this behaviour can be changed by passing arguments. Be careful if setting a large depth and allowing it to go offsite, as you could end up drifting on to a lot of other domains. All words of three characters and over are output to stdout. 
This length can be increased and the words can be written to a file rather than the screen so the app can be automated.\n\nCeWL also has an associated command line app, FAB (Files Already Bagged), which uses the same meta data extraction techniques to create author\u002Fcreator lists from already downloaded files.\n\nFor anyone running CeWL with Ruby 2.7, you might get some warnings in the style:\n\n```\n...\u002Fruby-2.7.0\u002Fgems\u002Fmime-types-3.2.2\u002Flib\u002Fmime\u002Ftypes\u002Flogger.rb:30: warning: `_1' is reserved for numbered parameter; consider another name\n```\nThis is due to a new feature introduced in 2.7 which conflicts with one line of code in the logger script from the mime-types gem. There is an update for it in the [gem's repo](https:\u002F\u002Fgithub.com\u002Fmime-types\u002Fruby-mime-types\u002Fcommit\u002Fc44673179d24e495e5fb93282a87d37f09925d25#diff-f0a644249326afd54e7a0b90c807f8a6) so hopefully that will be released soon. Till then, as far as I can tell, the warning does not affect CeWL in any way. If, for aesthetics, you want to hide the warning, you can run the script as follows:\n\n```\nruby -W0 .\u002Fcewl.rb\n```\n\nHomepage: \u003Chttps:\u002F\u002Fdigi.ninja\u002Fprojects\u002Fcewl.php>\n\nGitHub: \u003Chttps:\u002F\u002Fgithub.com\u002Fdigininja\u002FCeWL>\n\n## Pronunciation\n\nSeeing as I was asked, CeWL is pronounced \"cool\".\n\n## Installation\n\nCeWL needs the following gems to be installed:\n\n* mime\n* mime-types\n* mini_exiftool\n* nokogiri\n* public_suffix\n* rubyzip\n* spider\n* getoptlong\n\nThe easiest way to install these gems is with Bundler:\n\n```\ngem install bundler\nbundle install\n```\n\nAlternatively, you can install them manually with:\n\n```\ngem install xxx\n```\n\nThe `mini_exiftool` gem also requires the exiftool application to be installed.\n\n## Installing CeWL\n\n1. **Clone the repository:**\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002Fdigininja\u002FCeWL.git\ncd CeWL\n```\n2. 
**Install dependencies:**\n```bash\nbundle install\n```\n3. **Make the script executable:**\n```bash\nchmod u+x .\u002Fcewl.rb\n```\n4. **Make CeWL available system-wide (optional):**\n\nCreate a symlink\n```bash\nsudo ln -s $(pwd)\u002Fcewl.rb \u002Fusr\u002Flocal\u002Fbin\u002Fcewl\n```\n5. **Verify Installation**\nRun the following checks to ensure CeWL is working correctly:\n\n```bash\n# Check if the script runs (should show version and usage)\n.\u002Fcewl.rb --version\n# If you created a symlink or added to PATH, test the cewl command\ncewl --version\n```\n\nThe project page on my site gives some tips on solving common problems people\nhave encountered while running CeWL - https:\u002F\u002Fdigi.ninja\u002Fprojects\u002Fcewl.php\n\n## Usage\n\n```\n.\u002Fcewl.rb --help\n# OR\ncewl --help\n\nCeWL 6.2.1 (More Fixes) Robin Wood (robin@digi.ninja) (https:\u002F\u002Fdigi.ninja\u002F)\nUsage: cewl [OPTIONS] ... \u003Curl>\n\n    OPTIONS:\n        -h, --help: Show help.\n        -k, --keep: Keep the downloaded file.\n        -d \u003Cx>,--depth \u003Cx>: Depth to spider to, default 2.\n        -m, --min_word_length: Minimum word length, default 3.\n        -x, --max_word_length: Maximum word length, default unset.\n        -o, --offsite: Let the spider visit other sites.\n        --exclude: A file containing a list of paths to exclude\n        --allowed: A regex pattern that path must match to be followed\n        -w, --write: Write the output to the file.\n        -u, --ua \u003Cagent>: User agent to send.\n        -n, --no-words: Don't output the wordlist.\n        -g \u003Cx>, --groups \u003Cx>: Return groups of words as well\n        --lowercase: Lowercase all parsed words\n        --with-numbers: Accept words with numbers in as well as just letters\n        --convert-umlauts: Convert common ISO-8859-1 (Latin-1) umlauts (ä-ae, ö-oe, ü-ue, ß-ss)\n        -a, --meta: include meta data.\n        --meta_file file: Output file for meta data.\n        -e, --email: 
Include email addresses.\n        --email_file \u003Cfile>: Output file for email addresses.\n        --meta-temp-dir \u003Cdir>: The temporary directory used by exiftool when parsing files, default \u002Ftmp.\n        -c, --count: Show the count for each word found.\n        -v, --verbose: Verbose.\n        --debug: Extra debug information.\n\n        Authentication\n        --auth_type: Digest or basic.\n        --auth_user: Authentication username.\n        --auth_pass: Authentication password.\n\n        Proxy Support\n        --proxy_host: Proxy host.\n        --proxy_port: Proxy port, default 8080.\n        --proxy_username: Username for proxy, if required.\n        --proxy_password: Password for proxy, if required.\n\n        Headers\n        --header, -H: In format name:value - can pass multiple.\n\n        URL Structure Capture\n        --capture-paths: Add URL path components to the wordlist.\n        --capture-subdomains: Add subdomain components to the wordlist.\n        --capture-domain: Add the main domain to the wordlist.\n        --capture-url-structure: Capture domain, paths, and subdomains (all URL structure).\n\n    \u003Curl>: The site to spider.\n```\n\n### Running CeWL in a Docker container\n\nTo quickly use CeWL with Docker, you can use the official `ghcr.io\u002Fdigininja\u002Fcewl` image:\n\n```sh\ndocker run -it --rm -v \"${PWD}:\u002Fhost\" ghcr.io\u002Fdigininja\u002Fcewl [OPTIONS] ... \u003Curl>\n```\n\nYou can also build it locally:\n```sh\ndocker build -t cewl .\ndocker run -it --rm -v \"${PWD}:\u002Fhost\" cewl [OPTIONS] ... 
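\u003Curl>\n```\n\nIf you want to see this in action, check out my video [Running CeWL in Docker on Kali](https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=8UeKigZdHdE).\n\n## Licence\n\nThis project is released under the Creative Commons Attribution-Share Alike 2.0 UK: England & Wales licence.\n\n\u003Chttp:\u002F\u002Fcreativecommons.org\u002Flicenses\u002Fby-sa\u002F2.0\u002Fuk\u002F>\n\nAlternatively, you can use GPL-3+ instead of the original license.\n\n\u003Chttp:\u002F\u002Fopensource.org\u002Flicenses\u002FGPL-3.0>\n","CeWL is a custom word list generator, mainly used to spider a target website and collect unique words to build the dictionaries needed by password crackers such as John the Ripper. Its core functionality includes spidering page content to a specified depth, optionally following external links, and outputting all words of three or more characters; by default it stays on the specified site and goes to a depth of two links, and all of these parameters can be adjusted through command-line options. CeWL also provides a command-line application called FAB, which extracts metadata from already downloaded files to create author\u002Fcreator lists. It is suited to security testing scenarios that require targeted password guessing against a specific website or organisation.","2026-06-11 03:15:08","top_language"]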