CeWL (Custom Word List generator) is a command-line tool that spiders a website and builds a custom wordlist from the words it finds. Security professionals and penetration testers use it to gather candidate usernames, passwords, and other target-specific strings from an organization's web presence. Starting from a given URL, it follows links to a configurable depth and extracts words from the pages it retrieves; with the appropriate options it can also collect email addresses and document metadata.
Common ways to invoke CeWL:
To spider a site and write all found words to a file:
cewl -w <file> <url>
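A written wordlist often contains the same word in different letter cases. The sketch below shows one way to lowercase and de-duplicate such a file; the function name normalize_wordlist is hypothetical, and the helper assumes one word per line.

```shell
# Hypothetical helper (not part of CeWL): lowercase a wordlist and
# remove duplicate entries. Assumes one word per line in the input file.
normalize_wordlist() {
  local file="$1"
  # Lowercase every character, then sort and drop repeated lines.
  tr '[:upper:]' '[:lower:]' < "$file" | LC_ALL=C sort -u
}

# Usage (file names are placeholders):
#   normalize_wordlist words.txt > words.normalized.txt
```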
To spider a site and also follow links to external sites:
cewl -o <url>
To spider a site using a given user-agent string:
cewl -u <user-agent> <url>
To spider a site to a given depth, keeping only words of at least a given length:
cewl -d <depth> -m <min word length> <url>
To spider a site and include a count for each word found:
cewl -c <url>
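Counted output cannot be fed directly to tools that expect one bare word per line. The sketch below keeps words seen at least a given number of times and drops the counts. It assumes the counted output was saved to a file and that each line has the form "word, count"; check this against your CeWL version. The function name words_min_count is hypothetical.

```shell
# Hypothetical helper (not part of CeWL): keep words whose count is at
# least MIN and print them without the count.
# Assumption: each input line looks like "word, count".
words_min_count() {
  local file="$1" min="${2:-2}"
  # Split on ", "; skip lines that do not have exactly two fields.
  awk -F', ' -v min="$min" 'NF == 2 && $2 + 0 >= min + 0 { print $1 }' "$file"
}

# Usage (file names are placeholders):
#   words_min_count counted.txt 5 > frequent-words.txt
```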
To spider a site, include metadata, and write the metadata words to a separate file:
cewl -a --meta_file <file> <url>
To spider a site and store discovered email addresses in a separate file:
cewl -e --email_file <file> <url>
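Collected email addresses are a common source of candidate usernames. The sketch below takes the part before the "@" from each address, lowercases it, and removes duplicates. It assumes the email file holds one address per line; the function name email_localparts is hypothetical.

```shell
# Hypothetical helper (not part of CeWL): derive candidate usernames
# from a file of email addresses.
# Assumption: one address per line; lines without exactly one "@" are skipped.
email_localparts() {
  local file="$1"
  # Take the text before "@", lowercase it, then sort and de-duplicate.
  awk -F'@' 'NF == 2 && $1 != "" { print tolower($1) }' "$file" | LC_ALL=C sort -u
}

# Usage (file names are placeholders):
#   email_localparts emails.txt > usernames.txt
```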