This page collects resources for using the Common Crawl dataset in security research: some useful queries for attack surface enumeration, plus a simple tool that builds wordlists from a given domain.
See the accompanying blog post: https://labs.watchtowr.com/all-around-the-world-the-common-crawl-dataset/
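
To illustrate the wordlist idea, here is a minimal sketch, not the tool from this repository. It assumes you already have a list of URLs for the target (for example, exported from a Common Crawl index query) and collects the path words seen under that domain. The function name `build_wordlist` and the word-splitting rule are hypothetical choices made for this example.

```python
import re
from collections import Counter
from urllib.parse import unquote, urlsplit


def build_wordlist(urls, domain):
    """Return path words seen under `domain`, most frequent first.

    Hypothetical helper for illustration; `urls` is any iterable of
    absolute URL strings, `domain` is a lowercase name such as "example.com".
    """
    counts = Counter()
    for url in urls:
        parts = urlsplit(url)
        host = (parts.hostname or "").lower()
        # Keep only the domain itself and its subdomains.
        if host != domain and not host.endswith("." + domain):
            continue
        for segment in unquote(parts.path).split("/"):
            # Assumed rule: a word is a run of letters, digits, "_" or "-".
            for word in re.split(r"[^A-Za-z0-9_-]+", segment):
                if word:
                    counts[word.lower()] += 1
    return [word for word, _ in counts.most_common()]


if __name__ == "__main__":
    sample = [
        "https://example.com/admin/login.php",
        "https://api.example.com/admin/users",
        "https://other.org/secret",
    ]
    for word in build_wordlist(sample, "example.com"):
        print(word)
```

Ordering by frequency puts the most common path components first, which tends to be useful when the list feeds a content-discovery tool with a limited request budget.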