

Large-scale web crawl dataset containing 3.5 billion web pages from CommonCrawl (2012), suitable for web mining, search, and network analysis research. Listed as part of an awesome-style collection of computer networks datasets.
Large-scale web crawl dataset containing approximately 3.5 billion web pages collected by CommonCrawl in 2012. Intended for research and experimentation in areas such as web mining, search, and network analysis. Listed within an “awesome-style” collection of computer networks datasets.