    Awesome Cybersecurity Datasets

    A curated list of amazingly awesome cybersecurity datasets maintained by Santiago H. Ramos. Features network intrusion data, malware samples, botnet traffic, and web attack payloads used by universities and researchers worldwide. Approximately 1.9k stars and 326 forks.


    Information

    Website: github.com
    Published: Mar 14, 2026

    Categories

    1 Item
    Datasets

    Tags

    3 Items
    #cybersecurity #datasets #research

    Similar Products

    6 result(s)

    Awesome Data - Biology Datasets (Meta)

    A curated Awesome-style collection of biological and genomics datasets, including ENCODE, EMPIAR, Ensembl Genomes, GEO, Gene Ontology, GloBI, LINCS, HGDP, HMP, ICOS PSP Benchmark, HapMap, JCB DataViewer (via BioStudies), and KEGG. Each entry links out to the primary dataset resource along with a corresponding YAML metadata file in the awesomedata/apd-core GitHub repository, making this part of a larger meta collection of Awesome data directories.

    Featured

    Awesome Public Datasets - Economics Collection

    A curated subset of the Awesome Public Datasets meta-collection, focusing on economics-related data sources such as macroeconomic indicators, trade statistics, productivity, corporate registries, and long-run historical series. This portion of the awesome list aggregates high‑quality, openly accessible economics datasets useful for research, data science, and policy analysis.

    Featured

    Awesome Public Datasets - Energy

    A curated Awesome-style subdirectory under the Awesome Public Datasets project focusing on Energy-related datasets (e.g., AMPds, BLUEd, COMBED, DBFC, ECO, Global Power Plant Database). It aggregates and links to high-quality, structured energy datasets useful for research and data science.

    Featured

    Awesome Cybersecurity List

    A personal collection of awesome blog posts, write-ups, and papers focusing on cybersecurity, with dedicated sections for tools, techniques, and educational resources.

    Awesome Data – Social Sciences

    A curated subset of the Awesome Data project focused on social sciences datasets, including political conflict, legal information, surveys, religion, and violence data. The listed resources (e.g., ACLED, Correlates of War, GDELT, General Social Survey) are part of a broader awesome-style meta collection of high-quality open datasets for researchers and practitioners.

    Awesome Public Datasets (APD) - eSports: OpenDota Data Dump

    An eSports section item in the Awesome Public Datasets (APD) repository that describes and links to the OpenDota data dump, making this large Dota 2 dataset discoverable via an awesome-style meta directory of public datasets.

    Overview

    Awesome Cybersecurity Datasets is a GitHub repository maintained by Santiago H. Ramos that curates datasets for cybersecurity research and education.

    Repository Statistics

    • Stars: ~1,900
    • Forks: 326
    • Status: Actively maintained
    • Community: Used by universities, private industry, and independent researchers worldwide

    Featured Datasets

    Network Security Datasets

    Unified Host and Network Dataset

    • Source: Los Alamos National Laboratory
    • Coverage: ~90 days of enterprise network data
    • Contains: Network event data and host (computer) event data
    • Use case: Insider threat detection, lateral movement analysis

    Canadian Institute for Cybersecurity Datasets

    • Widely used globally by researchers
    • Multiple datasets covering different attack scenarios
    • Well-documented and maintained

    KDD Cup 1999 Data

    • Classic benchmark dataset
    • Simulated military network environment
    • Wide variety of intrusion types
    • Standard for comparing IDS performance
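
    As a minimal sketch of how this benchmark is commonly prepared for IDS comparisons, the snippet below splits a comma-separated record into features and label and collapses the attack names into a two-class target. It assumes the usual layout of 41 features followed by one label that ends with a period; the helper names (parse_record, to_binary) and the sample rows are hypothetical and are not copied from the dataset.

```python
from collections import Counter

NUM_FEATURES = 41  # assumed layout: 41 features followed by one label field


def parse_record(line: str) -> tuple[list[str], str]:
    """Split one comma-separated record into (features, label)."""
    fields = line.strip().split(",")
    if len(fields) != NUM_FEATURES + 1:
        raise ValueError(f"expected {NUM_FEATURES + 1} fields, got {len(fields)}")
    # Labels in the raw files are assumed to end with a period, e.g. "normal."
    label = fields[-1].rstrip(".")
    return fields[:-1], label


def to_binary(label: str) -> str:
    """Collapse specific attack names into a two-class target."""
    return "normal" if label == "normal" else "attack"


# Synthetic records for illustration only; not rows taken from the dataset.
sample = [
    ",".join(["0", "tcp", "http", "SF"] + ["0"] * 37 + ["normal."]),
    ",".join(["0", "icmp", "ecr_i", "SF"] + ["0"] * 37 + ["smurf."]),
]
counts = Counter(to_binary(parse_record(row)[1]) for row in sample)
print(counts)
```

    Keeping the original attack name alongside the binary target makes it possible to report per-attack results later without re-reading the file.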

    CTU-13 Dataset

    • Labeled botnet traffic
    • Includes normal and background traffic
    • Real-world botnet captures
    • Multiple botnet families represented
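
    Since the dataset mixes botnet, normal, and background traffic, a first step is often to reduce each flow label to one of those three classes. The sketch below does this by substring matching. It assumes labels are free-text strings that contain the word "Botnet" or "Normal", with everything else treated as background; the function name and the example label strings are hypothetical, so check them against the label format of the files you download.

```python
def coarse_class(label: str) -> str:
    """Map a free-text flow label to 'botnet', 'normal', or 'background'."""
    text = label.lower()
    if "botnet" in text:
        return "botnet"
    if "normal" in text:
        return "normal"
    # Anything that is neither botnet nor normal is treated as background.
    return "background"


# Hypothetical label strings, written only to exercise the function.
examples = [
    "flow=From-Botnet-TCP-Established",
    "flow=From-Normal-HTTP",
    "flow=Background-UDP-Established",
]
for raw in examples:
    print(raw, "->", coarse_class(raw))
```

    Checking for "botnet" before "normal" is a deliberate choice: if a label ever contained both words, the flow would be kept in the malicious class rather than silently counted as benign.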

    Specialized Datasets

    • JavaScript Vulnerability Datasets: For web application security research
    • Web Attack Payloads: Common attack patterns and exploits
    • Machine Learning Firewall Data: Training data for ML-driven WAFs
    • Malware Samples: Various malware families and variants
    • Phishing Datasets: Email and URL phishing examples

    Use Cases

    • Academic Research: Training ML models, publishing research papers
    • Security Tool Development: Building and testing IDS/IPS systems
    • Education: Teaching cybersecurity concepts and techniques
    • Benchmarking: Comparing detection algorithm performance
    • Threat Intelligence: Understanding attack patterns and TTPs
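
    For the benchmarking use case, detectors are usually compared on the same labeled data with a few standard metrics. The sketch below computes precision, recall (detection rate), false positive rate, and F1 from binary labels, where 1 means attack and 0 means benign. The function names and the toy labels are illustrative assumptions, not part of the repository.

```python
def confusion_counts(y_true: list[int], y_pred: list[int]) -> tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn) for binary labels where 1 means attack."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def ratio(numerator: int, denominator: int) -> float:
    """Divide safely, returning 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def detection_metrics(y_true: list[int], y_pred: list[int]) -> dict[str, float]:
    """Compute precision, recall, false positive rate, and F1."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    return {
        "precision": ratio(tp, tp + fp),
        "recall": ratio(tp, tp + fn),
        "false_positive_rate": ratio(fp, fp + tn),
        "f1": ratio(2 * tp, 2 * tp + fp + fn),
    }


# Toy labels for illustration only.
truth = [1, 1, 1, 0, 0, 0, 0, 0]
preds = [1, 1, 0, 1, 0, 0, 0, 0]
print(detection_metrics(truth, preds))
```

    Reporting the false positive rate next to recall matters for intrusion detection because benign traffic usually far outnumbers attacks, so even a small rate can produce many alerts.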

    Dataset Categories

    • Network intrusion detection
    • Malware analysis
    • Web application security
    • Botnet detection
    • Insider threat detection
    • DDoS attack analysis
    • Phishing and social engineering

    Community Contributions

    The repository accepts pull requests that add new datasets to the collection.

    Target Audience

    Cybersecurity researchers, data scientists, university professors and students, security tool developers, and SOC analysts.

    Pricing

    The collection is free and open source. Individual datasets may have their own licensing terms.