Useful Resources

Curated datasets, frameworks, tools, papers, and standards for AI, ML, and cybersecurity research.

Machine Learning & Open Datasets

MNIST Dataset

http://yann.lecun.com/exdb/mnist/

Benchmark dataset of 70,000 handwritten digit images (28x28 grayscale; 60,000 training, 10,000 test).
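The MNIST files are distributed in the IDX binary format: a big-endian header followed by raw unsigned bytes. Below is a minimal sketch of reading them with only the Python standard library. The function names parse_idx_images and parse_idx_labels are illustrative and not part of any library.

```python
import struct


def parse_idx_images(raw: bytes):
    """Parse IDX3 image bytes into a list of images (each a list of pixel rows)."""
    # Header: magic number, image count, rows, cols (big-endian unsigned 32-bit).
    magic, count, rows, cols = struct.unpack(">IIII", raw[:16])
    if magic != 2051:  # 0x00000803: unsigned-byte data, 3 dimensions
        raise ValueError(f"unexpected magic number for images: {magic}")
    pixels = raw[16:]
    size = rows * cols
    if len(pixels) != count * size:
        raise ValueError("payload size does not match header")
    return [
        [list(pixels[i * size + r * cols : i * size + (r + 1) * cols])
         for r in range(rows)]
        for i in range(count)
    ]


def parse_idx_labels(raw: bytes):
    """Parse IDX1 label bytes into a list of integer labels."""
    # Header: magic number and label count (big-endian unsigned 32-bit).
    magic, count = struct.unpack(">II", raw[:8])
    if magic != 2049:  # 0x00000801: unsigned-byte data, 1 dimension
        raise ValueError(f"unexpected magic number for labels: {magic}")
    labels = raw[8:]
    if len(labels) != count:
        raise ValueError("payload size does not match header")
    return list(labels)
```

To read a downloaded archive, decompress it first, for example by passing gzip.open("train-images-idx3-ubyte.gz", "rb").read() to parse_idx_images. Nested lists keep the sketch dependency-free; real workloads would normally load the bytes into an array library instead.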

CIFAR-10 / CIFAR-100

https://www.cs.toronto.edu/~kriz/cifar.html

60,000 32x32 color images in 10 classes (CIFAR-10) or 100 classes (CIFAR-100), widely used as deep learning benchmarks.

Hugging Face Open Datasets

https://huggingface.co/datasets

Large hub of community-contributed datasets covering text, vision, audio, and tabular ML tasks.

ICS / SCADA Datasets

https://www.netresec.com/?page=PCAP4SICS

PCAP captures from the 4SICS industrial control system lab. Placeholder entry until links for SWaT, HAI, BATADAL, and other industrial datasets are added.

ML Engineering & Reproducible Research

Jupyter Notebooks

https://jupyter.org/

Interactive computing environment for research.

Weights & Biases

https://wandb.ai/

Experiment tracking & model monitoring.

Keras

https://keras.io/

High-level neural network API.

PyTorch

https://pytorch.org/

Flexible deep learning framework.

TensorFlow

https://www.tensorflow.org/

Machine learning framework by Google.

Hugging Face Transformers

https://huggingface.co/docs/transformers

Library of pretrained transformer models for text, vision, and audio.

Differential Privacy & Cryptography

Opacus

https://opacus.ai/

Library for training PyTorch models with differential privacy.

TensorFlow Privacy

https://github.com/tensorflow/privacy

Differential privacy for TensorFlow.
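Both libraries above are built around DP-SGD, which clips each per-sample gradient to a fixed L2 norm and adds Gaussian noise before averaging. The sketch below shows that clip-and-noise step in plain Python. It is not the API of Opacus or TensorFlow Privacy, and the names clip_gradient, private_mean_gradient, max_norm, and noise_multiplier are illustrative assumptions. Privacy accounting (tracking epsilon) is omitted.

```python
import math
import random


def clip_gradient(grad, max_norm):
    """Scale a gradient vector so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, max_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def private_mean_gradient(per_sample_grads, max_norm, noise_multiplier, rng=None):
    """Clip each per-sample gradient, sum, add Gaussian noise, then average."""
    rng = rng or random.Random()
    clipped = [clip_gradient(g, max_norm) for g in per_sample_grads]
    dim = len(clipped[0])
    summed = [sum(g[i] for g in clipped) for i in range(dim)]
    # Noise standard deviation is proportional to the clipping bound.
    sigma = noise_multiplier * max_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]
    return [v / len(clipped) for v in noisy]
```

For example, private_mean_gradient([[3.0, 4.0], [0.0, 0.5]], max_norm=1.0, noise_multiplier=1.1) returns a noisy two-element average. Passing a seeded random.Random makes a run repeatable for debugging, though a real deployment would need a cryptographically secure noise source.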

Python Cryptography

https://cryptography.io/

Modern cryptographic primitives for Python.

Security Research & Analysis Tools

Ghidra

https://ghidra-sre.org/

Open-source software reverse engineering framework developed by the NSA.

Zeek

https://zeek.org/

Network monitoring & threat analysis.

Suricata

https://suricata.io/

Open-source network intrusion detection and prevention (IDS/IPS) engine.

Nmap

https://nmap.org/

Network scanning and enumeration.

Wireshark

https://www.wireshark.org/

Packet capture and protocol analysis.

Burp Suite Community

https://portswigger.net/burp/communitydownload

Web application penetration testing toolkit (free edition).

YARA

https://virustotal.github.io/yara/

Pattern-matching rule engine for identifying and classifying malware.

VirusTotal

https://www.virustotal.com/

Multi-engine malware scanning, sandboxing, and threat intelligence.

Standards & Frameworks

NIST SP 800 Series

https://csrc.nist.gov/publications/sp800

Guidelines on risk management, cryptography, and system security.

MITRE ATT&CK

https://attack.mitre.org/

Knowledge base of adversary tactics, techniques, and procedures (TTPs).

OWASP Top 10

https://owasp.org/www-project-top-ten/

Most critical web security risks.

OWASP Cheat Sheets

https://cheatsheetseries.owasp.org/

Secure coding guidance.

Foundational Papers

Generative Adversarial Nets

https://arxiv.org/abs/1406.2661

Original GAN paper by Goodfellow et al. (2014).

Deep Learning Book

https://www.deeplearningbook.org/

Foundational deep learning textbook by Goodfellow, Bengio, and Courville, free to read online.

Legal & Privacy

GDPR Full Text

https://gdpr-info.eu/

Full text of the GDPR in a browsable format (unofficial mirror; the authoritative text is published on EUR-Lex).

Bonus

OverTheWire Wargames

https://overthewire.org/wargames

Cybersecurity challenge platform.

Nmap Movies

https://nmap.org/movies/

Movies that feature Nmap.