A curated list of awesome open-source tools, resources, and tutorials for MLSecOps (Machine Learning Security Operations).
- Open Source Security Tools
- ML code security
- Attack Vectors
- Blogs and Publications
- Community Resources
- Contributions
- ModelScan - Protection Against ML Model Serialization Attacks.
- NB Defense - Secure Jupyter Notebooks.
- Garak - LLM vulnerability scanner.
- Adversarial Robustness Toolbox - A library of defense methods for machine learning models against adversarial attacks.
- MLSploit - MLsploit is a cloud framework for interactive experimentation with adversarial machine learning research.
- TensorFlow Privacy - A library of privacy-preserving machine learning algorithms and tools.
- Foolbox - A Python toolbox for creating and evaluating adversarial attacks and defenses.
- Advertorch - A Python toolbox for adversarial robustness research.
- Artificial Intelligence Threat Matrix - A framework for identifying and mitigating threats to machine learning systems.
- Adversarial ML Threat Matrix - Adversarial Threat Landscape for AI Systems.
- CleverHans - A library of adversarial examples and defenses for machine learning models.
- AdvBox - Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow.
- Audit AI - Bias Testing for Generalized Machine Learning Applications.
- Deep Pwning - Deep-pwning is a lightweight framework for experimenting with machine learning models with the goal of evaluating their robustness against a motivated adversary.
- Privacy Meter - An open-source library to audit data privacy in statistical and machine learning algorithms.
- TensorFlow Model Analysis - A library for analyzing, validating, and monitoring machine learning models in production.
- PromptInject - A framework that assembles adversarial prompts.
- TextAttack - TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP.
- OpenAttack - An Open-Source Package for Textual Adversarial Attack.
- TextFooler - A Model for Natural Language Attack on Text Classification and Inference.
- Flawed Machine Learning Security - Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the machine learning model lifecycle from training, to packaging, to deployment.
- Adversarial Machine Learning CTF - This repository is a CTF challenge, showing a security flaw in most (all?) common artificial neural networks. They are vulnerable for adversarial images.
- Damn Vulnerable LLM Project - A Large Language Model designed for getting hacked
- Gandalf Lakera - Prompt Injection CTF playground
- Vigil - LLM prompt injection and security scanner
- PALLMs (Payloads for Attacking Large Language Models) - list of various payloads for attacking LLMs collected in one place
- AI-exploits - exploits for MlOps systems. It's not just in the inputs given to LLMs such as ChatGPT
- Offensive ML Playbook - Offensive ML Playbook. Notes on machine learning attacks and pentesting.
- AnonLLM - Anonymize Personally Identifiable Information (PII) for Large Language Model APIs.
- AI Goat - vulnerable LLM CTF challenges.
- lintML - security linter for ML, by Nvidia
- HiddenLayer: Model as Code - research about some vectors in ml libraries.
- Copycat CNN - proof-of-concept on how to generate a copy of a Convolutional Neural Network by querying it as a black-box with random data and using the output to train a copycat CNN which mimics the target CNN's predictive patterns.
- Red-Teaming Large Language Models
- Google's AI red-team
- The MLSecOps Top 10 vulnerabilities
- Token Smuggling Jailbreak via Adversarial Prompt
- Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks
- We need a new way to measure AI security
- PrivacyRaven: Implementing a proof of concept for model inversion
- Adversarial Prompts Engineering
- TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP
- Trail Of Bits' audit of Hugging Face's safetensors library
- OWASP Top 10 for Large Language Model Applications
- LLM Security
- Is you MLOps infrastructure leaking secrets?
- MLSecOps
- MLSecOps Podcast
- Awesome LLM Security
- Hackstery
- PWNAI
- HUNTR Discord community
- AIRSK
- AI Vulnerability Database
- Incident AI Database
- Defcon AI Villiage CTF
All contributions to this list are welcome!