FrancescoPaoloL/llmSecurityDemo

OWASP LLM Security Demo

A detection tool for OWASP Top 10 LLM vulnerabilities using pattern matching, perplexity-based analysis, and session-aware scoring.

What it does

You enter a prompt, and the tool:

  1. Analyzes it for OWASP LLM vulnerabilities using pattern matching (LLM01/02/04/05/06), perplexity analysis (LLM03), dual-path scoring (LLM09), and session-aware multi-detector scoring (LLM10)
  2. Sends it to a local LLM (Qwen 0.5B) for generation
  3. Returns the detected OWASP category and LLM response
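The pattern-matching path (step 1) can be sketched as a small rule table scanned against the normalized prompt. The regexes below are illustrative placeholders, not the actual rules shipped in the C++ owasp-llm-tool:

```python
import re

# Hypothetical rules for two categories; the real tool's patterns
# and category coverage live in the C++ owasp-llm-tool fork.
RULES = {
    "LLM01": [r"ignore (all |previous )*instructions", r"you are now"],
    "LLM06": [r"delete .* files", r"send .* on my behalf"],
}

def classify(prompt: str) -> str:
    # Normalize to lowercase, then return the first matching category.
    text = prompt.lower()
    for category, patterns in RULES.items():
        if any(re.search(p, text) for p in patterns):
            return category
    return "none"

print(classify("Please ignore all previous instructions"))  # LLM01
print(classify("What is the capital of France?"))           # none
```

This keyword-rule approach is what keeps detection under 100 ms, at the cost of the false-positive risk noted in Known Limitations.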

The Plugin Demo (LLM07) section lets you run SQL queries directly against a SQLite database (Northwind) in unsafe or safe mode, demonstrating how naive keyword-based protection can be bypassed.
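A minimal sketch of the kind of bypassable check the demo illustrates; the actual safe-mode logic in the plugin may differ:

```python
# Naive keyword blacklist, mimicking an intentionally weak "safe mode".
BLOCKED = ("drop", "delete", "update", "insert")

def naive_safe(query: str) -> bool:
    # Flaw: case-sensitive substring check on the raw query text.
    return not any(word in query for word in BLOCKED)

print(naive_safe("delete FROM Customers"))  # False -- blocked as intended
print(naive_safe("DELETE FROM Customers"))  # True  -- bypassed via uppercase
```

Any filter operating on raw query text rather than parsed SQL can be evaded this way (case changes, comments, string concatenation), which is the point of the LLM07 demo.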

The Model Theft Demo (LLM10) section detects extraction attempts through session-aware analysis across three sub-detectors. Responses are blocked once the confidence score reaches 0.85.
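As a sketch, the weighted combination of the three LLM10 sub-detectors (45% / 45% / 10%, with the 0.85 cutoff) could look like the following; sub-detector scores are assumed to be normalized to [0, 1]:

```python
# Weights and threshold taken from the Detection Methods description;
# the actual implementation lives behind the Flask API's session store.
WEIGHTS = {"extraction_intent": 0.45, "query_similarity": 0.45, "rate_anomaly": 0.10}
BLOCK_THRESHOLD = 0.85

def combined_score(sub_scores: dict) -> float:
    # Weighted sum of the sub-detector scores, each expected in [0, 1].
    return sum(WEIGHTS[name] * sub_scores.get(name, 0.0) for name in WEIGHTS)

def should_block(sub_scores: dict) -> bool:
    return combined_score(sub_scores) >= BLOCK_THRESHOLD

# Sustained extraction pattern across a session trips the block:
print(should_block({"extraction_intent": 0.9, "query_similarity": 0.9, "rate_anomaly": 0.5}))  # True
# A single suspicious query alone does not:
print(should_block({"extraction_intent": 0.5, "query_similarity": 0.5, "rate_anomaly": 0.5}))  # False
```

Because the score accumulates per session (keyed by IP, with a 10-minute TTL), a lone borderline query passes while repeated probing is blocked.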

Detection accuracy: 98.6% on the test suite (70/71 tests passing) — see tests/TESTING.md

Response quality: Uses the Qwen instruction format with a system prompt for coherent, accurate answers.

Quick Start

Local (Docker Compose)

cd docker
docker-compose up -d
# Open http://localhost:3000 in your browser

Docker Hub

docker pull francescopaololezza/owasp-llm-demo:main
docker run -d -p 3000:3000 francescopaololezza/owasp-llm-demo:main
# Open http://localhost:3000 in your browser

Azure

See infra/azure/README.md for Terraform deployment.

Architecture

Browser (:3000) → Node.js → Flask API (:5000) → llama-server (:8081)
                                             → owasp-llm-tool (detection)
                                             → SQLite plugin (LLM07)
                                             → session store (LLM10)

See docs/architecture.md for details.

Detected Vulnerabilities

| Category | Name | Detection Method | Status | Accuracy |
|----------|------|------------------|--------|----------|
| LLM01 | Prompt Injection | Pattern matching | ✅ Done | 92% (11/12) |
| LLM02 | Insecure Output Handling | Pattern matching | ✅ Done | 100% (9/9) |
| LLM03 | Training Data Poisoning | Perplexity analysis | ✅ Done | 100% (6/6) |
| LLM04 | Model Denial of Service | Pattern matching | ✅ Done | 100% (1/1) |
| LLM05 | Supply Chain Vulnerabilities | Pattern matching | ✅ Done | 100% (7/7) |
| LLM06 | Excessive Agency | Pattern matching | ✅ Done | 100% (9/9) |
| LLM07 | Insecure Plugin Design | SQLite plugin + HTTP tests | ✅ Done | 100% (6/6) |
| LLM08 | Excessive Agency | - | ❌ Duplicate | - |
| LLM09 | Overreliance / Misinformation | Dual-path scoring (5 sub-detectors) | ✅ Done | 100% (9/9) |
| LLM10 | Model Theft | Session-aware scoring (3 sub-detectors) | ✅ Done | 100% (18/18) |

Overall: 9/10 categories implemented | 70/71 tests passing (98.6%)

Detection Methods:

  • Pattern matching (LLM01/02/04/05/06): keyword-based rules, <100ms
  • Perplexity analysis (LLM03): statistical anomaly detection on input prompts
    • High perplexity (>50.0) = anomalous/poisoned text
    • Example: "xzqw jumped mflkj" → perplexity 445.24 ✓ | "quick brown fox" → 3.57 ✗
    • Threshold: configurable via config/llm03_baseline.json
  • SQLite plugin (LLM07): live query execution against Northwind DB with intentionally bypassable safe mode
  • Dual-path scoring (LLM09): combines citation fraud detection and statistical hallucination signals across 5 sub-detectors (fake_citation, authority_claim, precision_abuse, hedging_absence, source_unverifiability)
  • Session-aware scoring (LLM10): IP-based session store (TTL 10 min) with 3 sub-detectors — extraction_intent (45%), query_similarity (45%), rate_anomaly (10%). Responses blocked at score ≥ 0.85
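The LLM03 perplexity check can be sketched as follows. Token log-probabilities would normally come from the local model via llama-server (here they are supplied by hand), and the 50.0 threshold mirrors the default in config/llm03_baseline.json:

```python
import math

PPL_THRESHOLD = 50.0  # configurable via config/llm03_baseline.json

def perplexity(token_logprobs: list) -> float:
    # Perplexity = exp of the mean negative log-likelihood per token.
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

def is_anomalous(token_logprobs: list) -> bool:
    return perplexity(token_logprobs) > PPL_THRESHOLD

# Fluent text: tokens the model finds likely (log-probs near 0).
print(is_anomalous([-1.0, -1.5, -0.8]))  # False (perplexity ~3)
# Gibberish like "xzqw jumped mflkj": very unlikely tokens.
print(is_anomalous([-6.0, -7.5, -5.5]))  # True (perplexity ~560)
```

Since perplexity depends on the model's token probabilities, the real check needs a model load per prompt, which is why the LLM03 tests are the slow ones in the suite.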

See tests/TESTING.md for detailed test results.

Testing

./tests/test_owasp.sh           # All categories (pattern + perplexity) — 52/53
./tests/test_owasp.sh llm03     # Specific category (loads model)
./tests/test_owasp.sh llm09     # Overreliance / misinformation
./tests/test_llm10.sh           # Model theft — requires Flask running — 18/18
./tests/test_plugin.sh          # Plugin tests — requires Flask running

Note: LLM03 tests require model loading (~20s per test); the other categories use pattern matching only. test_llm10.sh and test_plugin.sh require the Flask API to be running on port 5000.

Project Structure

├── api/          # Flask API (v0.13.0)
├── config/       # Runtime configuration (LLM03 threshold)
├── data/         # Northwind SQLite database (LLM07 demo)
├── docker/       # Docker configuration
├── docs/         # Documentation
├── frontend/     # Node.js + EJS UI (v0.12.0)
├── infra/azure/  # Terraform for Azure deployment
├── llama.cpp/    # Pre-built binaries + detection tools
└── tests/        # Test suite (71 test cases)

C++ Source Code

The owasp-llm-tool is maintained in a fork of llama.cpp:

https://github.com/FrancescoPaoloL/llama.cpp/tree/feature/owasp-llm-tool/examples/owasp-llm-tool

Features:

  • Pattern-based detection for LLM01/02/04/05/06
  • Perplexity calculation for LLM03
  • JSON config support (nlohmann/json)
  • Prompt normalization

Known Limitations

  • Pattern-based detection (LLM01/02/04/05/06): keyword matching, not ML-based
  • Perplexity detection (LLM03): requires model load, sensitive to tokenization
  • Plugin safe mode (LLM07): intentionally bypassable — demonstrates the vulnerability
  • LLM09 dual-path scoring: heuristic-based, no external source verification
  • LLM10 session store: in-memory only, resets on restart; IP-based attribution is trivially bypassable with proxies
  • 98.6% accuracy with 1 known false positive (LLM01: "ignore spam emails")
  • English only
  • Basic heuristics — this is a learning/portfolio project, not production security software

License

MIT