Skip to content

A Different Kind of Attack Surface

AI systems introduce vulnerabilities that traditional security testing was never designed to find. Here's what makes AI pen testing fundamentally different.

Prompt Injection (OWASP LLM01)

The most prevalent and underestimated vulnerability in AI systems. Malicious instructions embedded in user inputs, documents, or data sources can override your AI's behaviour — causing it to leak data, bypass controls, or take unauthorised actions. In healthcare, these inputs can come from patient records, clinical documents, or external APIs. We test every input pathway.

Adversarial Inputs & Model Manipulation (MITRE ATLAS AML.T0043)

Carefully crafted inputs can cause AI models to produce incorrect outputs — misclassifying medical images, generating dangerous recommendations, or failing to detect critical conditions. For SaMD and clinical AI, this is a patient safety risk. We test model robustness against adversarial examples relevant to your specific use case.

Data Exfiltration via AI (OWASP LLM06)

AI agents with access to patient records, clinical databases, or sensitive operational data can be manipulated into exfiltrating that data — through carefully crafted prompts that cause the agent to include sensitive information in outputs. We test whether your AI can be used as an exfiltration vector, and whether your output filtering catches it.

Supply Chain & Third-Party Risk (OWASP LLM03)

Your AI system depends on LLM providers, embedding models, and third-party tools. We assess the security posture of your AI supply chain — including whether your LLM provider, tracing tools (LangSmith, Portkey), and external connectors introduce risks. MITRE ATLAS documents supply chain compromise (AML.T0010) as a primary attack vector.

CONTENTS

OWASP LLM TOP 10
MITRE ATLAS
NCSC GUIDELINES
RED TEAMING

OWASP LLM Top 10 Testing

We test against all 10 OWASP LLM vulnerabilities with healthcare-specific scenarios. LLM01 (prompt injection), LLM02 (insecure output handling), LLM06 (sensitive information disclosure), and LLM08 (excessive agency) are the highest priority for clinical AI deployments. Every finding is mapped to its OWASP LLM reference for clear, auditable reporting.

MITRE ATLAS Adversarial ML

MITRE ATLAS is the adversarial threat landscape for AI systems — the AI equivalent of the MITRE ATT&CK framework. We use ATLAS technique IDs to structure our testing, ensuring comprehensive coverage of adversarial ML attack patterns including model evasion, data poisoning, model extraction, and supply chain attacks specific to your AI architecture.

NCSC AI Security Principles

The NCSC's guidelines for secure AI system development (co-signed by CISA, NSA, and 16 national cybersecurity agencies) provide a government-backed framework for AI security assessment. We test against all four NCSC principles: secure design, secure development, secure deployment, and secure operation and maintenance — with NHS-specific context throughout.

AI Red Teaming

Beyond structured testing, our AI red team adopts an attacker's mindset — attempting to find novel attack paths specific to your deployment. This includes creative prompt injection scenarios, chained attacks across multiple AI components, and healthcare-specific threat scenarios (malicious patient records, compromised clinical data sources). Findings not covered by existing frameworks are documented as novel vulnerabilities.

Why Choose Our Approach?

AI-SPECIFIC TESTING

We test LLM vulnerabilities that traditional pen testers don't cover — prompt injection, adversarial inputs, model manipulation, and AI supply chain attacks.

OWASP LLM TOP 10

Every finding mapped to OWASP LLM Top 10 and MITRE ATLAS technique IDs. Clear, consistent reporting that your security team and auditors can use.

HEALTHCARE CONTEXT

We test against healthcare-specific threat scenarios — malicious patient records, compromised clinical data sources, and NHS-specific attack patterns.

RETEST INCLUDED

Once you've remediated findings, we retest and provide written confirmation. The evidence your MHRA technical file or DTAC submission needs.

Frequently Asked Questions

Do I need AI pen testing if I already do annual pen tests? minus-icon

Yes. Traditional CREST penetration testing covers your network, applications, and infrastructure — it doesn't test LLM-specific vulnerabilities. Prompt injection, adversarial inputs, and AI supply chain attacks require specialist testing that most pen test firms are not equipped to perform. For healthcare AI, both are required.

Who regulates AI security testing in healthcare? plus-icon
What does a Periculo AI pen test report look like? plus-icon
Can Periculo test our AI before we go live? plus-icon

Latest Insights

What is NHS DTAC? Digital Technology Assessment Criteria — A Complete Guide

What is NHS DTAC? Digital Technology Assessme...

Digital health technology is transforming how care is delivered across the NHS. From AI-powered diagnostics to remote pa...

What is DCB0160? The NHS Clinical Safety Standard for Deploying Health IT Systems

What is DCB0160? The NHS Clinical Safety Stan...

Digital systems are now at the heart of how NHS care is delivered. Electronic patient records, clinical decision support...

What is DSPT? A Guide for Digital Health Companies

What is DSPT? A Guide for Digital Health Comp...

If you are building or scaling a digital health product in the UK, the NHS Data Security and Protection Toolkit — univer...

What the Five Eyes Agentic AI Guidance Actually Means for Your Organisation

What the Five Eyes Agentic AI Guidance Actual...

The cybersecurity agencies of the United States, United Kingdom, Australia, Canada, and New Zealand published their firs...

40% of AI Projects Predicted to Fail

40% of AI Projects Predicted to Fail

Over 40% of agentic AI projects will be cancelled by the end of 2027. If that number feels high, the reasons why are eve...

DPRK's AI-Driven npm Malware Surge: Fake Firms, RATs, and Supply Chain Threats Uncovered

DPRK's AI-Driven npm Malware Surge: Fake Firm...

The software supply chain remains the backbone of modern application development—and an increasingly lucrative target fo...

Weekly Round Up Issue 17

Weekly Round Up Issue 17

It has been a significant week for anyone supplying digital products or services to the NHS. The headlines are political...

Securing Agentic AI: Navigating Emerging Enterprise Security Risks of Autonomous AI Agents

Securing Agentic AI: Navigating Emerging Ente...

The Rise of Agentic AI in the Enterprise Enterprises are rapidly adopting agentic AI—autonomous systems capable of execu...