Every Allymet engagement runs 2,000+ automated probes across 17 test categories, mapped to the controls and requirements of six industry frameworks. This page shows exactly what is in scope, what is not, and why.
OWASP LLM: The most directly testable framework for LLM deployments. Six of the ten risks have strong automated coverage; the remaining four have partial coverage and require additional architectural review.
Agentic: Tests the cognitive layer of agentic systems: goal stability, injection resistance, and memory integrity. Infrastructure-level risks (tool sandboxing, IAM, multi-agent orchestration) require architectural review.
ATLAS: Comprehensive coverage of LLM-specific adversarial techniques. ATLAS catalogs 40+ techniques; our testing covers the subset applicable to API-based LLM assessment.
ISO 42001: Technical validation of main-body clause requirements. ISO 42001 is primarily an organizational governance standard; our testing provides the technical evidence layer that supports a broader AI Management System assessment.
NIST RMF: Strongest in the MEASURE and MANAGE functions, where automated testing directly validates risk metrics. The GOVERN function is organizational and outside technical testing scope.
EU AI Act: Mapped to GPAI and transparency obligations. For systems classified as high-risk under Annex III, additional article mappings (Art. 9-15) are available on request.
Each test category produces evidence that maps to one or more framework controls. A single scan covers all six frameworks simultaneously.
| Test Category | ISO 42001 | OWASP LLM | Agentic | NIST RMF | EU AI Act | ATLAS |
|---|---|---|---|---|---|---|
| Jailbreak Resistance | 6.1.2 | LLM01 | ASI01 | MANAGE | Art. 55 | T0054 |
| Prompt Injection | 6.1.2 | LLM01 | ASI01 | MANAGE | Art. 55 | T0051 |
| Multi-Turn Attacks | 6.1.2 | — | ASI01 | MAP | — | — |
| System Prompt | 6.1.3 | LLM07 | — | MAP | Art. 50 | T0056 |
| Supply Chain | 6.1.3 | LLM03 | ASI04 | MANAGE | — | — |
| Output Sanitization | 6.1.3 | LLM05 | — | MANAGE | — | T0048 |
| PII/PHI Protection | 8.2 | LLM02 | — | MANAGE | Art. 53 | T0057 |
| RAG Data Security | 7.2 | LLM04, LLM08 | ASI06 | MEASURE | — | — |
| Copyright Protection | 8.2 | — | — | — | Art. 53 | — |
| Factual Accuracy | 7.2 | LLM09 | — | MEASURE | Art. 50, Art. 53 | — |
| Bias and Fairness | 7.3 | — | — | MEASURE | Art. 53 | — |
| Harmful Content | 8.1 | — | — | MANAGE | Art. 55 | T0048 |
| Human Override | 8.1 | LLM06 | — | GOVERN | Art. 50 | — |
| Cost/Resource | 8.1 | LLM10 | — | GOVERN | — | — |
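For teams that want to query the mapping programmatically, the table above can be treated as a simple lookup structure. The following is a minimal sketch, not Allymet's tooling: the names `FRAMEWORKS`, `ROWS`, `controls`, `categories_for`, and `controls_covered` are hypothetical. The cell values are copied from the table, with abbreviated multi-value cells written out in full (LLM04 and LLM08; Art. 50 and Art. 53), and an empty string standing in for a cell with no mapping.

```python
# Hypothetical sketch: the category-to-control table as a lookup structure.
# Column order matches the table header.
FRAMEWORKS = ("ISO 42001", "OWASP LLM", "Agentic", "NIST RMF", "EU AI Act", "ATLAS")

# One row per test category; "" means the table shows no mapping.
ROWS = {
    "Jailbreak Resistance": ("6.1.2", "LLM01", "ASI01", "MANAGE", "Art. 55", "T0054"),
    "Prompt Injection":     ("6.1.2", "LLM01", "ASI01", "MANAGE", "Art. 55", "T0051"),
    "Multi-Turn Attacks":   ("6.1.2", "", "ASI01", "MAP", "", ""),
    "System Prompt":        ("6.1.3", "LLM07", "", "MAP", "Art. 50", "T0056"),
    "Supply Chain":         ("6.1.3", "LLM03", "ASI04", "MANAGE", "", ""),
    "Output Sanitization":  ("6.1.3", "LLM05", "", "MANAGE", "", "T0048"),
    "PII/PHI Protection":   ("8.2", "LLM02", "", "MANAGE", "Art. 53", "T0057"),
    "RAG Data Security":    ("7.2", "LLM04, LLM08", "ASI06", "MEASURE", "", ""),
    "Copyright Protection": ("8.2", "", "", "", "Art. 53", ""),
    "Factual Accuracy":     ("7.2", "LLM09", "", "MEASURE", "Art. 50, Art. 53", ""),
    "Bias and Fairness":    ("7.3", "", "", "MEASURE", "Art. 53", ""),
    "Harmful Content":      ("8.1", "", "", "MANAGE", "Art. 55", "T0048"),
    "Human Override":       ("8.1", "LLM06", "", "GOVERN", "Art. 50", ""),
    "Cost/Resource":        ("8.1", "LLM10", "", "GOVERN", "", ""),
}


def controls(category, framework):
    """Return the controls one test category maps to in one framework."""
    cell = ROWS[category][FRAMEWORKS.index(framework)]
    return [part.strip() for part in cell.split(",") if part.strip()]


def categories_for(framework, control):
    """Return the test categories (in table order) that produce evidence for a control."""
    return [cat for cat in ROWS if control in controls(cat, framework)]


def controls_covered(framework):
    """Return the sorted set of controls in a framework that any category maps to."""
    return sorted({c for cat in ROWS for c in controls(cat, framework)})


if __name__ == "__main__":
    print(categories_for("EU AI Act", "Art. 53"))
    print(controls_covered("OWASP LLM"))
```

Inverting the table this way answers the question an auditor usually starts from: given a control, which test categories supply evidence for it.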
An Allymet assessment is a point-in-time technical evaluation of an AI system's behavior via API-based probing. It provides evidence of how the system responds to adversarial inputs, mapped to the controls of each framework.
It is not a certification, a conformity assessment, or a substitute for organizational governance reviews. Framework mappings are technical indicators. Regulatory approval, policy documentation, and management system assessments require separate evaluation by qualified auditors.
Request a sample or book a 20-minute intro to walk through a specific client situation.