CWE-776: Improper Restriction of Recursive Entity References in DTDs ('XML Entity Expansion')

Definition

What is CWE-776?

This vulnerability occurs when an XML parser allows Document Type Definitions (DTDs) to contain nested entity references without proper limits, so a small document can expand into an enormous data structure.

Attackers can craft a malicious DTD that defines XML entities in nested layers, where each entity references the one before it several times. XML forbids an entity from referencing itself, directly or indirectly, so the attack relies on deep nesting rather than a true loop. When the parser expands these entities, what looks like a small XML file explodes in memory into a massive data structure, consuming excessive CPU and memory. The result is a classic Denial-of-Service (DoS) attack, often called 'XML Entity Expansion', an 'XML bomb', or 'Billion Laughs'. To prevent it, developers should disable DTD processing entirely in their XML parsers when possible, or explicitly configure them to restrict entity expansion depth and total memory usage during parsing.

Real-world impact

Real-world CVEs caused by CWE-776

CVE-2008-3281: XEE in XML-parsing library.
CVE-2011-3288: XML bomb / XEE in enterprise communication product.
CVE-2011-1755: "Billion laughs" attack in XMPP server daemon.
CVE-2009-1955: XML bomb in web server module.
CVE-2003-1564: Parsing library allows XML bomb.

How attackers exploit it

Step-by-step attacker path

1. Identify a code path that handles untrusted input without validation.
2. Craft a payload that exercises the unsafe behavior: injection, traversal, overflow, or logic abuse.
3. Send the payload through a normal request and observe how the application reacts.
4. Iterate until the response leaks data, executes attacker code, or escalates privileges.

Vulnerable code example

Vulnerable XML

The DTD and the very brief XML document below illustrate what is meant by an XML bomb. The ZERO entity contains one character, the letter A. Each entity name gives the exponent of two that describes its expanded length, so ZERO has length 2^0 = 1. Similarly, ONE refers to ZERO twice, so the XML parser expands ONE to a length of 2, or 2^1. The chain ends at THIRTYTWO, which expands to 2^32 characters, or about 4 GB, far more than an application would expect from a document this small.

<?xml version="1.0"?>
  <!DOCTYPE MaliciousDTD [
  <!ENTITY ZERO "A">
  <!ENTITY ONE "&ZERO;&ZERO;">
  <!ENTITY TWO "&ONE;&ONE;">
  ...
  <!ENTITY THIRTYTWO "&THIRTYONE;&THIRTYONE;">
  ]>
  <data>&THIRTYTWO;</data>
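
The doubling described above can be checked with a little arithmetic. The Python sketch below builds a document of the same shape and compares its size with the size of its expansion, without ever expanding it. The generated names E0, E1, ... stand in for ZERO, ONE, ..., and both function names are assumptions made for this illustration.

```python
def build_nested_entities(levels: int) -> str:
    """Return an XML document whose entity E<levels> expands to 2**levels characters."""
    lines = ['<?xml version="1.0"?>', "<!DOCTYPE MaliciousDTD [", '<!ENTITY E0 "A">']
    for n in range(1, levels + 1):
        # Each level references the previous level twice, doubling the length.
        lines.append(f'<!ENTITY E{n} "&E{n - 1};&E{n - 1};">')
    lines.append("]>")
    lines.append(f"<data>&E{levels};</data>")
    return "\n".join(lines)


def expanded_length(levels: int) -> int:
    """Length of E<levels> after full expansion: one character doubled `levels` times."""
    return 2 ** levels


if __name__ == "__main__":
    doc = build_nested_entities(32)
    print(f"document size: {len(doc)} characters")
    print(f"expanded size: {expanded_length(32)} characters")
```

The document grows by one short line per level while the expansion doubles, which is why a size limit on the incoming request alone does not stop this attack.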

Secure code example

Secure pseudo

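// Validate, sanitize, or use a safe API before reaching the sink.
// validateAndEscape and executeWithGuards are placeholders, not real library calls.
function handleRequest(input) {
  const safe = validateAndEscape(input);  // e.g. reject documents that declare a DOCTYPE
  return executeWithGuards(safe);         // e.g. parse with entity expansion limits enabled
}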

What changed: the unsafe sink is replaced (or the input is validated/escaped) so the same payload no longer triggers the weakness.
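
For XML specifically, "using a safe API" usually means refusing DTDs before any entity is expanded. The following is a minimal Python sketch of that idea using the standard library's expat bindings. The function name `parse_text_without_dtd` and the exception `DTDForbidden` are assumptions introduced for this example, and returning only the character data is a simplification.

```python
import xml.parsers.expat


class DTDForbidden(ValueError):
    """Raised when a document declares a DTD."""


def parse_text_without_dtd(data: bytes) -> str:
    """Parse XML and return its character data, refusing any document with a DOCTYPE."""
    parser = xml.parsers.expat.ParserCreate()
    chunks = []

    def forbid_doctype(name, system_id, public_id, has_internal_subset):
        # Called when the DOCTYPE declaration starts, so parsing stops
        # before the entities inside it can be used.
        raise DTDForbidden(f"DTD not allowed (DOCTYPE {name})")

    parser.StartDoctypeDeclHandler = forbid_doctype
    parser.CharacterDataHandler = chunks.append
    parser.Parse(data, True)  # an exception raised in a handler propagates from here
    return "".join(chunks)
```

A caller would catch `DTDForbidden` and reject the request. Applications that genuinely need DTDs cannot use this approach and have to rely on parser-level expansion limits instead.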

Prevention checklist

How to prevent CWE-776

Operation: If possible, prohibit the use of DTDs or use an XML parser that limits the expansion of recursive DTD entities.
Implementation: Before parsing XML files with associated DTDs, scan for recursive entity declarations and do not continue parsing potentially explosive content.
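
The pre-parse scan can be sketched in Python as below. This is a simplified illustration, not a complete defense: the regular expressions only recognize internal general entities with double-quoted values, the scan checks declared entities rather than how often the document body references them, and the names `expanded_entity_sizes` and `check_entity_budget` and the default limits are assumptions made for this example. It computes each entity's expanded length arithmetically, so the content is never actually expanded.

```python
import re

# Simplified patterns: internal general entities with double-quoted values only.
ENTITY_DECL = re.compile(r'<!ENTITY\s+([\w.\-]+)\s+"([^"]*)"\s*>')
ENTITY_REF = re.compile(r"&([\w.\-]+);")


def expanded_entity_sizes(xml_text: str, max_depth: int = 64) -> dict:
    """Map each declared entity to its fully expanded length, without expanding it."""
    decls = {}
    for name, value in ENTITY_DECL.findall(xml_text):
        decls.setdefault(name, value)  # XML keeps the first declaration of a name
    sizes = {}

    def size_of(name, stack):
        if name in sizes:
            return sizes[name]  # memoized, so each entity is measured once
        if name in stack:
            raise ValueError(f"circular entity reference: {name}")
        if len(stack) >= max_depth:
            raise ValueError("entity nesting too deep")
        if name not in decls:
            return 1  # assumption: undeclared names are predefined entities such as &amp;
        value = decls[name]
        literal = len(ENTITY_REF.sub("", value))
        refs = ENTITY_REF.findall(value)
        total = literal + sum(size_of(ref, stack | {name}) for ref in refs)
        sizes[name] = total
        return total

    for name in decls:
        size_of(name, frozenset())
    return sizes


def check_entity_budget(xml_text: str, max_chars: int = 1_000_000) -> None:
    """Raise ValueError if any declared entity would expand past max_chars."""
    for name, size in expanded_entity_sizes(xml_text).items():
        if size > max_chars:
            raise ValueError(f"entity {name} expands to {size} characters")
```

Because a text scan like this can be evaded by declaration forms it does not recognize, it is best treated as an early filter in front of a parser that also enforces its own limits.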

Detection signals

How to detect CWE-776

Automated Static Analysis (effectiveness: High)

Automated static analysis, commonly referred to as Static Application Security Testing (SAST), can find some instances of this weakness by analyzing source code (or binary/compiled code) without having to execute it. Typically, this is done by building a model of data flow and control flow, then searching for potentially vulnerable patterns that connect "sources" (origins of input) with "sinks" (destinations where the data interacts with external components, a lower layer such as the OS, etc.).

FAQ

Frequently asked questions

What is CWE-776?

This vulnerability occurs when an XML parser allows Document Type Definitions (DTDs) to contain nested entity references without proper limits, so a small document can expand into an enormous data structure.

How severe is CWE-776?

MITRE rates the likelihood of exploitation as Medium: exploitation is realistic but usually requires specific conditions.

Which languages or platforms are affected by CWE-776?

MITRE lists the following affected platforms: XML.

How can I prevent CWE-776?

If possible, prohibit the use of DTDs or use an XML parser that limits the expansion of recursive DTD entities. Before parsing XML files with associated DTDs, scan for recursive entity declarations and do not continue parsing potentially explosive content.

How does Plexicus detect and fix CWE-776?

The Plexicus SAST engine detects the data-flow signature for CWE-776 on every commit. When there is a match, our Codex Remedium agent opens a fix PR with the corrected code, the tests, and a one-line summary for the reviewer.

Where can I learn more about CWE-776?

MITRE publishes the canonical definition at https://cwe.mitre.org/data/definitions/776.html. You can also consult the OWASP and NIST documentation for related guidance.

Related weaknesses

Weaknesses related to CWE-776

CWE-674 (Parent)

Uncontrolled Recursion

This vulnerability occurs when an application fails to limit how deeply a function can call itself. Without proper controls, this…

Resources

Stop paying per developer.
Start closing the loop.

Plexicus is the AI-native ASPM that scans, filters, fixes, pentests, and explains, all autonomously. Unlimited developers, unlimited repos, fair-use AI actions. A real free tier, then €269/mo billed annually when you are ready.
