CWE-625 Base Draft

Permissive Regular Expression

This weakness occurs when a regular expression is too permissive, failing to properly validate or sanitize input by allowing unintended values or patterns.

Definition

What is CWE-625?

A permissive regex often arises from forgetting to anchor the pattern to the start (^) and end ($) of the input string. Without anchors, the engine reports a partial match: the system accepts any input that contains a substring fitting the pattern, rather than validating the entire input. For example, the pattern \d{5}, meant to validate a 5-digit ZIP code, accepts 'abc12345def' because it matches the '12345' inside it. Other common mistakes include using overly broad wildcards (like .*) instead of specific character classes, or crafting patterns that fail to exclude dangerous or malformed data. Such lax validation can lead to data corruption, injection attacks, or logic flaws downstream, because the application processes input it assumed was already safe.
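The ZIP code case above can be sketched in Python as follows. This is a minimal illustration, not a complete validator: the function names are hypothetical, and the only difference between the two checks is whether the pattern may match a substring (re.search) or must cover the whole input (re.fullmatch).

```python
import re

ZIP_PATTERN = r"\d{5}"

def is_zip_unanchored(value: str) -> bool:
    # re.search succeeds if the pattern matches anywhere in the string.
    return re.search(ZIP_PATTERN, value) is not None

def is_zip_full(value: str) -> bool:
    # re.fullmatch succeeds only if the entire string matches the pattern.
    return re.fullmatch(ZIP_PATTERN, value) is not None

print(is_zip_unanchored("abc12345def"))  # True: '12345' matches as a substring
print(is_zip_full("abc12345def"))        # False: the whole input is not 5 digits
print(is_zip_full("12345"))              # True
```

Using re.fullmatch avoids having to remember both anchors, which is one way to make the permissive variant harder to write by accident.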
Real-world impact

Real-world CVEs caused by CWE-625

  • Chain: a regex in EXIF processor code does not correctly determine where a string ends (CWE-625), enabling eval injection (CWE-95), as exploited in the wild per CISA KEV.

  • A ".*" regexp leads to static code injection.

  • Insertion of a username into a regexp results in a partial comparison, causing the wrong database entry to be updated when one username is a substring of another.

  • A regexp intended to verify that all characters are legal only checks that at least one is legal, enabling file inclusion.

  • A regexp for an IP address isn't anchored at the end, allowing shell metacharacters to be appended.

  • A regexp isn't anchored to the beginning or end, which allows spoofed values that have trusted values as substrings.

  • A regexp in an .htaccess file allows access to files whose names contain certain substrings.

  • Macro files whose names contain certain substrings are allowed to load.
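Two of the patterns in the list above, the "at least one legal character" check and the username inserted into a regexp, can be sketched in Python. This is an illustrative reconstruction under assumptions: the function names and the character class are hypothetical and are not taken from any of the affected products.

```python
import re

def all_chars_legal_buggy(name: str) -> bool:
    # Passes as soon as ONE legal character appears anywhere in the input.
    return re.search(r"[A-Za-z0-9_]", name) is not None

def all_chars_legal(name: str) -> bool:
    # Requires EVERY character to be legal, from start to end.
    return re.fullmatch(r"[A-Za-z0-9_]+", name) is not None

def same_user_buggy(stored: str, supplied: str) -> bool:
    # The supplied name becomes the pattern, unescaped and unanchored,
    # so "bob" also matches the stored name "bobby".
    return re.search(supplied, stored) is not None

def same_user(stored: str, supplied: str) -> bool:
    # re.escape neutralizes metacharacters; fullmatch compares the whole name.
    return re.fullmatch(re.escape(supplied), stored) is not None
```

For an exact comparison such as the username case, plain string equality (stored == supplied) sidesteps the regex entirely and is usually the simpler choice.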

How attackers exploit it

Attacker path, step by step

  1. A program takes a phone number as input and uses the regular expression "\d+-\d+" to reject invalid phone numbers.

  2. An attacker could provide an argument such as "; ls -l ; echo 123-456". This would pass the check, since "123-456" is sufficient to match the "\d+-\d+" portion of the regular expression.

  3. A second program uses a regular expression to validate an IP string before using it in a call to the "ping" command.

  4. Since the regular expression does not have anchors (CWE-777), i.e. it is unbounded without ^ or $ characters, prepending 0 or 0x to the IP address still results in a match. Since the ping command supports octal and hex prepended IP addresses, it will use the unexpectedly valid IP address (CWE-1389). For example, "0x63.63.63.63" would be considered equivalent to "99.63.63.63". As a result, the attacker could potentially ping systems that the attacker cannot reach directly.
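Steps 3 and 4 can be sketched in Python. The pattern below is a hypothetical stand-in for the unanchored IP check, not the code from the original example, and parsing with the standard ipaddress module is a swapped-in alternative to regex validation rather than the fix the source prescribes.

```python
import ipaddress
import re

# Hypothetical permissive pattern: four dot-separated groups of 1-3 digits.
LOOSE_IP = re.compile(r"(\d{1,3}\.){3}\d{1,3}")

def accepts_unanchored(ip: str) -> bool:
    # search() finds "63.63.63.63" inside "0x63.63.63.63".
    return LOOSE_IP.search(ip) is not None

def accepts_anchored(ip: str) -> bool:
    # fullmatch() rejects the "0x" prefix, but \d{1,3} still allows
    # a leading zero such as "063", which ping may read as octal.
    return LOOSE_IP.fullmatch(ip) is not None

def accepts_parsed(ip: str) -> bool:
    # Let a real parser decide instead of a hand-written pattern.
    try:
        ipaddress.IPv4Address(ip)
    except ValueError:
        return False
    return True
```

Anchoring closes the hex-prefix hole in this sketch but not the leading-zero one, which is why a stricter pattern or a dedicated parser is worth considering. Newer Python releases are understood to reject leading zeros in ipaddress as well, but the sketch does not rely on that behavior.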

Vulnerable code example

Vulnerable Perl

The following code takes phone numbers as input, and uses a regular expression to reject invalid phone numbers.

$phone = GetPhoneNumber();
# BUG: the pattern is unanchored, so any input that merely contains
# digits-hyphen-digits passes the check.
if ($phone =~ /\d+-\d+/) {
    # looks like it only has hyphens and digits
    system("lookup-phone $phone");
}
else {
    error("malformed number!");
}
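To make the failure concrete, the check can be ported to Python and run against the payload from the attacker path. This is a sketch under assumptions: the function names are hypothetical, "lookup-phone" is the external program named in the Perl example, and passing an argument list to subprocess.run is an additional hardening step beyond the regex fix.

```python
import re
import subprocess

PAYLOAD = "; ls -l ; echo 123-456"

def check_like_perl_example(phone: str) -> bool:
    # Mirrors /\d+-\d+/: succeeds if the pattern occurs anywhere.
    return re.search(r"\d+-\d+", phone) is not None

def check_whole_input(phone: str) -> bool:
    # The entire input must be digits, a hyphen, then digits.
    return re.fullmatch(r"\d+-\d+", phone) is not None

def lookup_phone(phone: str) -> None:
    if not check_whole_input(phone):
        raise ValueError("malformed number!")
    # Argument list, no shell: metacharacters are not interpreted.
    subprocess.run(["lookup-phone", phone], check=True)
```

With these definitions, check_like_perl_example(PAYLOAD) accepts the payload while check_whole_input(PAYLOAD) rejects it, so lookup_phone raises before any command is run.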
Secure code example

Secure pseudo

// Validate the entire input against an anchored pattern before it reaches the sink.
function handleRequest(input) {
  // ^ and $ force the whole string to match, not just a substring.
  if (!/^\d+-\d+$/.test(input)) {
    throw new Error("malformed number!");
  }
  return executeWithGuards(input); // placeholder for the guarded sink
}
What changed: the pattern is anchored with ^ and $, so the input must match in full and the same payload no longer passes the check.
Prevention checklist

How to prevent CWE-625

  • Implementation: When applicable, ensure that the regular expression marks the beginning and end of the string, such as "/^string$/" for Perl.
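One caveat when applying this advice in Python: the $ anchor also matches just before a trailing newline, so an input ending in "\n" can still pass. The sketch below, with hypothetical function names, contrasts $ with \Z, which matches only at the very end of the string.

```python
import re

def anchored_with_dollar(value: str) -> bool:
    # In Python, $ also matches just before a trailing newline.
    return re.search(r"^\d+-\d+$", value) is not None

def anchored_strict(value: str) -> bool:
    # \Z matches only at the very end of the string.
    return re.search(r"^\d+-\d+\Z", value) is not None

def anchored_fullmatch(value: str) -> bool:
    # re.fullmatch needs no explicit anchors and rejects the newline too.
    return re.fullmatch(r"\d+-\d+", value) is not None
```

Whether a trailing newline matters depends on the sink, but when the value is later written to a log, a header, or a command line, the stricter forms are the safer default.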
Detection signals

How to detect CWE-625

Automated Static Analysis (effectiveness: High)

Automated static analysis, commonly referred to as Static Application Security Testing (SAST), can find some instances of this weakness by analyzing source code (or binary/compiled code) without having to execute it. Typically, this is done by building a model of data flow and control flow, then searching for potentially vulnerable patterns that connect "sources" (origins of input) with "sinks" (destinations where the data interacts with external components, a lower layer such as the OS, etc.).

Plexicus Auto-Fix

Plexicus automatically detects CWE-625 and opens a fix PR in under 60 seconds.

Codex Remedium scans every commit, identifies this exact weakness, and delivers a reviewer-ready pull request with the patch. No tickets. No hand-offs.

FAQ

Frequently asked questions

What is CWE-625?

This weakness occurs when a regular expression is too permissive, failing to properly validate or sanitize input by allowing unintended values or patterns.

How severe is CWE-625?

MITRE has not published a likelihood of exploit for this weakness. Treat it as medium impact until your threat model shows otherwise.

Which languages or platforms are affected by CWE-625?

MITRE lists the following affected platforms: Perl, PHP.

How can I prevent CWE-625?

When applicable, ensure that the regular expression marks beginning and ending string patterns, such as "/^string$/" for Perl.

How does Plexicus detect and fix CWE-625?

The Plexicus SAST engine detects the data-flow signature of CWE-625 on every commit. On a hit, our Codex Remedium agent opens a fix PR with corrected code, tests, and a one-line summary for the reviewer.

Where can I learn more about CWE-625?

MITRE publishes the canonical definition at https://cwe.mitre.org/data/definitions/625.html. For supplementary guidance, you can also consult the OWASP and NIST documentation.
