Variant · Low-Medium

CWE-150: Improper Neutralization of Escape, Meta, or Control Sequences

The product receives input from an upstream component, but it does not neutralize or incorrectly neutralizes special elements that could be interpreted as escape, meta, or control character sequences...

CWE-150 · Variant Level ·12 CVEs ·6 Mitigations

Description

The product receives input from an upstream component, but it does not neutralize or incorrectly neutralizes special elements that could be interpreted as escape, meta, or control character sequences when they are sent to a downstream component.

Potential Impact

Integrity

Execute Unauthorized Code or Commands, Hide Activities, Unexpected State

Demonstrative Examples

Consider a situation in which an AI agent uses LLM output based on training data from untrusted sources.
Suppose an attacker is able to inject input into the training data that is used by the LLM. If a person is using a terminal emulator to interact with the LLM on the command line, if the output contains escape codes, then the emulator might act on those codes, e.g., to change color, flash text, or (in some cases) leak sensitive information or execute code. In one documented case, a terminal would handle an OSC (Operating System Command) for setting the working directory by processing a file:// URL in a way that could perform a DNS lookup for an adversary-controlled domain, leading to an information leak [REF-1526].

Mitigations & Prevention

General

Developers should anticipate that escape, meta and control characters/sequences will be injected/removed/manipulated in the input vectors of their product. Use an appropriate combination of denylists and allowlists to ensure only valid, expected and appropriate input is processed by the system.

Implementation

Assume all input is malicious. Use an "accept known good" input validation strategy, i.e., use a list of acceptable inputs that strictly conform to specifications. Reject any input that does not strictly conform to specifications, or transform it into something that does. When performing input validation, consider all potentially relevant properties, including length, type of input, the full range of acceptable values, missing or extra inputs, syntax, consistency across relat

Implementation

While it is risky to use dynamically-generated query strings, code, or commands that mix control and data together, sometimes it may be unavoidable. Properly quote arguments and escape any special characters within those arguments. The most conservative approach is to escape or filter all characters that do not pass an extremely strict allowlist (such as everything that is not alphanumeric or white space). If some special characters are still needed, such as white space, wrap each argument in qu

Implementation

Inputs should be decoded and canonicalized to the application's current internal representation before being validated (CWE-180). Make sure that the application does not decode the same input twice (CWE-174). Such errors could be used to bypass allowlist validation schemes by introducing dangerous inputs after they have been checked.

Implementation High

When using output from an LLM, neutralize or strip escape codes before redirecting output to the terminal or other rendering engine that would process the codes. The neutralization could require that the character be printable and/or allowable whitespace, such as a carriage return or newline. Be deliberate about what to allow.

Build and Compilation Limited

When using an LLM: during tokenizer training, suppress escape codes from the tokenizer's vocabulary. Depending on context, this could be accomplished by removing the codes from input to the tokenizer, or removing the map from the string to its token ID. It is generally unlikely that this removal would adversely affect the quality or correctness of what is generated, e.g. advice requests for terminal

Detection Methods

  • Automated Static Analysis High — Automated static analysis, commonly referred to as Static Application Security Testing (SAST), can find some instances of this weakness by analyzing source code (or binary/compiled code) without having to execute it. Typically, this is done by building a model of data flow and control flow, then sea

Real-World CVE Examples

CVE IDDescription
CVE-2024-27936Chain: JavaScript-based application removes ANSI escape sequences in a dialog that asks permission for a particular file, causing the wrong filename to be visually presented for user approval (CWE-451
CVE-2002-0542The mail program processes special "~" escape sequence even when not in interactive mode.
CVE-2000-0703Setuid program does not filter escape sequences before calling mail program.
CVE-2002-0986Mail function does not filter control characters from arguments, allowing mail message content to be modified.
CVE-2003-0020Multi-channel issue. Terminal escape sequences not filtered from log files.
CVE-2003-0083Multi-channel issue. Terminal escape sequences not filtered from log files.
CVE-2003-0021Terminal escape sequences not filtered by terminals when displaying files.
CVE-2003-0022Terminal escape sequences not filtered by terminals when displaying files.
CVE-2003-0023Terminal escape sequences not filtered by terminals when displaying files.
CVE-2003-0063Terminal escape sequences not filtered by terminals when displaying files.
CVE-2000-0476Terminal escape sequences not filtered by terminals when displaying files.
CVE-2001-1556MFV. (multi-channel). Injection of control characters into log files that allow information hiding when using raw Unix programs to read the files.

Taxonomy Mappings

  • PLOVER: — Escape, Meta, or Control Character / Sequence
  • The CERT Oracle Secure Coding Standard for Java (2011): IDS03-J — Do not log unsanitized user input
  • Software Fault Patterns: SFP24 — Tainted input to command

Frequently Asked Questions

What is CWE-150?

CWE-150 (Improper Neutralization of Escape, Meta, or Control Sequences) is a software weakness identified by MITRE's Common Weakness Enumeration. It is classified as a Variant-level weakness. The product receives input from an upstream component, but it does not neutralize or incorrectly neutralizes special elements that could be interpreted as escape, meta, or control character sequences...

How can CWE-150 be exploited?

Attackers can exploit CWE-150 (Improper Neutralization of Escape, Meta, or Control Sequences) to execute unauthorized code or commands, hide activities, unexpected state. This weakness is typically introduced during the Build and Compilation, Implementation, Implementation phase of software development.

How do I prevent CWE-150?

Key mitigations include: Developers should anticipate that escape, meta and control characters/sequences will be injected/removed/manipulated in the input vectors of their product. Use an appropriate combination of denylists

What is the severity of CWE-150?

CWE-150 is classified as a Variant-level weakness (Low-Medium abstraction). It has been observed in 12 real-world CVEs.