CWE-1389: Incorrect Parsing of Numbers with Different Radices

Description

The product parses numeric input assuming base 10 (decimal) values, but it does not account for inputs that use a different base number (radix).

Frequently, a numeric input that begins with "0" is treated as octal, or "0x" causes it to be treated as hexadecimal, e.g. by the inet_addr() function. For example, "023" (octal) is 35 decimal, or "0x31" is 49 decimal. Other bases may be used as well. If the developer assumes decimal-only inputs, the code could produce incorrect numbers when the inputs are parsed using a different base. This can result in unexpected and/or dangerous behavior. For example, a "0127.0.0.1" IP address is parsed as octal due to the leading "0", whose numeric value would be the same as 87.0.0.1 (decimal), where the developer likely expected to use 127.0.0.1. The consequences vary depending on the surrounding code in which this weakness occurs, but they can include bypassing network-based access control using unexpected IP addresses or netmasks, or causing apparently-symbolic identifiers to be processed as if they are numbers. In web applications, this can enable bypassing of SSRF restrictions.

Potential Impact

Confidentiality

Read Application Data

Integrity

Bypass Protection Mechanism, Alter Execution Logic

Demonstrative Examples

The below demonstrative example uses an IP validator that splits up an IP address by octet, tests to ensure each octet can be casted into an integer, and then returns the original IP address if no exceptions are raised. This validated IP address is then tested using the "ping" command.

Bad

import subprocess
					  
					  def validate_ip(ip: str):
					  
					    split_ip = ip.split('.')
						if len(split_ip) > 4 or len(split_ip) == 0:
						
						  raise ValueError("Invalid IP length")
						
						
						for octet in split_ip:
						
						  try:
						  
						    int(octet, 10)
						  
						  except ValueError as e:
						  
						    raise ValueError(f"Cannot convert IP octet to int - {e}")
						  
						
						
						# Returns original IP after ensuring no exceptions are raised
						return ip

					  
					  
					  def run_ping(ip: str):
					  
					    validated = validate_ip(ip)
						# The ping command treats zero-prepended IP addresses as octal
						result = subprocess.call(["ping", validated])
						print(result)

If run_ping() were to be called with one or more zero-prepended octets, validate_ip() will succeed as zero-prepended numerical strings can be interpreted as decimal by a cast ("012" would cast to 12). However, as the original IP with the prepended zeroes is returned rather than the casted IP, it will be used in the call to the ping command. Ping DOES check and support octal-based IP octets, so the IP reached via ping may be different than the IP assumed by the validator. For example, ping would considered "0127.0.0.1" the same as "87.0.0.1".

This code uses a regular expression to validate an IP string prior to using it in a call to the "ping" command.

Bad

import subprocess
					  import re
					  
					  def validate_ip_regex(ip: str):
					  
					    ip_validator = re.compile(r"((25[0-5]|(2[0-4]|1\d|[1-9]|)\d)\.?\b){4}")
						if ip_validator.match(ip):
						
						  return ip
						
						else:
						
						  raise ValueError("IP address does not match valid pattern.")
						
					  
					
					def run_ping_regex(ip: str):
				      
						validated = validate_ip_regex(ip)
						# The ping command treats zero-prepended IP addresses as octal
						result = subprocess.call(["ping", validated])
						print(result)

Since the regular expression does not have anchors (CWE-777), i.e. is unbounded without ^ or $ characters, then prepending a 0 or 0x to the beginning of the IP address will still result in a matched regex pattern. Since the ping command supports octal and hex prepended IP addresses, it will use the unexpectedly valid IP address (CWE-1389). For example, "0x63.63.63.63" would be considered equivalent to "99.63.63.63". As a result, the attacker could potentially ping systems that the attacker cannot reach directly.

Consider the following scenario, inspired by CWE team member Kelly Todd. 
					 Kelly wants to set up monitoring systems for his two cats, who pose very different threats. One cat, Night, tweets embarrassing or critical comments about his owner in ways that could cause reputational damage, so Night's blog needs to be monitored regularly. The other cat, Taki, likes to distract Kelly and his coworkers during business meetings with cute meows, so Kelly monitors Taki's location using a different web site. 
					 Suppose /etc/hosts provides the site info as follows:

Bad

taki.example.com 10.1.0.7
					  night.example.com 010.1.0.8

The entry for night.example.com has a typo "010" in the first octet. When using ping to ensure the servers are up, the leading 0 causes the IP address to be converted using octal.  So when Kelly's script attempts to access night.example.com, it inadvertently scans 8.1.0.8 instead of 10.1.0.8 (since "010" in octal is 8 in decimal), and Night is free to send new Tweets without being immediately detected.

Mitigations & Prevention

Implementation

If only decimal-based values are expected in the application, conditional checks should be created in a way that prevent octal or hexadecimal strings from being checked. This can be achieved by converting any numerical string to an explicit base-10 integer prior to the conditional check, to prevent octal or hex values from ever being checked against the condition.

Implementation

If various numerical bases do need to be supported, check for leading values indicating the non-decimal base you wish to support (such as 0x for hex) and convert the numeric strings to integers of the respective base. Reject any other alternative-base string that is not intentionally supported by the application.

Implementation

If regular expressions are used to validate IP addresses, ensure that they are bounded using ^ and $ to prevent base-prepended IP addresses from being matched.

Detection Methods

Automated Static Analysis — Automated static analysis, commonly referred to as Static Application Security Testing (SAST), can find some instances of this weakness by analyzing source code (or binary/compiled code) without having to execute it. Typically, this is done by building a model of data flow and control flow, then sea

Real-World CVE Examples

CVE ID	Description
CVE-2021-29662	Chain: Use of zero-prepended IP addresses in Perl-based IP validation module can lead to an access control bypass.
CVE-2021-28918	Chain: Use of zero-prepended IP addresses in a product that manages IP blocks can lead to an SSRF.
CVE-2021-29921	Chain: Use of zero-prepended IP addresses in a Python standard library package can lead to an SSRF.
CVE-2021-29923	Chain: Use of zero-prepended IP addresses in the net Golang library can lead to an access control bypass.
CVE-2021-29424	Chain: Use of zero-prepended IP addresses in Perl netmask module allows bypass of IP-based access control.
CVE-2016-4029	Chain: incorrect validation of intended decimal-based IP address format (CWE-1286) enables parsing of octal or hexadecimal formats (CWE-1389), allowing bypass of an SSRF protection mechanism (CWE-918)
CVE-2020-13776	Mishandling of hex-valued usernames leads to unexpected decimal conversion and privilege escalation in the systemd Linux suite.

Frequently Asked Questions

What is CWE-1389?

CWE-1389 (Incorrect Parsing of Numbers with Different Radices) is a software weakness identified by MITRE's Common Weakness Enumeration. It is classified as a Base-level weakness. The product parses numeric input assuming base 10 (decimal) values, but it does not account for inputs that use a different base number (radix).

How can CWE-1389 be exploited?

Attackers can exploit CWE-1389 (Incorrect Parsing of Numbers with Different Radices) to read application data. This weakness is typically introduced during the Implementation, Implementation phase of software development.

How do I prevent CWE-1389?

Key mitigations include: If only decimal-based values are expected in the application, conditional checks should be created in a way that prevent octal or hexadecimal strings from being checked. This can be achieved by conver

What is the severity of CWE-1389?

CWE-1389 is classified as a Base-level weakness (Medium abstraction). It has been observed in 7 real-world CVEs.

Description

Potential Impact

Confidentiality

Integrity

Demonstrative Examples

Mitigations & Prevention

Detection Methods

Real-World CVE Examples

Related Weaknesses

CWE-704: Incorrect Type Conversion or Cast

Frequently Asked Questions