数据混淆

数据混淆是攻击者通过修改命令控制流量或恶意代码的结构特征,使其难以被检测和分析的技术手段。其核心目标并非加密保护数据机密性,而是通过协议模拟、隐写嵌入、代码变形等方法降低恶意活动的可识别性。防御方可通过深度协议分析、异常元数据检测、载荷熵值监测等手段进行对抗,重点关注网络流量的协议合规性、数据统计特征异常性以及载荷结构的随机性水平。

为突破传统数据混淆技术因模式固定或载体单一而易于被规则匹配检测的局限,攻击者发展出动态化、智能化的新型混淆体系。通过构建协议指纹库实现流量深度伪装,利用隐写技术实现存在性隐匿,结合运行时代码变异规避静态检测,形成多层联动的混淆防御穿透链。

现有数据混淆匿迹技术的核心突破在于实现攻击载荷与网络流量的动态环境适配。协议模拟混淆通过协议逆向与状态仿真,使恶意流量获得合法协议的表层特征;隐写嵌入混淆利用数字媒体的信息冗余特性,将攻击指令隐匿于视觉/听觉不可感知的载体中;动态载荷混淆则通过遗传变异算法生成语法唯一性代码,破坏特征指纹的稳定性。三类技术的共性在于突破传统静态混淆模式,引入环境感知与动态调整机制,使攻击载荷能够根据防御态势实时优化混淆策略,形成"一次一密"的对抗能力,显著提升检测成本。

匿迹技术的演进导致传统基于特征签名、固定规则库的检测体系面临严峻挑战。防御方需构建动态协议行为建模、隐写流量检测、运行时代码仿真等新型能力,结合威胁情报共享实现混淆策略的预判式防御,并发展基于深度学习的多维度异常检测框架应对智能化混淆攻击。

ID: T1001
Sub-techniques:  T1001.001, T1001.002, T1001.003
Tactic: 命令控制
Platforms: Linux, Windows, macOS
Version: 1.1
Created: 31 May 2017
Last Modified: 07 October 2024

匿迹效应

效应类型 是否存在
特征伪装
行为透明
数据遮蔽
时空释痕

特征伪装

攻击者通过精确模拟合法协议的结构特征与交互模式,使恶意流量在报文格式、状态转换等表层特征上与正常业务流量高度一致。例如将C2指令封装为符合HTTP/2规范的流量,或利用DNS TXT记录传递加密指令,使得防御方难以通过协议合规性检查发现异常。

数据遮蔽

采用多层嵌套的混淆技术实现数据内容的深度隐藏。在协议模拟混淆中,通过保留字段填充和扩展头封装遮蔽真实指令;隐写术混淆利用多媒体文件的冗余空间隐藏数据存在性;动态载荷混淆则通过代码变形遮蔽恶意功能语义。这些手法使得传统基于内容解析的检测手段难以有效提取攻击特征。

Procedure Examples

ID Name Description
S1111 DarkGate

DarkGate will retrieved encrypted commands from its command and control server for follow-on actions such as cryptocurrency mining.[1]

S0381 FlawedAmmyy

FlawedAmmyy may obfuscate portions of the initial C2 handshake.[2]

S1120 FRAMESTING

FRAMESTING can send and receive zlib compressed data within POST requests.[3]

S1044 FunnyDream

FunnyDream can send compressed and obfuscated packets to C2.[4]

G0047 Gamaredon Group

Gamaredon Group has used obfuscated VBScripts with randomly generated variable names and concatenated strings.[5]

S1100 Ninja

Ninja has the ability to modify headers and URL paths to hide malicious traffic in HTTP requests.[6]

S0439 Okrum

Okrum leverages the HTTP protocol for C2 communication, while hiding the actual messages in the Cookie and Set-Cookie headers of the HTTP requests.[7]

C0014 Operation Wocao

During Operation Wocao, threat actors encrypted IP addresses used for "Agent" proxy hops with RC4.[8]

S0495 RDAT

RDAT has used encoded data within subdomains as AES ciphertext to communicate from the host to the C2.[9]

S0610 SideTwist

SideTwist can embed C2 responses in the source code of a fake Flickr webpage.[10]

S0533 SLOTHFULMEDIA

SLOTHFULMEDIA has hashed a string containing system information prior to exfiltration via POST requests.[11]

S0682 TrailBlazer

TrailBlazer can masquerade its C2 traffic as legitimate Google Notifications HTTP requests.[12]

Mitigations

ID Mitigation Description
M1031 Network Intrusion Prevention

Network intrusion detection and prevention systems that use network signatures to identify traffic for specific adversary malware can be used to mitigate some obfuscation activity at the network level.

Detection

ID Data Source Data Component Detects
DS0029 Network Traffic Network Traffic Content

Analyze network data for uncommon data flows (e.g., a client sending significantly more data than it receives from a server). Processes utilizing the network that do not normally have network communication or have never been seen before are suspicious. Analyze packet contents to detect communications that do not follow the expected protocol behavior for the port that is being used. [13]

References