Malicious Client Detection Using Machine Learning

Satyam Saxena

Threats

•There are many types of malware for all types of devices and operating systems

•Most if not all malware relies on a support system – command and control infrastructure

•Bad guys use DNS to scale and hide their C&C infrastructure

•Bad guys use DNS for C&C to bypass corporate security (tunneling)

•Bad guys use cloud providers to roll out, scale, manage, and quickly move their C&C infrastructure

Without relying on any particular endpoint operating system or configuration, we can use big data analytics on network data to detect malware.

Malware use of DNS

[Figure: an endpoint device queries a DNS server for rndruppbakyokv[.]com and receives the address 1.2.3.4, which belongs to the command and control infrastructure.]

A communication channel with the C&C is established. The compromised device receives updates, instructions, and targets.

Architecture

[Figure: Machine Learning Pipeline. Raw pDNS data feeds a Domain Name classifier, a DNS Resolver classifier, and a Device Behavior classifier. Their outputs (Malicious Domains, Malicious Resolvers, Behavior Anomalies) feed a final classifier that flags a Compromised Device (Security Event). Model labels shown in the figure: DGA, Network, Time, Tunnel.]

DGA Model

• Detects randomly generated domains in the pDNS data.

• The model is trained on six malware families, including Zeus, Tinba, and Pushdo.

• 29 features are extracted from each domain.

• PCA reduces the 29 features to 16.

• The reduced feature set is then used to train a GBM classifier.

Domain Features

• Common letter score

• Entropy
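As a rough illustration of the two features named on this slide, the sketch below computes a character-level Shannon entropy and one possible common letter score for a domain label. The talk does not define either feature precisely, so the function names, the `COMMON_LETTERS` set, and the scoring rule are assumptions made for this example only.

```python
import math
from collections import Counter

# Assumption: a hand-picked set of frequent English letters. The talk does not
# say how its common letter score is computed.
COMMON_LETTERS = frozenset("etaoinshr")


def shannon_entropy(label: str) -> float:
    """Shannon entropy, in bits per character, of the characters in label."""
    if not label:
        return 0.0
    total = len(label)
    counts = Counter(label)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def common_letter_score(label: str) -> float:
    """Fraction of characters in label that belong to COMMON_LETTERS."""
    if not label:
        return 0.0
    return sum(ch in COMMON_LETTERS for ch in label.lower()) / len(label)


if __name__ == "__main__":
    # Compare the random-looking label from the slides with a familiar one.
    for label in ("rndruppbakyokv", "google"):
        print(label, round(shannon_entropy(label), 3), round(common_letter_score(label), 3))
```

Under these definitions, a label with many distinct, evenly spread characters tends to score higher on entropy than a short, repetitive one, which is one reason entropy is a plausible DGA feature.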

Domain Features (2)

• Length of largest meaningful string

• Mean length of dictionary words
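One way to approximate these two dictionary-based features is to collect every substring of the domain label that appears in a word list, then take the longest match and the mean match length. This is only a sketch: the tiny `WORDS` set, the minimum word length, and the choice to count overlapping matches are all assumptions, not details from the talk.

```python
# Assumption: a tiny illustrative word list. A real system would load a full dictionary.
WORDS = frozenset({"face", "book", "facebook", "mail", "bank", "pay"})


def dictionary_substrings(label, words=WORDS, min_len=3):
    """Return every substring of label (length >= min_len) found in words."""
    label = label.lower()
    return [
        label[i:j]
        for i in range(len(label))
        for j in range(i + min_len, len(label) + 1)
        if label[i:j] in words
    ]


def dictionary_features(label, words=WORDS):
    """Return (length of longest dictionary substring, mean dictionary-word length)."""
    found = dictionary_substrings(label, words)
    if not found:
        return 0, 0.0
    lengths = [len(word) for word in found]
    return max(lengths), sum(lengths) / len(lengths)


if __name__ == "__main__":
    print(dictionary_features("facebook"))
    print(dictionary_features("rndruppbakyokv"))
```

With this word list, a label such as `rndruppbakyokv` contains no dictionary words at all, so both features fall to zero.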

DGA Features

DGA Classification Performance

Overall model performance (Random Forest)

Metric      Performance
Accuracy    98.738%
Precision   99.288%
Recall      98.181%
AUC         99.801%
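For readers who want the definitions behind these figures, accuracy, precision, and recall follow from the four confusion-matrix counts. The sketch below uses the standard formulas. The function name and the counts in the usage example are hypothetical and are not the data behind the reported results.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, and recall from confusion-matrix counts."""
    total = tp + fp + fn + tn
    return {
        # Share of all domains that were labeled correctly.
        "accuracy": (tp + tn) / total if total else 0.0,
        # Share of domains flagged as DGA that really were DGA.
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        # Share of real DGA domains that were flagged.
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical counts, chosen only to make the arithmetic easy to follow.
    print(classification_metrics(tp=90, fp=10, fn=30, tn=870))
```

AUC is not included because it depends on the classifier's ranked scores rather than on a single set of counts.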

Performance per malware family

Malware Family   % Detection
Conficker        86.309%
Cryptolocker     98.348%
Pushdo           95.515%
Ramdo            99.823%
Tinba            96.715%
Zeus             100.0%

Network Model

• Uses the WHOIS record to determine whether a domain is malicious or benign.

• A WHOIS record contains rich information about a domain.

• Age-based features.

• Registration features.
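Age-based features can be derived from the dates in a WHOIS record with simple date arithmetic. The sketch below assumes a record that has already been parsed into a dictionary of `datetime.date` values. The field names and the particular features are hypothetical, since the talk does not list how its age features are defined.

```python
from datetime import date


def whois_age_features(record, today):
    """Derive day-count features from parsed WHOIS dates.

    Assumption: record is a dict with datetime.date values under the
    hypothetical keys 'creation_date', 'update_date', and 'expiration_date'.
    """
    created = record["creation_date"]
    updated = record["update_date"]
    expires = record["expiration_date"]
    return {
        "age_days": (today - created).days,
        "days_since_update": (today - updated).days,
        "days_to_expiry": (expires - today).days,
        "registration_span_days": (expires - created).days,
    }


if __name__ == "__main__":
    # Hypothetical record, for illustration only.
    record = {
        "creation_date": date(2024, 1, 1),
        "update_date": date(2024, 1, 11),
        "expiration_date": date(2025, 1, 1),
    }
    print(whois_age_features(record, today=date(2024, 1, 31)))
```

Passing `today` explicitly keeps the function deterministic, which makes it easier to recompute features for historical pDNS data.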

Network Features – WHOIS Server

[Figure panels: Malicious Domains, Benign Domains]

Network Features – Creation Date

Network Model Performance

• Final set of features: creation date, update date, expiration date, admin country, registrant country, tech country, status, WHOIS server

Metric             Performance
Error              0.00450864127
Area Under Curve   0.96615884041

Compromised Client Detection

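[Figure: pDNS data stored in Hadoop HDFS is processed with Spark Compute, using a Group By on the client IP to produce per-client counts for each signal.]

IP    DGA   WHOIS   NX   SERVER
ip1   #10   #3      #4   #5
ip2   #8    #1      #2   #3
ip3   #5    #2      #0   #0
ip4   #3    #3      #0   #0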
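The per-client counts on this slide can be approximated on a single machine with a plain Python group-by. This is only a sketch standing in for the Spark job: the `(client_ip, signal)` event format, the signal names, and the ranking by total count are assumptions made for illustration.

```python
from collections import defaultdict

# Assumption: one signal name per column of the per-client table.
SIGNALS = ("dga", "whois", "nx", "server")


def count_signals_per_client(events):
    """Group (client_ip, signal) events by client and count each signal type."""
    counts = defaultdict(lambda: dict.fromkeys(SIGNALS, 0))
    for client_ip, signal in events:
        if signal in SIGNALS:  # ignore signal types outside the table
            counts[client_ip][signal] += 1
    return dict(counts)


def rank_clients(counts):
    """Order client IPs by total number of suspicious signals, highest first."""
    return sorted(counts, key=lambda ip: sum(counts[ip].values()), reverse=True)


if __name__ == "__main__":
    # Hypothetical events, for illustration only.
    events = [
        ("ip1", "dga"), ("ip1", "dga"), ("ip1", "nx"),
        ("ip2", "whois"), ("ip2", "other"),
    ]
    per_client = count_signals_per_client(events)
    for ip in rank_clients(per_client):
        print(ip, per_client[ip])
```

Ranking by the raw total is the simplest possible scoring rule. A real deployment would more likely feed the per-client counts into a trained classifier.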

Thank You
