Machine Learning-based Android Malware Detection using Static and Dynamic Analysis

Abstract

This research presents a comprehensive machine learning approach for Android malware detection, combining static and dynamic analysis techniques with a Random Forest classifier to achieve robust mobile security. The system provides a complete analysis pipeline from APK extraction to threat classification, with extensive visualization and reporting capabilities.

Published in: Springer Lecture Notes in Networks and Systems, vol 507. DOI: 10.1007/978-3-032-07992-3_26

Methodology & Architecture

Malware Detection Flowchart

Dual-Phase Analysis Approach

Static Analysis

Dynamic Analysis

Machine Learning Pipeline

Key Technical Features

Technical Implementation

Technology Stack

Feature Extraction Categories

Analysis Pipeline

  1. APK Acquisition: Secure collection and verification of Android applications
  2. Static Feature Extraction: Automated analysis without execution
  3. Dynamic Feature Extraction: Controlled execution in sandbox environment
  4. Feature Integration: Combination of static and dynamic features
  5. Classification: Random Forest-based malware detection
  6. Visualization: Comprehensive reporting and visualization generation

Results & Analysis

Performance Metrics

The Random Forest classifier demonstrated strong performance across key evaluation metrics:

Feature Analysis Insights

The research revealed important patterns in malware characteristics:

Visualization Capabilities

The system includes comprehensive visualization tools:

Practical Applications

Publication & Recognition

Conference Paper

Title: "Machine Learning-based Android Malware Detection using Static and Dynamic Analysis"
Authors: Pratham Patel, Prof. Jizhou Tong (Gannon University)
Venue: Future Technology Conference (FTC) - SAI Conferences, 2024
Status: Accepted - To Appear

Research Contributions

Future Research Directions