Android malware detection through hybrid features fusion and ensemble classifiers: The AndroPyTool framework and the OmniDroid dataset

Martín García, Alejandro ORCID: https://orcid.org/0000-0002-0800-7632, Lara Cabrera, Raúl ORCID: https://orcid.org/0000-0002-7959-1936 and Camacho Fernández, David ORCID: https://orcid.org/0000-0002-5051-3475 (2019). Android malware detection through hybrid features fusion and ensemble classifiers: The AndroPyTool framework and the OmniDroid dataset. "Information Fusion", v. 52 ; pp. 128-142. ISSN 15662535. https://doi.org/10.1016/j.inffus.2018.12.006.

Descripción

Título: Android malware detection through hybrid features fusion and ensemble classifiers: The AndroPyTool framework and the OmniDroid dataset
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Information Fusion
Fecha: Diciembre 2019
ISSN: 15662535
Volumen: 52
Materias:
Palabras Clave Informales: Android, Hybrid features fusion, Malware analysis, Malware dataset
Escuela: E.T.S.I. de Sistemas Informáticos (UPM)
Departamento: Sistemas Informáticos
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of 4225953mod.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (1MB)

Resumen

Cybersecurity has become a major concern for society, mainly motivated by the increasing number of cyber attacks and the wide range of targeted objectives. Due to the popularity of smartphones and tablets, Android devices are considered an entry point in many attack vectors. Malware applications are among the most used tactics and tools to perpetrate a cyber attack, so it is critical to study new ways of detecting them. In these detection mechanisms, machine learning has been used to build classifiers that are effective in discerning if an application is malware or benignware. However, training such classifiers require big amounts of labelled data which, in this context, consist of categorised malware and benignware Android applications represented by a set of features able to describe their behaviour. For that purpose, in this paper we present OmniDroid, a large and comprehensive dataset of features extracted from 22,000 real malware and goodware samples, aiming to help anti-malware tools creators and researchers when improving, or developing, new mechanisms and tools for Android malware detection. Furthermore, the characteristics of the dataset make it suitable to be used as a benchmark dataset to test classification and clustering algorithms or new representation techniques, among others. The dataset has been released under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License and was built using AndroPyTool, our automated framework for dynamic and static analysis of Android applications. Finally, we test a set of ensemble classifiers over this dataset and propose a malware detection approach based on the fusion of static and dynamic features through the combination of ensemble classifiers. The experimental results show the feasibility and potential usability (for the machine learning, soft computing and cyber security communities) of our automated framework and the publicly available dataset.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Comunidad de Madrid
S2013/ICE3095
CIBERDINE
Sin especificar
Cybersecurity, Data and Risks); Spanish Ministry of Science and Education and Competitivity
Gobierno de España
TIN2014-56494-C4-4-P
EphemeCH
Sin especificar
Sin especificar
Gobierno de España
TIN2017-85727-C4-3-P
DeepBio
Sin especificar
Sin especificar

Más información

ID de Registro: 86700
Identificador DC: https://oa.upm.es/86700/
Identificador OAI: oai:oa.upm.es:86700
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/4225953
Identificador DOI: 10.1016/j.inffus.2018.12.006
URL Oficial: https://www.sciencedirect.com/science/article/pii/...
Depositado por: iMarina Portal Científico
Depositado el: 23 Ene 2025 16:52
Ultima Modificación: 23 Ene 2025 16:52