Published on 01 January 2024
Troid: Temporal and Cross-Sectional Android Dataset and Its Applications
View DatasetDescription
Numerous studies have focused on exploring Android malware in recent years, covering areas such as malware detection and application analysis. As a result, there is a pressing need for a reliable and scalable malware dataset to support the development and evaluation of effective malware studies. Although several benchmarks for Android malware datasets are widely used in research, they have significant limitations. Firstly, many of these datasets are outdated and do not capture current malware trends. Additionally, some have become obsolete or inaccessible, limiting their usefulness. Secondly, most datasets only contain the apps themselves (APKs), lacking important meta features like content rating, ad coverage, user ratings, and privacy policies. This omission restricts the potential applications of these datasets. This paper introduces a reliable Android malware dataset called \ours{} and sourced from the Google Play Store Market, covering the period from 2019 to 2023. To label malicious apps, we use VirusTotal and track their availability and removal status on the Google Play Store. We curate a meticulous Android malware dataset with 5,028 samples using this method. We augment our dataset with various features, including privacy policies, metadata, control flow graphs, permissions, API calls, strings, function names, hex dumps, and labels. We believe this benchmark dataset will greatly support various research efforts, including Android malware classification and detection, static program analysis, and privacy policy analysis.
Citations (1)
- https://doi.org/10.1007/s10207-025-01027-xDataCite OpenAlex
Cited on 09 April 2025
Weight: 1.23
Mentions (0)
No mentions found
Metrics Over Time
Publication Details
Subfield
Signal Processing
Field
Computer Science
Domain
Physical Sciences
Confidence Score
45%
Source
Scholar Data Model