The DATAR dataset, is designed to track the characteristics of open-source Android app releases, like popularity ratings, and the social and technical factors that affect them. The dataset comprises two types of information: app-related and release-related, with the majority collected from the Google Play and GitHub platforms. Additionally, our dataset includes several source code attributes (e.g., number of activities, cyclomatic complexity) that have proven effective in previous research, extracted from the APK files and their source code. This dataset contains information from 8,031 releases of 1,354 Android open-source apps, each having at least one compatible published release on Google Play and GitHub.
You can access the dataset from the link below:
If you use this dataset in your research, please cite the accompanying paper:
Y. Abedini, M. H. Hajihosseini, and A. Heydarnoori. "DATAR: A Dataset for Tracking App Releases", In Proceedings of the 21st IEEE/ACM International Conference on Mining Software Repositories (MSR), Lisbon, Portugal, Apr. 2024.
@inproceedings{abedini-msr2024-DATAR,
title={DATAR: A Dataset for Tracking App Releases},
booktitle={Proceedings of the 21st IEEE/ACM International Conference on Mining Software Repositories (MSR)},
author={Yasaman Abedini and Mohammad Hadi Hajihosseini and Abbas Heydarnoori},
month={April},
year={2024},
publisher={IEEE/ACM},
address={Lisbon, Portugal},
}
Thank you for using DATAR!