Skip to content

Latest commit

 

History

History
134 lines (81 loc) · 9.55 KB

README.md

File metadata and controls

134 lines (81 loc) · 9.55 KB

📱 Android Apps Scraper/Downloader

views DOI

Open Source Love svg1 GitHub Forks GitHub Issues contributions welcome

We did a research project on comparison of Official Play Store Apps and their 3rd Party App Stores counterparts to analyze what modifications are done to the 3rd Party versions of an app. For this purpose, we wrote this script to download pairs of an app from Google Play Store and Xiaomi App Store (a famous 3rd Party Chinese App Store). It downloads an app from Xiaomi and Google Play store only when that app is available on both stores. This way it creates a dataset of pairs of an app.

We are open-sourcing this tool so that it can be utilized by the research community for research in Android Security.

Moroever, to compare two Android Apps we wrote another tool named AndroCompare. We have open-sourced it as well.

For details regarding citing/referencing this tool for your research, check the 'Citation' section below.

Approach

The download URL of an app on Xiaomi App Store is like http://app.mi.com/download/23 The number at the end of the URL can be incremented to download as many apps as you want. So, theoretically you can download each and every app on the Xiaomi App Store. Hence there's a variable named 'target' in the tool. If target = 1000 then the tool will scan first 1000 urls for the apps. You can change the target to any number you want.

Features

  • download all apps from the famous Xiaomi App Store
  • download pairs of an app from Play Store and Xiaomi App Store
  • the record of all downloaded apps is maintained in a SQLite database
  • if the code is interrupted using CTRL + Z then the current progress is saved so that next time the code resumes downloading from where it left previously
  • incase the script is running and there occurs some internet connectivity issue then all current progress is saved and the script waits until the internet is connected again and resumes from where it left

How to Run Code

The code is in ready to run condition. It can be run on both Windows/Ubuntu Linux. Its written in Python 3. Moreover it uses gplaycli, so install it using pip.

You can use DB Browser for SQLite to view the database.


Note

This script can be easily modified to meet your specific needs e.g currently it first checks whether an app is present on both stores and if yes then it downloads it from each store. You can remove this constraint to download every possible app from Xiaomi App Store.

This code is for research purposes only.


Citation

DOI

If you use this tool for your research, then kindly cite it. Click the above badge for more information regarding the complete citation for this tool and diffferent citation formats like IEEE, APA etc.




Authors

You can get in touch with us on our LinkedIn profiles:

Haris Muneer

LinkedIn Link

To stay updated about my latest projects: GitHub Follow

Hussam Habib

LinkedIn Link

To stay updated about my latest projects: GitHub Follow


If you liked the repo then kindly support it by giving it a star ⭐ and share in your circles so more people can benefit from the effort.

Contributions Welcome

GitHub Issues

If you find any bugs, have suggestions, or face issues:

  • Open an Issue in the Issues Tab to discuss them.
  • Submit a Pull Request to propose fixes or improvements.
  • Review Pull Requests from other contributors to help maintain the project's quality and progress.

This project thrives on community collaboration! Members are encouraged to take the initiative, support one another, and actively engage in all aspects of the project. Whether it’s debugging, fixing issues, or brainstorming new ideas, your contributions are what keep this project moving forward.

With modern AI tools like ChatGPT, solving challenges and contributing effectively is easier than ever. Let’s work together to make this project the best it can be! 🚀

License

MIT

Copyright (c) 2018-present, harismuneer, hussamh10


Waving hand Hey there, I'm Haris Muneer 👨🏻‍💻

Total Github Stars Total Github Followers


  • 🕸️ Founder of Cyfy Labs: At Cyfy Labs, we provide advanced social media scraping tools to help businesses, researchers, and marketers extract actionable data from platforms like Facebook, Instagram, and X (formerly Twitter). Our tools support lead generation, sentiment analysis, market research, and various other use cases. To learn more, visit: www.cyfylabs.com

  • 🌟 Open Source Advocate: I’m passionate about making tech accessible. I’ve open-sourced several projects that you can explore on my GitHub profile and on the Open Source Software PK page.

  • 📫 How to Reach Me: You can learn more about my skills/work at LinkedIn. You can also reach out via email for collaboration or inquiries. For Cyfy Labs related queries, please reach out through the company website.


🤝 Follow my journey