Skip to content

wishforgood/playdrone

 
 

Repository files navigation

PlayDrone

This repository contains the code used in the following paper:

A Measurement Study of Google Play

The talk can be watched on Youtube: http://youtu.be/xS0lyL_0OAM

The slides are available here: http://viennot.com/playdrone-slides.pdf

The paper can be downloaded here: http://viennot.com/playdrone.pdf


Getting the Data

Playdrone is deployed on archive.org, and currently crawling. Instructions on how to get the data will be made available during Nov. 2014.


The code is research quality code. It's poorly documented, and have no test suite.

Most of the code lies in lib/ and app/models/.

I strongly discourage you to run the code and encourage you to use it only as a reference, but if you must, here are the basic steps to process an app in dev mode:

  1. Make sure you have Ruby and Java installed

  2. Make sure you have Elasticsearch and Redis running

  3. Run bundle install

  4. Run rails c

  5. Add a google account with Account.create(:email => 'email', :password => 'password', :android_id => 'id'). An android id can be generated with Android Checkin.

  6. Running Account.first_usable should not block, but return something.

  7. Run Stack.process_app(:app_id => 'com.facebook.katana').

  8. You should see the facebook app repo in the repos directory.


If you want to go in production and launch the crawler, you can use the PlayDrone Kitchen.

Follow the instructions, edit deploy/settings.rb and run cap deploy:setup and cap deploy.

If everything works out (good luck), you'll be able to kick of jobs from a rails console. Try Stack.process_app(:app_id => 'com.facebook.katana'), and PlayDrone should discover at least half of the market by looking at related apps. Note that to increase the throughput, you may need to add more Google accounts.

Here's what you can expect to see once everything is running in the graphite dashboard:

Dashboard


GET ALL THE APPS!

PlayDrone is released under the MIT license.

About

Google Play Crawler

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Ruby 87.3%
  • Gnuplot 5.9%
  • Shell 4.0%
  • CSS 2.5%
  • Makefile 0.2%
  • CoffeeScript 0.1%