You could come up with some metric. I used s_i(transit) = (sum of weights of users who marked a transit) / (sum of weights of all users who classified the light curve) and cut at 0.5 to identify transits. I plot that distribution in Figure 6 of Schwamb et al. 2012 for the current cases you've run (unsupervised, Quarter 1).
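In case it helps, here is a rough sketch of how that score could be computed for one light curve. The function names, the `(user_id, marked_transit)` input layout, and the choice of a strict `>` at the cut are my assumptions for illustration, not the code used for the paper.

```python
def transit_score(classifications, user_weights):
    """Weighted fraction of classifiers who marked a transit.

    classifications: list of (user_id, marked_transit) pairs for ONE
        light curve (hypothetical layout).
    user_weights: dict mapping user_id -> weight (hypothetical layout).
    """
    # Denominator: weights of everyone who classified the light curve.
    total = sum(user_weights[user] for user, _ in classifications)
    if total == 0:
        return 0.0  # no usable classifications; assumed convention
    # Numerator: weights of only those users who marked a transit.
    marked = sum(user_weights[user] for user, hit in classifications if hit)
    return marked / total


def is_transit(score, cut=0.5):
    # Whether the cut is strict (>) or inclusive (>=) is an assumption here.
    return score > cut


# Toy example with made-up weights, not real data.
weights = {"a": 1.0, "b": 0.5, "c": 0.5}
marks = [("a", True), ("b", True), ("c", False)]
score = transit_score(marks, weights)
print(score, is_transit(score))
```

Running it over every light curve and histogramming the scores should give the same kind of distribution as the figure mentioned above.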
