Benchmark with independent classification model #68

Open
jalvear2dxc opened this issue May 21, 2021 · 3 comments

jalvear2dxc commented May 21, 2021

Hello @arogozhnikov ,

To check the quality of the reweighting, I trained an independent classifier, based on the UGradientBoostingClassifier class, on the same dataset, following the steps below:

  1. Before Reweighting: Training (using prior weights as sample weights) and scoring
  2. Reweighting
  3. After Reweighting: Training using new weights as sample weights and scoring
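The three steps above can be sketched end to end on toy data. This is only an illustration under loud assumptions, not the code used in this issue: the data are two synthetic 1D Gaussians, the raw feature stands in for a trained classifier's score, and a simple per-bin ratio reweighter stands in for hep_ml's `GBReweighter`. All function and variable names are hypothetical.

```python
import random


def weighted_auc(scores, labels, weights):
    """Weighted ROC AUC by pairwise comparison; ties count as half."""
    pos = [(s, w) for s, y, w in zip(scores, labels, weights) if y == 1]
    neg = [(s, w) for s, y, w in zip(scores, labels, weights) if y == 0]
    concordant = 0.0
    for sp, wp in pos:
        for sn, wn in neg:
            if sp > sn:
                concordant += wp * wn
            elif sp == sn:
                concordant += 0.5 * wp * wn
    return concordant / (sum(w for _, w in pos) * sum(w for _, w in neg))


def bin_index(x, lo=-4.0, width=0.5, n_bins=18):
    """Map a value to a histogram bin, clamping outliers to the edge bins."""
    return min(max(int((x - lo) // width), 0), n_bins - 1)


def fit_bin_ratios(original, target, n_bins=18):
    """Toy reweighter (NOT GBReweighter): target/original mass per bin."""
    orig_mass = [0.0] * n_bins
    targ_mass = [0.0] * n_bins
    for x in original:
        orig_mass[bin_index(x)] += 1.0 / len(original)
    for x in target:
        targ_mass[bin_index(x)] += 1.0 / len(target)
    return [t / o if o > 0 else 0.0 for o, t in zip(orig_mass, targ_mass)]


rng = random.Random(0)
original = [rng.gauss(0.0, 1.0) for _ in range(1000)]  # label 0
target = [rng.gauss(1.0, 1.0) for _ in range(1000)]    # label 1

scores = original + target  # assumption: the feature itself is the score
labels = [0] * len(original) + [1] * len(target)

# Step 1: score with the prior weights (all ones here).
prior_weights = [1.0] * len(scores)
auc_before = weighted_auc(scores, labels, prior_weights)

# Step 2: reweight the original sample towards the target.
ratios = fit_bin_ratios(original, target)
new_weights = [ratios[bin_index(x)] for x in original] + [1.0] * len(target)

# Step 3: score again, this time with the new weights.
auc_after = weighted_auc(scores, labels, new_weights)

print(f"weighted AUC before: {auc_before:.3f}, after: {auc_after:.3f}")
```

If the reweighting worked, the weighted AUC after step 3 should move towards 0.5, since the two samples become harder to tell apart. With a real model, steps 1 and 3 would also include training with the corresponding sample weights.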

When comparing the results with those of the reweighter's own classifier (rw.gb), I find that its decrease in weighted AUC is much greater than the one obtained with the independent classifier:

Results before reweighting: classifier AUC = 0.99, rw.gb AUC = 0.99
Results after reweighting: classifier AUC = 0.95, rw.gb AUC = 0.55
Could you help me to identify a possible cause of this difference in behavior?

@arogozhnikov
Owner

Hi @jalvear2dxc,
I'm not completely following which classifiers you are comparing, but the large difference you report is possible.

Naturally, reweighting removes the discrepancies that are picked up by models whose tree configuration (e.g. depth) is similar to that of the reweighter's trees.
If you use a uniforming loss, it may give the classifier an additional hint (though this is hard to predict without studying the data).

Also, check that you use the correct weights in every training and in every AUC scoring. Just in case.
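To illustrate why the weights matter at scoring time, here is a minimal sketch with made-up numbers (the scores, labels and weights are hypothetical, not taken from this issue). It computes the same AUC with and without sample weights using a small pairwise helper; scikit-learn's `roc_auc_score` accepts a `sample_weight` argument for the same purpose.

```python
def weighted_auc(scores, labels, weights):
    """Weighted ROC AUC by pairwise comparison; ties count as half."""
    rows = list(zip(scores, labels, weights))
    pos = [(s, w) for s, y, w in rows if y == 1]
    neg = [(s, w) for s, y, w in rows if y == 0]
    concordant = sum(
        wp * wn * (1.0 if sp > sn else 0.5 if sp == sn else 0.0)
        for sp, wp in pos
        for sn, wn in neg
    )
    return concordant / (sum(w for _, w in pos) * sum(w for _, w in neg))


# Hypothetical classifier outputs for four events.
scores = [0.1, 0.4, 0.8, 0.35]
labels = [0, 0, 1, 1]
# Hypothetical weights after reweighting: one label-0 event is up-weighted.
new_weights = [1.0, 3.0, 1.0, 1.0]

auc_unweighted = weighted_auc(scores, labels, [1.0] * len(scores))
auc_weighted = weighted_auc(scores, labels, new_weights)

print(auc_unweighted)  # 0.75
print(auc_weighted)    # 0.625
```

The same predictions give two different AUC values, so a number computed without the new weights cannot be compared with one computed with them.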

Author

jalvear2dxc commented May 27, 2021

Thanks Mr. @arogozhnikov.

I've dramatically improved the results by not training a new classifier after the reweighting, and instead just correcting the predictions of the first model with the predicted weights. Does that make sense? I think this agrees with what you said in your answer.

@arogozhnikov
Owner

@jalvear2dxc yes, that seems to match what I suggested.
