#rstats #pydata #dataNerdFriendsOfAllFlavors
I'm building a binary classification model, and the data is really unbalanced (less than 2% of the data is in the class I want to predict)
tell me all your tips and tricks for dealing with data like this! What's your favorite model type to use in this case? Do you resample the data to change the balance?