This was a take-home assessment from psuedo-company "Relax, Inc." Defining an "adopted user" as a user who has logged into the product on three separate days in at least one sevenday period, my task was to identify which factors predict future user adoption.
Two .csv files containing the following information:
- A user table with data on 12,000 users who signed up for the product in the past 2 years
- A usage summary table with a row for each day that a user logged into the product
Adopted users were defined and labeled in the users table based on entries in the logins table. I derived some additional features, including the number of users in an organization, how many organization users were active, whether a user was invited by an active user, how many invitations a user had sent to others who signed up, and how many of their invitees were now adopted users. I fit the scaled & encoded data to an XGBoosted Tree Classifier and then found the SHAP values associated with each feature to interpret their influence on the results.
AUROC = 0.63
- Belonging to an organization with a higher percentage of adopted users
- Being invited by an adopted user
- Having an @hotmail.com email address
- Signing up through Google Authentication or through a guest invite
- Being opted in to the mailing list
- Having a @gmail.com email address
- Creating an account for personal projects
- Having an @yahoo.com email address
- Having a higher number of users in their organization
- Being invited through their organization
