In many businesses, identifying which customers will make a purchase (and when) and how much will they spend, is a critical exercise. This is true for both brick-and-mortar outlets and online stores. This project's data is website traffic data acquired from an online retailer.
Data URL: Kaggle Link
The data provides information on customer's website site visit behavior. Customers may visit the store multiple times, on multiple days, with or without making a purchase. The variable
More specifically, I am predicting the transformation of the aggregate customer-level sales value based on the natural log. That is, if a customer has multiple revenue transactions, then the sum of all the revenue generated across all of the transactions, i.e.,:
And then transform this variable as follows:
For this project, I have used
-
$LDA$ for classifying the customer if they will buy something or not. -
$MARS$ for predicting how much they might spend, in terms of$logarithmic$ value.