-
Hi, I've been trying to model customer subscription data using the ShiftedBetaGeoModelIndividual model. However I'm having a hard time getting the model to converge reliably and I'm wondering if it is an input data problem: The documentation says the data input should have:
Lets assume I'm looking over a 12 month period, each month during that time I have a number of customers starting a subscription and a number ending their subscriptions. Am I correct in making the input data like this:
Every example I see has the value T the same for the entire cohort which tells me one is looking at users who all started at the same time, which is not the case for me. I'm trying to, eventually, calculate customer lifetime value (I also have different kinds of users but lets assume I'm looking at only one specific subset). Is this data I'm using correct for my usecase? Any help will be greatly appreciated. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
Hey @ezietsman, Seems the documentation may require clarification, so thanks for bringing this to our attention.
If this seems confusing, it's because active customers are treated as "right-censored" (i.e., their contractual termination date is unobservable). The time durations of inactive customers are crucial for fitting this model. If this seems confusing at first, it'll make more sense after the predictive methods are ran:
|
Beta Was this translation helpful? Give feedback.
Hey @ezietsman,
Seems the documentation may require clarification, so thanks for bringing this to our attention.
T
is simply the maximum number of observed contract renewals in the dataset, andt_churn
is independent of start date. The input data should look like this:If this seems confusing, it's because active customers are treated as "right-censored" (i.e., their contractual termination date is unobservable). The time durations of inactive customers are crucial for fitting this model. If this seems confusing at first, it'…