Churn prediction turns reactive cancel-flow optimization into proactive retention. Instead of waiting for a customer to click cancel, you flag them weeks earlier — while there is still time and budget to intervene. Done well, it is one of the highest-ROI uses of customer data a subscription business can build.
What signals go into a churn model
- Engagement decay — declining email opens, fewer logins, longer gaps between active sessions.
- Subscription actions — pause requests, frequency changes, skipped cycles, swap requests.
- Payment events — failed charges, card updates, dunning emails opened.
- Support friction — tickets opened, sentiment in messages, time-to-resolution complaints.
- Tenure and cohort — month-since-signup, signup channel, signup discount applied.
Common modeling approaches
- Rule-based scoring. The starting point: assign points for risky behaviors (skip + skip + support ticket = high risk). No ML required, surprisingly effective for early-stage operators.
- Logistic regression. Statistical model that outputs a churn probability per customer. Easy to interpret — you can see which features drive the score.
- Gradient boosting (XGBoost, LightGBM). The workhorse of production churn prediction. Higher accuracy than logistic regression, harder to explain to non-technical stakeholders.
- Survival analysis. Models time-to-churn rather than binary outcome. Useful for forecasting cohort revenue, not just flagging at-risk individuals.
What to actually do with predictions
A score is useless without an intervention. The best churn-prediction setups feed directly into automated retention plays: high-risk customers get a personal email from a success manager, a discount offer, a swap suggestion, or a phone call. For most subscription merchants, a simple rule-based model plus a tailored cancel flow recovers more revenue than a sophisticated ML model with no follow-up. Start with the action, then build the model that triggers it. See customer churn modeling for the broader analytical view.