TY - JOUR
T1 - Uplift modeling with quasi-loss-functions
AU - Hu, Jinping
AU - de Haan, Evert
AU - Skiera, Bernd
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media LLC, part of Springer Nature 2024.
PY - 2024/7
Y1 - 2024/7
N2 - Uplift modeling, also referred to as heterogeneous treatment effect estimation, is a machine learning technique utilized in marketing for estimating the incremental impact of treatment on the response of each customer. Uplift models face a fundamental challenge in causal inference because the variable of interest (i.e., the uplift itself) remains unobservable. As a result, popular uplift models (such as meta-learners and uplift trees) do not incorporate loss functions for uplifts in their algorithms. This article addresses that gap by proposing uplift models with quasi-loss functions (UpliftQL models), which separately use four specially designed quasi-loss functions for uplift estimation in algorithms. Using simulated data, our analysis reveals that, on average, 55% (34%) of the top five models from a set of 14 are UpliftQL models for binary (continuous) outcomes. Further empirical data analysis shows that over 60% of the top-performing models are consistently UpliftQL models.
AB - Uplift modeling, also referred to as heterogeneous treatment effect estimation, is a machine learning technique utilized in marketing for estimating the incremental impact of treatment on the response of each customer. Uplift models face a fundamental challenge in causal inference because the variable of interest (i.e., the uplift itself) remains unobservable. As a result, popular uplift models (such as meta-learners and uplift trees) do not incorporate loss functions for uplifts in their algorithms. This article addresses that gap by proposing uplift models with quasi-loss functions (UpliftQL models), which separately use four specially designed quasi-loss functions for uplift estimation in algorithms. Using simulated data, our analysis reveals that, on average, 55% (34%) of the top five models from a set of 14 are UpliftQL models for binary (continuous) outcomes. Further empirical data analysis shows that over 60% of the top-performing models are consistently UpliftQL models.
KW - Causal inference
KW - Heterogeneous treatment effects
KW - Loss function
KW - Uplift modeling
UR - http://www.scopus.com/inward/record.url?scp=85195179742&partnerID=8YFLogxK
U2 - 10.1007/s10618-024-01042-x
DO - 10.1007/s10618-024-01042-x
M3 - Article
AN - SCOPUS:85195179742
SN - 1384-5810
VL - 38
SP - 2495
EP - 2519
JO - Data Mining and Knowledge Discovery
JF - Data Mining and Knowledge Discovery
ER -