Predictive Modelling for Customer Purchase Behaviour: A Logistic Regression Approach Based on Age and Estimated Salary

Selvakumar, S and Yogeshwaramoorthi, K and Jegathambal, P. M. G. (2025) Predictive Modelling for Customer Purchase Behaviour: A Logistic Regression Approach Based on Age and Estimated Salary. International Journal of Engineering and Management Research, 15 (5). pp. 34-43. ISSN 2250-0758

[thumbnail of IJEMR2025150508 (1).pdf] Text
IJEMR2025150508 (1).pdf

Download (469kB)

Abstract

Customer purchase prediction has become a critical requirement in the insurance industry, where
businesses strive to maximize customer acquisition while minimizing marketing costs. Accurate forecasting of whether a potential customer will purchase an insurance policy allows companies to focus on high potential leads and optimize their strategies. In this study, we propose a predictive modelling approach using logistic regression to classify customers based on two key demographic features: Age and Estimated Salary. A dataset of over 1,000 customer records was pre-processed, visualized, and divided into training and testing subsets using an 80:20 ratio. The logistic regression model was trained to identify significant patterns influencing purchase decisions and to estimate the probability of policy adoption. To enhance usability, the trained model was deployed in a Streamlit
based web application that includes secure user authentication, interactive input fields, decision
boundary visualization, and a leaderboard to track predictive outcomes. Experimental results
demonstrate that the logistic regression model achieves an accuracy of approximately 90%, with
strong interpretability through coefficient analysis and decision boundary visualization. This work
highlights the potential of combining machine learning models with lightweight, interactive
applications to support business analysts and decision-makers. The proposed framework offers a scalable, interpretable, and cost-effective solution for insurance companies seeking to strengthen customer targeting. Future work will focus on incorporating additional demographic and behavioral features, applying advanced ensemble models, and integrating large-scale realworld datasets to further enhance prediction performance. Keywords: Logistic Regression, Customer Purchase Prediction, Insurance Analytics, Streamlit Application, Decision Boundary Visualization, Predictive Modeling

Item Type: Article
Subjects: Computer Science Engineering > Deep Learning
Domains: Computer Science Engineering
Depositing User: user 15 15
Date Deposited: 06 Mar 2026 05:01
Last Modified: 06 Mar 2026 05:01
URI: https://ir.vistas.ac.in/id/eprint/13037

Actions (login required)

View Item
View Item