Analysis of Household Energy Consumption Patterns Using K-Means Clustering and Explainable Data Mining

Authors

  • Riley Emerson School of Engineering and Environment, Kingston University, Kingston upon Thames, London, KT1 2EE, UK
  • Genevieve Genevieve School of Engineering and Environment, Kingston University, Kingston upon Thames, London, KT1 2EE, UK

Keywords:

Household Energy Consumption, K-Means Clustering, Explainable Data Mining, Energy Analytics, Unsupervised Learning

Abstract

The increasing demand for household energy has created significant challenges for energy sustainability, resource management, and the development of effective energy efficiency strategies, thereby necessitating advanced analytical approaches to better understand residential consumption behavior. This study aims to analyze household energy consumption patterns using K-Means clustering and explainable data mining techniques. Household energy consumption data were collected from residential users and subjected to preprocessing procedures, including data cleaning, missing value handling, feature selection, and normalization to ensure data quality and analytical reliability. The K-Means clustering algorithm was then applied to identify homogeneous groups of households based on their energy consumption characteristics, while explainable data mining techniques were employed to interpret cluster profiles and determine the factors influencing cluster membership. The results revealed the existence of three distinct household energy consumption groups, namely low-, moderate-, and high-consumption households, each exhibiting significantly different consumption behaviors, appliance ownership levels, and energy expenditure patterns. Further analysis showed that household size, appliance ownership, and peak electricity usage were the most influential factors differentiating the clusters. These findings demonstrate that the integration of K-Means clustering and explainable data mining provides an effective and interpretable framework for understanding household energy consumption behavior. The proposed approach offers valuable insights for utility companies, policymakers, and consumers by supporting targeted energy efficiency programs, demand-side management initiatives, and evidence-based energy policy development aimed at promoting sustainable household energy consumption.

Downloads

Download data is not yet available.

References

Belle, V., & Papantonis, I. (2021). Principles and practice of explainable machine learning. Frontiers in Big Data, 4, 688969.

Burke, T., & Ralston, L. (2015). Household energy use: Consumption and expenditure patterns 1993-2012. Sydney: CRC for Low Carbon Living.

Chadoulos, S., Koutsopoulos, I., & Polyzos, G. C. (2020). Mobile apps meet the smart energy grid: A survey on consumer engagement and machine learning applications. Ieee Access, 8, 219632–219655.

Chu, X., Ilyas, I. F., Krishnan, S., & Wang, J. (2016). Data cleaning: Overview and emerging challenges. Proceedings of the 2016 International Conference on Management of Data, 2201–2206.

Czétány, L., Vámos, V., Horváth, M., Szalay, Z., Mota-Babiloni, A., Deme-Bélafi, Z., & Csoknyai, T. (2021). Development of electricity consumption profiles of residential buildings based on smart meter data clustering. Energy and Buildings, 252, 111376.

Davidson, I. (2002). Understanding K-means non-hierarchical clustering. Computer Science Department of State University of New York (SUNY), Albany.

Deaton, A. (2016). Measuring and understanding behavior, welfare, and poverty. American Economic Review, 106(6), 1221–1243.

Druckman, A., & Jackson, T. (2008). Household energy consumption in the UK: A highly geographically and socio-economically disaggregated model. Energy Policy, 36(8), 3177–3192.

Frederiks, E. R., Stenner, K., & Hobman, E. V. (2015). The socio-demographic and psychological predictors of residential energy consumption: A comprehensive review. Energies, 8(1), 573–609.

Genolini, C., & Falissard, B. (2010). KmL: k-means for longitudinal data. Computational Statistics, 25(2), 317–328.

Gewers, F. L., Ferreira, G. R., Arruda, H. F. De, Silva, F. N., Comin, C. H., Amancio, D. R., & Costa, L. da F. (2021). Principal component analysis: A natural approach to data exploration. ACM Computing Surveys (CSUR), 54(4), 1–34.

Ivosev, G., Burton, L., & Bonner, R. (2008). Dimensionality reduction and visualization in principal component analysis. Analytical Chemistry, 80(13), 4933–4944.

Jenny, A., López, J. R. D., & Mosler, H. (2006). Household energy use patterns and social organisation for optimal energy management in a multi?user solar energy system. Progress in Photovoltaics: Research and Applications, 14(4), 353–362.

Jiménez Betancourt, R. O., González López, J. M., Barocio Espejo, E., Concha Sánchez, A., Villalvazo Laureano, E., Sandoval Pérez, S., & Contreras Aguilar, L. (2020). Iot-based electricity bill for domestic applications. Sensors, 20(21), 6178.

Kavousian, A., Rajagopal, R., & Fischer, M. (2013). Determinants of residential electricity consumption: Using smart meter data to examine the effect of climate, building characteristics, appliance stock, and occupants’ behavior. Energy, 55, 184–194.

Leitao, J., Gil, P., Ribeiro, B., & Cardoso, A. (2020). A survey on home energy management. IEEE Access, 8, 5699–5722.

Llet?, R., Ortiz, M. C., Sarabia, L. A., & Sánchez, M. S. (2004). Selecting variables for k-means cluster analysis by using a genetic algorithm that optimises the silhouettes. Analytica Chimica Acta, 515(1), 87–100.

Loeb, S., Dynarski, S., McFarland, D., Morris, P., Reardon, S., & Reber, S. (2017). Descriptive Analysis in Education: A Guide for Researchers. NCEE 2017-4023. National Center for Education Evaluation and Regional Assistance.

McLoughlin, F., Duffy, A., & Conlon, M. (2015). A clustering approach to domestic electricity load profile characterisation using smart metering data. Applied Energy, 141, 190–199.

Mitra, D., Steinmetz, N., Chu, Y., & Cetin, K. S. (2020). Typical occupancy profiles and behaviors in residential buildings in the United States. Energy and Buildings, 210, 109713.

Oti, E. U., Olusola, M. O., Eze, F. C., & Enogwe, S. U. (2021). Comprehensive review of K-Means clustering algorithms. Criterion, 12(08), 22–23.

Pacini, G. C., Colucci, D., Baudron, F., Righi, E., Corbeels, M., Tittonell, P., & Stefanini, F. M. (2014). Combining multi-dimensional scaling and cluster analysis to describe the diversity of rural households. Experimental Agriculture, 50(3), 376–397.

Pan, S., Wang, X., Wei, Y., Zhang, X., Gal, C., Ren, G., Yan, D., Shi, Y., Wu, J., & Xia, L. (2017). Cluster analysis for occupant-behavior based electricity load patterns in buildings: A case study in Shanghai residences. Building Simulation, 10(6), 889–898.

Rasheed, M. B., Javaid, N., Awais, M., Khan, Z. A., Qasim, U., Alrajeh, N., Iqbal, Z., & Javaid, Q. (2016). Real time information based energy management using customer preferences and dynamic pricing in smart homes. Energies, 9(7), 542.

Ren, Z., Chen, D., & James, M. (2018). Evaluation of a whole-house energy simulation tool against measured data. Energy and Buildings, 171, 116–130.

Shao, J., Tanner, S. W., Thompson, N., & Cheatham, T. E. (2007). Clustering molecular dynamics trajectories: 1. Characterizing the performance of different clustering algorithms. Journal of Chemical Theory and Computation, 3(6), 2312–2334.

Singh, S., & Yassine, A. (2018). Big data mining of energy time series for behavioral analytics and energy consumption forecasting. Energies, 11(2), 452.

Steemers, K., & Yun, G. Y. (2009). Household energy consumption: a study of the role of occupants. Building Research & Information, 37(5–6), 625–637.

Wang, Y., Chen, Q., Hong, T., & Kang, C. (2018). Review of smart meter data analytics: Applications, methodologies, and challenges. IEEE Transactions on Smart Grid, 10(3), 3125–3148.

Wright, A. (2008). What is the relationship between built form and energy use in dwellings? Energy Policy, 36(12), 4544–4547.

Downloads

Published

2026-05-30

How to Cite

Emerson, R., & Genevieve, G. (2026). Analysis of Household Energy Consumption Patterns Using K-Means Clustering and Explainable Data Mining. Jurnal Teknik Informatika C.I.T Medicom, 18(2), 95–108. Retrieved from https://www.medikom.iocspublisher.org/index.php/JTI/article/view/1652