4.7 Article

Analysis of feature matrix in machine learning algorithms to predict energy consumption of public buildings

Journal

ENERGY AND BUILDINGS
Volume 249, Issue -, Pages -

Publisher

ELSEVIER SCIENCE SA
DOI: 10.1016/j.enbuild.2021.111208

Keywords

Public building energy consumption; Machine learning; Building energy dataset; Features importance; Recursive feature elimination

Funding

  1. National Natural Science Foundation of China [51978095]
  2. Big Data Intelligence Platform for Energy Supervision, Operation and Maintenance Evaluation of Public Institutions in Chongqing
  3. 111 Project [B13041]

Ask authors/readers for more resources

Machine learning algorithms were used to analyze energy consumption data of public buildings, identifying the top ten most important features and providing insights for database establishment and big data analysis.
With the development of building information and energy consumption data, machine learning methods are increasingly being used for predicting and analyzing building energy consumption. In this study, based on the actual energy consumption data of 2370 public buildings in Chongqing, we used six machine learning algorithms and recursive feature elimination to analyze the importance of each feature in the dataset. First, it is necessary to establish optimal prediction models for analyzing the importance of features, and XGboost has demonstrated its superiority in terms of accuracy and efficiency. Regardless of the algorithm, the cumulative contribution rate of the top ten features exceeds 80%, and there is an obvious diminishing marginal utility when the number of features continues to increase. The learning algorithms with similar kernels have similarities in judging feature importance. Tree model-based algorithms can achieve a satisfactory performance with fewer features compared to linear kernel-based algorithms. Furthermore, the dataset plays a crucial role in model performance. To achieve professional supervised learning, two conditions need to be considered simultaneously in data collection: the importance of features in physical processes and whether the samples have adequate variance on these features. Thus, this study can provide a reference for database establishment and big data analysis of urban building energy consumption. (c) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available