Building energy performance benchmarking has been adopted by many countries as an effective tool for reducing energy consumption at the city or national level. Machine learning holds considerable promise for predicting energy consumption quickly and accurately from massive datasets, which makes it suitable for large-scale performance assessment. However, many datasets suffer from severe imbalance across building types. Because some building types have few samples, unfavorable results such as low prediction accuracy can occur. Meanwhile, the poor interpretability of machine learning models makes it difficult to promote benchmarking frameworks based on them. This study therefore proposes a novel machine learning based building performance benchmarking framework with improved generalization and interpretability. A reliable and convenient data augmentation approach was established to overcome the data imbalance problem while avoiding overfitting. Superior results were obtained in case studies using three city-level open-source building datasets from two countries. A complete rating framework was also proposed, with explanations of results at the sample level. The performance of this rating framework was verified by comparison with other data-driven benchmarking frameworks. Moreover, the importance of variables was quantified and ranked, which can serve as a significant reference for data collectors and publishers. The results demonstrate that data augmentation can effectively solve the data imbalance problem, making machine learning based benchmarking applicable to all building types, and that the proposed GEIN benchmarking framework can also effectively address the interpretability issue.
Interpretable building energy benchmarking; EUI prediction; machine learning; data augmentation
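1. Research paper | A model-prediction-based scheduling strategy to unlock and optimize building energy flexibility for multi-service electricity markets
2. Research paper | A novel modified lithium chloride solution for three-phase absorption energy storage and its thermophysical properties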