%0 Journal Article %K Building Performance Simulation %K SRG (Simulation Research Group) %K Building decarbonization %K Occupant modeling %K Climate resilience %A Han Li %A Giuseppe Pinto %A Marco Savino Piscitelli %A Alfonso Capozzoli %A Tianzhen Hong %B Engineering Applications of Artificial Intelligence %D 2024 %G eng %P 107701 %R 10.1016/j.engappai.2023.107701 %T Building thermal dynamics modeling with deep transfer learning using a large residential smart thermostat dataset %U https://linkinghub.elsevier.com/retrieve/pii/S0952197623018857 %V 130 %8 04/2024 %! Engineering Applications of Artificial Intelligence %X

Understanding thermal dynamics and obtaining the computational model of residential buildings enable its scaled application in energy retrofits, control optimization and decarbonization. In this paper, we present a deep learning approach to model building thermal dynamics with smart thermostat data collected from residential buildings, with the goal to investigate model generalizability. In the first stage, we developed and compared different Deep Learning architectures including Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM) models and CNN-LSTM to predict indoor air temperature in a multi-step time horizon. In the second stage, we implemented a Transfer Learning (TL) process, which aims to improve the prediction performance on a new set of buildings (targets), exploiting the knowledge of related or similar buildings (sources). Different TL strategies and source model identification methods were investigated. The study showed that the CNN-LSTM performed the best among the architectures compared, with an average Mean Absolute Error (MAE) of 0.26 ◦C for one-hour-ahead (twelve 5-min future steps) predictions. Furthermore, the results showed that freezing the LSTM layer and fine-tuning the other layers of the CNN-LSTM achieved the best performance among four TL strategies, which further improved the performance with respect to a machine learning approach by 10%, and proving the effectiveness and generalizability of the proposed approach. A comparison of three different source model identification methods showed that randomly selecting source models constrained by similar building characteristics can provide good TL performance while retaining simplicity comparing with other quantitative source identification methods.