Aliero, Muhammad S.Pasha, Muhammad F.Smith, David T.Ghani, ImranAsif, MuhammadJeong, Seung RyulMoveh, Samuel2023-11-062023-11-0620221996-1073https://hdl.handle.net/11363/6251https://doi.org/Recent advancements in the Internet of Things and Machine Learning techniques have allowed the deployment of sensors on a large scale to monitor the environment and model and predict individual thermal comfort. The existing techniques have a greater focus on occupancy detection, estimations, and localization to balance energy usage and thermal comfort satisfaction. Different sensors, actuators, and analytic data methods are often non-invasively utilized to analyze data from occupant surroundings, identify occupant existence, estimate their numbers, and trigger the necessary action to complete a task. The efficiency of the non-invasive strategies documented in the literature, on the other hand, is rather poor due to the low quality of the datasets utilized in model training and the selection of machine learning technology. This study combines data from camera and environmental sensing using interactive learning and a rule-based classifier to improve the collection and quality of the datasets and data pre-processing. The study compiles a new comprehensive public set of training datasets for building occupancy profile prediction with over 40,000 records. To the best of our knowledge, it is the largest dataset to date, with the most realistic and challenging setting in building occupancy prediction. Furthermore, to the best of our knowledge, this is the first study that attained a robust occupancy count by considering a multimodal input to a single output regression model through the mining and mapping of feature importance, which has advantages over statistical approaches. The proposed solution is tested in a living room with a prototype system integrated with various sensors to obtain occupant-surrounding environmental datasets. The model’s prediction results indicate that the proposed solution can obtain data, and process and predict the occupants’ presence and their number with high accuracy values of 99.7% and 99.35%, respectively, using random forest.eninfo:eu-repo/semantics/openAccessAttribution-NonCommercial-NoDerivs 3.0 United Statessmart buildingsenergyindooroccupancymachine learningcarbon dioxideNon-Intrusive Room Occupancy Prediction Performance Analysis Using Different Machine Learning TechniquesArticle152312210.3390/en152392312-s2.0-85143756315Q1WOS:000897231400001Q3