Analyzing machine and sensor data is about how to refine data from heating, ventilation, and air conditioning (HVAC) systems using the Cloudera Data Platform, and how to analyze the refined sensor data to maintain optimal building temperatures.
In order to perform analysis, we use the below data as input data:
• [url removed, login to view] – contains the targeted building temperatures, along with the actual (measured) building temperatures. The building temperature data was obtained using Apache Flume. Flume can be used as a log aggregator, collecting log data from many diverse sources and moving it to a centralized data store. In this case, Flume was used to capture the sensor log data, which we can now load into the Hadoop Distributed File System (HFDS).
• [url removed, login to view] – contains the “building” database table. Apache Sqoop can be used to transfer this type of data from a structured database into HFDS.