7.1 Get Overview of Data
A first step is getting an overview of the whole data set and specific series of it.
7.1.1 Load data
Load test data set in a data frame (e.g. from a csv-file):
## Warning: 3 failed to parse.
7.1.2 Names
Show the column headers of a data frame:
## [1] "time" "energyHeatingMeter" "supplyTempHeating"
7.1.3 Structure
Show the structure of the data frame:
## 'data.frame': 22317 obs. of 3 variables:
## $ time : POSIXct, format: "2018-03-03 00:00:00" "2018-03-03 01:00:00" ...
## $ energyHeatingMeter: num 45020 NA NA NA NA ...
## $ supplyTempHeating : num 24.7 24.1 23.7 23.4 31.6 ...
7.1.4 Head/Tail
Show the first and last values:
## time energyHeatingMeter supplyTempHeating
## 1 2018-03-03 00:00:00 45019.81 24.73
## 2 2018-03-03 01:00:00 NA 24.06
## 3 2018-03-03 02:00:00 NA 23.73
## 4 2018-03-03 03:00:00 NA 23.45
## 5 2018-03-03 04:00:00 NA 31.59
## 6 2018-03-03 05:00:00 NA 29.14
## time energyHeatingMeter supplyTempHeating
## 22312 2020-09-17 15:00:00 NA NA
## 22313 2020-09-17 16:00:00 NA NA
## 22314 2020-09-17 17:00:00 NA NA
## 22315 2020-09-17 18:00:00 NA NA
## 22316 2020-09-17 19:00:00 NA NA
## 22317 2020-09-17 20:00:00 NA NA
Note: if you want to show only the first three entries, you can type
head(df,3)
7.1.5 Five number summary
Reveals details about the distribution of the data:
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 18.37 23.35 24.61 25.72 27.49 44.68 362