Chapter 4 Missing values
4.1 Load library
4.2 Load data
4.3 Missing column pattern graph
4.4 Analysis
The graph above shows the missing column pattern. Only two variables, ‘emp_length’ and ‘annual_income’, contain missing values. The pattern shows that:
1. Most of the missing values are in the variable ‘emp_length’.
2. No record is missing both variables at the same time.
3. Records with missing values make up only a small share (about 5%) of all records.
Because so few records are affected, we simply drop the records with missing values in the rest of our analysis.
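Dropping every record that has at least one missing value is often called listwise deletion. A minimal sketch of that step, again in plain Python with made-up toy records and a hypothetical helper name `drop_incomplete`, might look like this:

```python
# Toy records, made up for illustration; None marks a missing value.
records = [
    {"emp_length": 10, "annual_income": 52000},
    {"emp_length": None, "annual_income": 61000},
    {"emp_length": 3, "annual_income": None},
    {"emp_length": 7, "annual_income": 75000},
]
columns = ["emp_length", "annual_income"]


def drop_incomplete(records, columns):
    """Keep only the records in which every listed column has a value."""
    return [
        r for r in records
        if all(r.get(c) is not None for c in columns)
    ]


complete = drop_incomplete(records, columns)
dropped = len(records) - len(complete)
print(f"kept {len(complete)} records, dropped {dropped}")
```

Reporting how many records were dropped makes it easy to confirm that the loss stays close to the small share seen in the missing pattern.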