1 Introduction
Human Resources Analytics is a simulated dataset. The focus is to understand why the best and most experienced employees are leaving the company. We will explore why employees leave prematurely, including experienced ones. A model may also tell us which kinds of employees will eventually leave.
1.1 Variables
The meanings of variables are as follows:
- satisfaction: Employee satisfaction level
- evaluation: Last evaluation
- project: Number of projects
- hours: Average monthly hours
- years: Time spent at the company
- accident: Whether they have had a work accident
- promotion: Whether they have had a promotion in the last 5 years
- sales: Department
- salary: Salary
- left: Whether the employee has left
1.2 Dataset
We split the data by the variable left into two groups, 1 and 0, meaning that the employee left or remained, respectively.
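The grouping described here can be sketched in a few lines. This is a minimal illustration only, assuming the records are available as dictionaries keyed by the variable names listed in 1.1. The rows below are hypothetical toy values, not values from the dataset, and the helper name summarize_by_left is an assumption introduced for this example.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy rows for illustration only; NOT values from the dataset.
rows = [
    {"satisfaction": 0.40, "hours": 150, "left": 1},
    {"satisfaction": 0.10, "hours": 270, "left": 1},
    {"satisfaction": 0.70, "hours": 200, "left": 0},
    {"satisfaction": 0.60, "hours": 190, "left": 0},
    {"satisfaction": 0.80, "hours": 210, "left": 0},
]


def summarize_by_left(records, columns):
    """Group records by the `left` flag; return per-group counts and column means."""
    groups = defaultdict(list)
    for record in records:
        groups[record["left"]].append(record)

    summary = {}
    for flag, members in groups.items():
        summary[flag] = {"n": len(members)}
        for column in columns:
            summary[flag][column] = mean(r[column] for r in members)
    return summary


summary = summarize_by_left(rows, ["satisfaction", "hours"])

# 1 means the employee left, 0 means the employee remained.
labels = {1: "left", 0: "remained"}
for flag in (1, 0):
    print(labels[flag], summary[flag])
```

Comparing the two summaries side by side is one simple way to see which variables differ most between employees who left and those who remained. On the full dataset, the same idea would typically be applied to every numeric variable.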
We can have a ov