Employee Attrition Analysis & Prediction using NLP

by bhoyarpurvi

DATASET - Kaggle uploaded dataset
TABLEAU - Statistical representation of results

EMPLOYEE ATTRITION PROBLEM

ABSTRACT - Employee attrition is one of the key problems organizations face today. Attrition is the gradual reduction in the number of employees through resignation, death, and retirement. When a well-trained, well-adapted employee leaves the organization for any reason, it leaves a gap that human resources personnel find difficult to fill. This study helps explain why attrition occurs, describes the challenges managers face in retaining employees, and suggests some measures for retaining them.

This project is concerned with the problem of employee attrition in the industry. We prepared the dataset manually and analyzed employee reviews. In this capstone project we compare service-based and product-based companies, their attrition rates, and the causes of that attrition. The final result is prepared using data mining, preprocessing, feature extraction, natural language processing on the reviews, and visualization of the abstracted data. The analysis also gives a clear idea of the main reasons for attrition in real life. The prediction model outputs whether an employee will leave the company, depending on factors such as work-life balance, the employee's personal and professional information, company type, etc. Lastly, we used the visualization tool Tableau to visualize the data and show the insights.


Dataset sources - ambitionbox.com, indeed.com, glassdoor.com. The dataset contains more than 600 entries covering different companies and employees, along with their resulting attrition type.
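The comparison of attrition rates between service-based and product-based companies mentioned in the abstract can be illustrated with a minimal sketch in plain Python. The field names `company_type` and `left`, and the sample records themselves, are assumptions made up for illustration; they are not taken from the project's dataset.

```python
from collections import defaultdict

# Hypothetical records: field names and values are illustrative only
# and do not come from the project's dataset.
records = [
    {"company_type": "service", "left": True},
    {"company_type": "service", "left": False},
    {"company_type": "service", "left": True},
    {"company_type": "product", "left": False},
    {"company_type": "product", "left": True},
    {"company_type": "product", "left": False},
    {"company_type": "product", "left": False},
]


def attrition_rate_by_type(rows):
    """Return {company_type: fraction of employees who left}."""
    totals = defaultdict(int)
    leavers = defaultdict(int)
    for row in rows:
        totals[row["company_type"]] += 1
        if row["left"]:
            leavers[row["company_type"]] += 1
    return {kind: leavers[kind] / totals[kind] for kind in totals}


rates = attrition_rate_by_type(records)
for kind, rate in sorted(rates.items()):
    print(f"{kind}: {rate:.0%}")
```

Grouping by company type before dividing keeps the two rates independent of how many entries each group contributes.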


The basic libraries being used for preprocessing the dataset

Pre-processing refers to the transformations applied to our data before feeding it to the algorithm. Data Preprocessing is a technique that is used to convert the raw data into a clean data set.
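As a rough illustration of turning raw review text into a clean data set, the sketch below uses only the Python standard library rather than the kit's own libraries. The `review` field, the stop-word list, and the sample rows are all hypothetical; a real pipeline would likely use a fuller stop-word list and more cleaning steps.

```python
import re

# Tiny illustrative stop-word list (an assumption, not the project's list).
STOP_WORDS = {"a", "an", "and", "the", "is", "was", "to", "of", "in", "for", "it"}


def clean_review(text):
    """Lowercase a review, strip non-letters, and drop stop words."""
    text = text.lower()
    text = re.sub(r"[^a-z\s]", " ", text)  # keep letters and whitespace only
    tokens = [tok for tok in text.split() if tok not in STOP_WORDS]
    return " ".join(tokens)


def preprocess(rows):
    """Drop rows with missing or blank reviews and clean the remaining text."""
    cleaned = []
    for row in rows:
        review = row.get("review")
        if not review or not review.strip():
            continue  # skip missing or empty reviews
        cleaned.append({**row, "review": clean_review(review)})
    return cleaned


# Hypothetical raw rows, made up for this example.
raw_rows = [
    {"review": "Great learning, but the Work-Life balance was POOR!!"},
    {"review": None},
    {"review": "   "},
    {"review": "Salary is low for the workload."},
]
clean_rows = preprocess(raw_rows)
for row in clean_rows:
    print(row["review"])
```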

Kit Solution Source

REAL LIFE SOLUTION:
  • Learning and employee development opportunities
  • Regularly soliciting feedback
  • Competitive pay package compared to other companies
  • Conducting exit interviews
  • Change of department according to calibre and need

For an issue like employee attrition, especially in the IT sector, there is no perfectly optimal solution. Attrition can be reduced but cannot be brought down to zero, because a capable person who has earned it will always pursue better opportunities. Nobody can hold back a person's professional growth just for the company's benefit. Employers should therefore offer good opportunities and work-life balance to their employees.

Natural language processing libraries

It provides a large number of algorithms to build machine learning models. It has excellent documentation that makes it easier to learn. Natural language processing helps computers communicate with humans in their own language and scales other language-related tasks.
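To show how review text can become numeric features for a machine learning model, here is a minimal bag-of-words sketch in plain Python. It is a stand-in technique chosen for illustration, not the kit's actual feature extraction; the function names and sample reviews are hypothetical.

```python
from collections import Counter


def build_vocabulary(documents):
    """Map each distinct token to a column index, in sorted order."""
    tokens = sorted({tok for doc in documents for tok in doc.split()})
    return {tok: idx for idx, tok in enumerate(tokens)}


def bag_of_words(document, vocabulary):
    """Return a count vector for one document over the vocabulary."""
    counts = Counter(document.split())
    vector = [0] * len(vocabulary)
    for token, count in counts.items():
        if token in vocabulary:  # ignore tokens unseen when building the vocabulary
            vector[vocabulary[token]] = count
    return vector


# Hypothetical, already-cleaned reviews made up for this example.
reviews = [
    "poor work life balance",
    "low salary poor growth",
    "good work culture",
]
vocab = build_vocabulary(reviews)
vectors = [bag_of_words(review, vocab) for review in reviews]

for review, vector in zip(reviews, vectors):
    print(review, vector)
```

Each review becomes a fixed-length vector of word counts, which is the kind of input a classifier needs to predict whether an employee will leave.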

Code access

  • © 2022 Open Weaver Inc.