GitHub - ketanraj15/Fake-Job-Prediction-Model: A python based machine learning model,which uses algorithms like the Naive Bayes algorithm and Decision tree classifier algorithm,to predict whether a posted job is fake or real.

Problem

Scammers advertise jobs the same way legitimate employers do — online (in ads, on job sites, and social media), in newspapers, and sometimes on TV and radio. They promise you a job, but what they want is your money and your personal information.

Model Description

Fake-Job-Predictor is a python based machine learning model, which uses algorithms like the Naive Bayes algorithm and Decision tree classifier algorithm, to predict whether a posted job is fake or real. The model is deployed over heroku and can be viewed here.

Code for all the detailed analysis and model building can be viewed here.

Technology Stacks and Libraries

Numpy
Pandas
matplotlib
Imbalanced-learn
wordcloud
Natural Language Toolkit
Multinomial Naive Bayes (scikit-learn)
Decision tree classifier (scikit-learn)
flask

Analysis of Dataset

The Model uses Employment Scam Aegean Dataset (EMSCAD) , which can be viewed here.

Visualizing dataset based on location

Visualizing dataset based on Experience

Visualizing common words used in the dataset using Wordcloud

Model Evaluation

Naive Bayes Approach

In the first approach, Multinomial Naive Bayes,which is one of the most popular supervised learning classifications was used for the analysis of the categorical text data.
Classification Accuracy: 0.8980769230769231
Confusion Matrix:

Decision tree Classifier

In the Second approach, Decision tree classifier was used which classifies inputs by segmenting the input space into regions.
Classification Accuracy: 0.8173076923076923
Confusion Matrix:

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
Images		Images
Fake_job_predictor.ipynb		Fake_job_predictor.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Problem

Model Description

Technology Stacks and Libraries

Analysis of Dataset

Visualizing dataset based on location

Visualizing dataset based on Experience

Visualizing common words used in the dataset using Wordcloud

Model Evaluation

Naive Bayes Approach

Decision tree Classifier

About

Releases

Packages

Languages

ketanraj15/Fake-Job-Prediction-Model

Folders and files

Latest commit

History

Repository files navigation

Problem

Model Description

Technology Stacks and Libraries

Analysis of Dataset

Visualizing dataset based on location

Visualizing dataset based on Experience

Visualizing common words used in the dataset using Wordcloud

Model Evaluation

Naive Bayes Approach

Decision tree Classifier

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages