This project focuses on detecting emotions in real-time video streams using deep learning. A model trained on facial expression images predicts emotions in live video feeds.
- The dataset comprises images of facial expressions categorized into seven emotions: angry, disgust, fear, happy, neutral, sad, and surprise.
- Data is split into two sections: train and validation, each containing subfolders for different emotions.
- The objective is to classify facial expressions into predefined emotion categories.
- Objective: Gain insights into the dataset and understand its structure, features, and distribution.
- Actions:
- Explore the dataset to understand the distribution of facial expressions (a class-count sketch follows this list).
- Analyze the characteristics of images in the dataset, such as resolution and quality.
- Identify any missing or erroneous data that may require preprocessing.
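A minimal exploration sketch, assuming the folder layout described below (`train/` and `validation/` with one subfolder per emotion); the root folder name `images` is a placeholder for your download path:

```python
import os

data_dir = "images"  # placeholder; point this at your dataset root

# Count images per emotion in each split to check class balance.
for split in ("train", "validation"):
    split_dir = os.path.join(data_dir, split)
    print(f"--- {split} ---")
    for emotion in sorted(os.listdir(split_dir)):
        n_images = len(os.listdir(os.path.join(split_dir, emotion)))
        print(f"{emotion:>10}: {n_images} images")
```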
- Data preprocessing involves several steps to prepare the dataset for model training:
- Loading Images: Images are loaded using the `load_img` function from the Keras library (see the loading sketch after this list).
- Converting to Arrays: `img_to_array` converts each loaded PIL image into a NumPy array for further processing.
- Augmentation: `ImageDataGenerator` generates batches of tensor image data with real-time augmentation during training.
- Data Splitting: The dataset is split into training and validation sets for model evaluation.
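A short loading sketch with the two Keras helpers above; the file path is hypothetical, and the 48x48 grayscale size is an assumption commonly used for this dataset:

```python
from keras.preprocessing.image import load_img, img_to_array

# Load one image as 48x48 grayscale and convert it to a NumPy array.
img = load_img("images/train/happy/example.jpg",  # hypothetical path
               color_mode="grayscale", target_size=(48, 48))
arr = img_to_array(img) / 255.0  # shape (48, 48, 1), scaled to [0, 1]
print(arr.shape)
```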
- Objective: Preprocess the dataset to ensure it is suitable for model training.
- Actions:
- Load the image data and convert it into a format compatible with the chosen deep learning framework.
- Perform data augmentation to increase the diversity of the training data and improve the model's generalization (see the generator sketch after this list).
- Split the dataset into training and validation sets to evaluate the model's performance.
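A minimal augmentation-and-splitting sketch with `ImageDataGenerator`; the specific augmentation parameters are illustrative choices, not the project's exact settings:

```python
from keras.preprocessing.image import ImageDataGenerator

# Augment only the training data; validation images are just rescaled.
train_gen = ImageDataGenerator(rescale=1.0 / 255, horizontal_flip=True,
                               rotation_range=10, zoom_range=0.1)
val_gen = ImageDataGenerator(rescale=1.0 / 255)

train_data = train_gen.flow_from_directory(
    "images/train", target_size=(48, 48), color_mode="grayscale",
    class_mode="categorical", batch_size=64)
val_data = val_gen.flow_from_directory(
    "images/validation", target_size=(48, 48), color_mode="grayscale",
    class_mode="categorical", batch_size=64,
    shuffle=False)  # keep order fixed so labels align during evaluation
```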
- The dataset is organized into two main folders: train and validation.
- Each folder contains subfolders corresponding to different emotion categories.
- Images in each subfolder represent facial expressions of the respective emotion category, as illustrated below.
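The Kaggle dataset linked at the bottom unpacks into roughly the following layout (the `images/` root name follows the dataset page):

```
images/
├── train/
│   ├── angry/
│   ├── disgust/
│   ├── fear/
│   ├── happy/
│   ├── neutral/
│   ├── sad/
│   └── surprise/
└── validation/
    └── (same seven subfolders)
```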
- The model architecture consists of a convolutional neural network (CNN) designed to classify facial expressions.
- Various CNN layers, including convolutional, pooling, and fully connected layers, are utilized to learn and extract features from facial images.
- The model is trained using the Adam optimizer with a learning rate of 0.001 over 48 epochs (an illustrative architecture sketch follows this list).
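An architecture sketch in the spirit of the description above, written against a recent Keras; the filter counts and block depth are assumptions, not the exact trained network:

```python
from keras.models import Sequential
from keras.layers import (Conv2D, BatchNormalization, Activation,
                          MaxPooling2D, Dropout, Flatten, Dense)
from keras.optimizers import Adam

model = Sequential([
    # Block 1: learn low-level features from 48x48 grayscale input.
    Conv2D(32, (3, 3), padding="same", input_shape=(48, 48, 1)),
    BatchNormalization(),
    Activation("relu"),
    MaxPooling2D(pool_size=(2, 2)),
    Dropout(0.25),

    # Block 2: deeper features at half the spatial resolution.
    Conv2D(64, (3, 3), padding="same"),
    BatchNormalization(),
    Activation("relu"),
    MaxPooling2D(pool_size=(2, 2)),
    Dropout(0.25),

    # Classifier head over the seven emotion classes.
    Flatten(),
    Dense(128, activation="relu"),
    Dropout(0.5),
    Dense(7, activation="softmax"),
])

model.compile(optimizer=Adam(learning_rate=0.001),
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```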
- Objective: Develop a deep learning model to classify facial expressions in real-time video streams.
- Actions:
- Design a convolutional neural network (CNN) architecture suitable for image classification tasks.
- Experiment with different CNN architectures, activation functions, and optimization algorithms to optimize model performance.
- Train the model using the prepared dataset and evaluate its performance on the validation set.
- load_img: This function loads an image file and returns it as a PIL (Python Imaging Library) image object. It is used to load images from the dataset.
- img_to_array: This function converts a PIL image object into a NumPy array. It is used to convert the loaded images into arrays for further processing.
- ImageDataGenerator: This class generates batches of tensor image data with real-time data augmentation. It is used to perform data augmentation on the image data during training.
- Sequential: This class allows you to build a sequential model layer-by-layer. It is used to create a sequential model for the CNN.
- Conv2D: This class creates a convolutional layer for 2D spatial convolution. It applies a specified number of filters to the input data.
- BatchNormalization: This layer normalizes the activations of the previous layer at each batch. It helps in stabilizing and accelerating the training process.
- Activation: This layer applies an activation function to the output of the previous layer. Common activation functions include 'relu' (Rectified Linear Unit) and 'softmax'.
- MaxPooling2D: This layer performs max pooling operation for spatial data. It reduces the spatial dimensions of the input volume.
- Dropout: This layer applies dropout regularization to the input. It randomly sets a fraction of input units to zero during training to prevent overfitting.
- Dense: This layer implements the operation: output = activation(dot(input, kernel) + bias). It is the standard fully connected layer.
- Model: This class groups layers into an object with training and inference features. It is used to define the model architecture and compile it for training.
- Adam: This optimizer is an extension to stochastic gradient descent. It computes adaptive learning rates for each parameter.
- ModelCheckpoint: This callback saves the model after every epoch if the validation accuracy improves.
- EarlyStopping: This callback stops training when a monitored metric has stopped improving.
- ReduceLROnPlateau: This callback reduces the learning rate when a monitored metric has stopped improving (a wiring sketch for these callbacks follows this list).
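A sketch wiring up the three callbacks; the checkpoint path, monitored metrics, and patience values are illustrative choices:

```python
from keras.callbacks import ModelCheckpoint, EarlyStopping, ReduceLROnPlateau

callbacks = [
    # Save the weights whenever validation accuracy improves.
    ModelCheckpoint("best_model.h5", monitor="val_accuracy",
                    save_best_only=True),
    # Stop if validation loss has not improved for 5 epochs.
    EarlyStopping(monitor="val_loss", patience=5, restore_best_weights=True),
    # Cut the learning rate by 5x after 3 stagnant epochs.
    ReduceLROnPlateau(monitor="val_loss", factor=0.2, patience=3, min_lr=1e-6),
]
```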
- The code starts by importing necessary libraries and setting parameters such as image size, folder path, and target emotions.
- It loads images from the specified folder path using `load_img` and `img_to_array`.
- The dataset is split into training and validation sets using `ImageDataGenerator` and `flow_from_directory`.
- A CNN model architecture is defined using `Sequential` and layers such as `Conv2D`, `BatchNormalization`, and `Activation`.
- The model is compiled with the Adam optimizer and the categorical cross-entropy loss function.
- Callbacks such as `ModelCheckpoint`, `EarlyStopping`, and `ReduceLROnPlateau` are defined to monitor the training process.
- The model is trained using the `fit_generator` function (superseded by `fit` in recent Keras versions), which iterates over the training set for a specified number of epochs.
- Training and validation loss/accuracy curves are plotted using `matplotlib` (a plotting sketch follows this list).
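A minimal plotting sketch, assuming `history` is the object returned by the training call (both `fit_generator` and `fit` return one):

```python
import matplotlib.pyplot as plt

def plot_history(history):
    """Plot training vs. validation loss and accuracy over epochs."""
    fig, (ax_loss, ax_acc) = plt.subplots(1, 2, figsize=(10, 4))
    ax_loss.plot(history.history["loss"], label="train loss")
    ax_loss.plot(history.history["val_loss"], label="val loss")
    ax_loss.set_xlabel("epoch")
    ax_loss.legend()
    # Note: older Keras versions use the key "acc" instead of "accuracy".
    ax_acc.plot(history.history["accuracy"], label="train acc")
    ax_acc.plot(history.history["val_accuracy"], label="val acc")
    ax_acc.set_xlabel("epoch")
    ax_acc.legend()
    plt.tight_layout()
    plt.show()
```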
This code serves as a foundation for building an emotion detection model and can be further extended and optimized for improved performance.
- Training and validation loss/accuracy curves are plotted to evaluate the model's performance.
- The model achieves a test accuracy of 65%, significantly outperforming random guessing (about 14% for seven classes).
- A predictive model is built to classify emotions in real-time video streams (a webcam inference sketch follows this list).
- Key variables such as facial features, expressions, and historical data are considered in predicting emotions.
- Various classifiers, including Logistic Regression, Support Vector Machines, and Decision Trees, are tested to determine the best-performing model.
- The final model achieves a test accuracy of 65%, demonstrating its effectiveness in real-time emotion detection.
- The model's performance is compared to a baseline predictor, showing a significant improvement in accuracy.
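A hedged webcam-loop sketch using OpenCV's bundled Haar cascade for face detection; the checkpoint filename is the hypothetical one from the callbacks sketch, and the label order assumes `flow_from_directory`'s alphabetical class indexing:

```python
import cv2
import numpy as np
from keras.models import load_model

EMOTIONS = ["angry", "disgust", "fear", "happy", "neutral", "sad", "surprise"]
model = load_model("best_model.h5")  # hypothetical checkpoint from training
cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")

cap = cv2.VideoCapture(0)  # default webcam
while True:
    ok, frame = cap.read()
    if not ok:
        break
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    for (x, y, w, h) in cascade.detectMultiScale(gray, 1.3, 5):
        # Crop the face, resize to the model's 48x48 input, and classify.
        face = cv2.resize(gray[y:y + h, x:x + w], (48, 48)) / 255.0
        probs = model.predict(face.reshape(1, 48, 48, 1), verbose=0)[0]
        label = EMOTIONS[int(np.argmax(probs))]
        cv2.rectangle(frame, (x, y), (x + w, y + h), (0, 255, 0), 2)
        cv2.putText(frame, label, (x, y - 10),
                    cv2.FONT_HERSHEY_SIMPLEX, 0.9, (0, 255, 0), 2)
    cv2.imshow("Emotion Detection", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):  # press q to quit
        break
cap.release()
cv2.destroyAllWindows()
```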
- Objective: Assess the performance of the trained model and identify areas for improvement.
- Actions:
- Evaluate the model's accuracy, precision, recall, and F1-score on the validation set (see the evaluation sketch after this list).
- Use techniques such as confusion matrices and ROC curves to analyze the model's performance across different emotion categories.
- Identify any misclassifications or patterns in the model's predictions and refine the model accordingly.
- Implement real-time video processing optimizations for faster inference.
- Extend the model to detect and classify complex emotional states in diverse scenarios.
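A minimal evaluation sketch using scikit-learn, assuming the `model` and the `val_data` generator (created with `shuffle=False`) from the earlier snippets:

```python
import numpy as np
from sklearn.metrics import classification_report, confusion_matrix

val_data.reset()  # start predictions from the first batch
probs = model.predict(val_data, steps=len(val_data))
y_pred = np.argmax(probs, axis=1)
y_true = val_data.classes  # ground-truth labels in generator order

# Per-class precision, recall, and F1-score, plus the confusion matrix.
print(classification_report(y_true, y_pred,
                            target_names=list(val_data.class_indices)))
print(confusion_matrix(y_true, y_pred))
```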
1. Libraries required:
   - Keras
   - TensorFlow
   - pandas
   - scikit-learn
   - NumPy
   - matplotlib
2. Open the `main.py` file.
Dataset link: https://www.kaggle.com/jonathanoheix/face-expression-recognition-dataset