Skip to content

pratikbarjatya/INSAID-Assignment

Repository files navigation

This repository contains INSAID GCDAI curriculum assignments and practice samples.


The repository contains various notebooks that explores

  • The basics of Data Analysis and Data Visualisation with use of Pandas, Numpy, Matplotlib, Seaborn, etc libraries.
  • Covers the Exploratory Data Analysis (EDA)
    EDA

EDAProject

  • What is EDA?

    • EDA is a phenomenon under data analysis used for gaining a better understanding of data aspects like main features of data, variables and relationships that hold between them, identifying which variables are important for our problem
    • Exploratory Data Analysis (EDA) helps in understanding the data sets by summarizing their main characteristics often plotting them visually.
    • Lifecycle of a Data Analysis projects consists of:
  • EDA Methods involve:

    • Table of Contents
      Steps in Data Exploration and Preprocessing:
      • Identification of variables and data types
      • Analyzing the basic metrics
      • Non-Graphical Univariate Analysis
      • Graphical Univariate Analysis
      • Bivariate Analysis
      • Variable transformations
      • Missing value treatment
      • Outlier treatment
      • Correlation Analysis
      • Dimensionality Reduction

Repository Overview