Skip to content

HNHalstead/Genome_Analysis

Repository files navigation

Genome_Analysis

Project Plan

For Reproducibility purposes, the steps completed have been provided in a list below. The files in this repository arranged to reflect this and contain further instruction within each folder. Details about analysis and steps can be found on the wiki along with a more detailed project plan while further instruction on running code can be found in the README.md files contained within each folder.

Purpose:

To determine relevant genes found in E. faecium E745 that play a role in the vancomycin-resistant strain's growth in human serum by looking at differential expression analysis between rich medium and heat-inactivated serum conditions as based off a study by Zhang (Zhang, 2017). Upon completetion, further investigation was done to investigate phage integration. Future studies might include comparison of resistance genes to prophage integration

What data is available?

FASTQ and FASTA files files of DNA, RNA, and Transposons

Analysis Overview

  1. Genome assembly of Pac-Bio DNA reads using Canu
  2. Vizualization of Illumina DNA sequences with FastQC
  3. Trimming of Illumina reads using Trimmomatic
  4. Mapping Illumina reads to Pac-Bio Assembly:
    4.1 Using BWA-MEM to map
    4.2 Converting SAM files to BAM using SAMtools
    4.3 Creating a combined genome assembly from mapping using Pilon
    4.4 QUAST
  5. Annotation of the combined assembly:
    5.1 Using Prokka for structural and functional annotation
    5.2 Using eggNog-Mapper for functional annotation
    5.3 Artemis ART and ACT used for Visualization and Synteny
  6. RNA Mapping:
    6.1 For serum RNA using BWA-MEM and converted to to BAM using SAMtools 6.2 For BHI RNA using BWA-MEM and converted to to BAM using SAMtools
  7. Counting RNA reads:
    7.1 Converting .gbk to .gtf with gbk2gtf.py
    7.2 Counting serum RNA reads using HTseq-count
    7.3 Counting BHI RNA reads using HTseq-count
  8. Differential expression using DEseq2
  9. Interpretation and Other Analysis**

**See Results and Discussion page for the interpretation of DEseq2 and other analysis results.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages