FLISOCHAR (Florida Isolate Characterization)

A Florida BPHL Pipeline to genomically characterize bacterial isolates

Introduction

Flisochar owes its inspiration to FLAQ-AMR, the Florida BPHL's standard pipeline for taxonomic characterization and AMR detection.

The Flisochar's overaching goal is to improve the identification of bactorial isolates using hybrid assembly from short and long-read sequencing data. Short-read and long-read sequences can be respectively from the Illumina MiSeq system and the Oxford Nanopore Technologies. The pipeline is built in Nextflow, and Python is used to develop custom scripts, enabling the parse of output. It comes with singularity container to simplify installation.

Workflow

The current worflow comprises:

Quality Control
De novo genome assembly
Species Identification
Genome Annotation
Detection of Antimicrobial Resistance Genes
Genomic Comparison

Software Tools implemented

Quality control on reads: fastp, longqc
Three genome assemblers: canu, dragonflye, unicycler
Taxonomic classification progams: Kaiju, Kraken, Mash
Genome annotators: bakta, pgap, prokka
Antimicrobial resistance genes marker: AMRFinderPlus
Average nucleotide identity (ANI): pyANI(pgap)

Installation

At the moment, it is meant to clone this repository to your local directory.Clone a directory

Software Requirements

Flisochar requires Python (version 3.6 or higher with the package Pandas installed), Nextlow, Singularity (apptainer) available in your system.

Pgap

Currenly, the installation of pgap is also required. Before installing, we recommend to create a directory path for the installation under your group

mkdir /*/YourGroup/UserName/repos/ncbi/pgap

and cd to it. Set this environment variable <PGAP_INPUT_DIR> to the created path,

export PGAP_INPUT_DIR=/*/YourGroup/UserName/repos/ncbi/pgap/

simply to save everything on your HPC cluster. Note the slash at the end of the previous path(../pgap/) is required, so that all pgap's files are found in that directory. Download the pgap.py file as directed. Change the file into executable mode (chmod +x pgap.py). Then execute the command below on your terminal, and pagap installation will be complete.

./pgap.py --update --taxcheck -D apptainer

Resource Requirements

Before running flisochar, ensure that required computing resources are available. Cores: 28, Memory: 200gb, Time ~ 2:00 hrs for one hybrid (short-read, long-read) bacterial sample

Running Flisochar

Once the pipeline is available on your system (or on HiPerGator), get an interactive run by following these first steps:

export PGAP_INPUT_DIR=/*/YourGroup/UserName/repos/ncbi/pgap/

module load nextflow apptainer

The above two commands and resources may also be written in a job scheduler (sbatch or slurm script) instead.

General Usage

nextflow run flisochar.nf --lreads 'Your_long-read/path/*.fastq.gz' --sreads 'Your_short-read/path/*_{1,2}.fastq.gz' --outdir << your output directory>>

The full usage may be accessed by executing the following command:

nextflow run flisochar.nf --help

Example

Run the pipeline on the test dataset in your working directory using the following command:

nextflow run flisochar.nf --lreads 'flisochar_test_data/LRdata/*.fastq.gz' --sreads 'flisochar_test_data/SRdata/*_{1,2}.fastq.gz' --outdir flisochar_test_out

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
bin		bin
conf		conf
flisochar_test_data		flisochar_test_data
metad		metad
README.md		README.md
flisochar.nf		flisochar.nf
flisochar_module_v02.nf		flisochar_module_v02.nf
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FLISOCHAR (Florida Isolate Characterization)

Introduction

Workflow

Software Tools implemented

Installation

Software Requirements

Pgap

Resource Requirements

Running Flisochar

General Usage

Example

Author: Tassy J. Bazile

About

Releases

Packages

Contributors 2

Languages

BPHL-Molecular/flisochar

Folders and files

Latest commit

History

Repository files navigation

FLISOCHAR (Florida Isolate Characterization)

Introduction

Workflow

Software Tools implemented

Installation

Software Requirements

Pgap

Resource Requirements

Running Flisochar

General Usage

Example

Author: Tassy J. Bazile

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages