-
Notifications
You must be signed in to change notification settings - Fork 3
5. Outputs
π¦${outdir}
β£ π${timestamp}
β β£ π${taxa}
β β β£ π${cluster}
β β β β£ πalns
β β β β£π${timestamp}-${taxa}-${cluster}.{gubbins,snippy}.aln
β β β β£ πdists
β β β β βπ${timestamp}-${taxa}-${cluster}-{accessory,core-snps}_.{poppunk,gubbins,snippy}-{long,wide}.csv
β β β β£ πfigures
β β β β β£π${timestamp}-${taxa}-${cluster}-core-snps_{ML,NJ}.{gubbins,snippy}.microreact
β β β β β£π${timestamp}-${taxa}-${cluster}-core-snps_{ML,NJ}.{gubbins,snippy}.jpg
β β β β βπ${timestamp}-${taxa}-${cluster}-{accessory,core-snps}_dist.{poppunk,gubbins,snippy}.jpg
β β β β£ πsnippy
β β β β β π${timestamp}-${species}-${sample}.tar.gz
β β β β£ πstats
β β β β β π${timestamp}-${species}-${sample}.{gubbins,snippy}.stats
β β β β πtrees
β β β βπ${timestamp}-${taxa}-${cluster}-core-snps_{ML,NJ}.{gubbins,snippy}.nwk
β β β£ πpoppunk
β β β β£ π${timestamp}-${species}-pp_core_NJ.nwk
β β β β£ π${timestamp}-${species}-pp_microreact_clusters.csv
β β β β£ π${timestamp}-${species}-pp_perplexity20.0_accessory_mandrake.dot
β β β β£ π${timestamp}-${species}-pp.microreact
β β β β£ π${timestamp}-${species}-pp-core-acc-dist.txt.gz
β β β β£ π${timestamp}-${species}-pp-jaccard-dist.txt.gz
β β β β π${timestamp}-${species}-pp-clusters.csv
β β β£ πother
β β β β£ πmultiqc_report.html
β β β β πsoftware_versions.yml
β β β π${timestamp}-summary.tsv
β β π${timestamp}-db-info.csv
Results are summarized for all samples included in the analysis. This includes both new samples and those that were stored in your BigBacter database. Subsets of this summary are also saved for each species-cluster.
${timestamp}-summary.tsv
${timestamp}-${species}-${cluster}-summary.csv
Column Name | Description |
---|---|
ID | Sample ID (_T[0-9]+ denotes replicate number) |
STATUS | Sample status (NEW or OLD ) |
QUAL | Sample quality status (PASS or 'FAIL'). Based on values in PER_GENFRAC , PER_HET , and PER_LOWCOV
|
RUN_ID | Run timestamp |
TAXA | Sample taxonomy (same as supplied in the samplesheet via taxa ) |
CLUSTER | Assigned PopPUNK cluster |
ISO_IN_CLUSTER | Total number of isolates in the assigned PopPUNK cluster |
ISO_PASS_QC | Number of isolates in the assigned PopPUNK cluster that passed QC |
MEAN_SNP_DIST_SNIPPY | Mean pairwise SNP distances between the sample and all other isolates in the assigned cluster that passed QC, including the reference (without recombination masked) |
MIN_SNP_DIST_SNIPPY | Minimum pairwise SNP distances between the sample and all other isolates in the assigned cluster that passed QC, including the reference (without recombination masked) |
MAX_SNP_DIST_SNIPPY | Maximum pairwise SNP distances between the sample and all other isolates in the assigned cluster that passed QC, including the reference (without recombination masked) |
STRONG_LINKAGE_SNIPPY | List of samples in the cluster with pairwise SNP differences <= --strong_link_cutoff (without recombination masked) |
INTER_LINKAGE_SNIPPY | List of samples in the cluster with pairwise SNP differences between --strong_link_cutoff and --inter_link_cutoff (without recombination masked) |
MEAN_SNP_DIST_GUBBINS | Mean pairwise SNP distances between the sample and all other isolates in the assigned cluster that passed QC, including the reference (with recombination masked) |
MIN_SNP_DIST_GUBBINS | Minimum pairwise SNP distances between the sample and all other isolates in the assigned cluster that passed QC, including the reference (with recombination masked) |
MAX_SNP_DIST_GUBBINS | Maximum pairwise SNP distances between the sample and all other isolates in the assigned cluster that passed QC, including the reference (with recombination masked) |
STRONG_LINKAGE_GUBBINS | List of samples in the cluster with pairwise SNP differences <= --strong_link_cutoff (with recombination masked) |
INTER_LINKAGE_GUBBINS | List of samples in the cluster with pairwise SNP differences between --strong_link_cutoff and --inter_link_cutoff (with recombination masked) |
LENGTH | Length of the reference genome (bp) |
ALIGNED | Reference positions covered by the sample (bp) |
UNALIGNED | Reference positions not covered by the sample (bp) |
RECOMB | Number of sites in recombinant regions, including variant and invariant sites (bp). |
VARIANT | Number of variant sites detected in the sample, as compared to the reference (bp) |
HET | Number of heterogenous sites (bp) |
MASKED | Number of masked sites (bp) |
LOWCOV | Number of low coverage sites (bp) |
PER_GENFRAC | Percentage of the reference genome covered by the sample |
PER_LOWCOV | |
PER_HET | Percentage of the reference genome with heterogeneous sites |
Visualizations of the PopPUNK database are generated via poppunk_visualise
with the --microreact
option. You can view the .microreact
file at https://microreact.org/upload.
${timestamp}-${species}-pp_core_NJ.nwk
${timestamp}-${species}-pp_microreact_clusters.csv
${timestamp}-${species}-pp_perplexity20.0_accessory_mandrake.dot
${timestamp}-${species}-pp.microreact
Cluster assignments for all samples in the PopPUNK database. Should contain the same information as ${timestamp}-${species}-pp_microreact_clusters.csv
.
${timestamp}-${species}-pp-clusters.csv
Pairwise core/accessory distances and Jaccard distances (same as Mash) produced from the PopPUNK database using sketchlib query
.
${timestamp}-${species}-pp-core-acc-dist.txt.gz
${timestamp}-${species}-pp-jaccard-dist.txt.gz
Whole genome (snippy) or core genome (gubbins) alignment files.
${timestamp}-${taxa}-${cluster}.{gubbins,snippy}.aln
Pairwise distance files in long and wide formats. Included for core SNPs (snippy & gubbins) and accessory distances.
${timestamp}-${taxa}-${cluster}-{accessory,core-snps}_.{poppunk,gubbins,snippy}-{long,wide}.csv
Static image files (.jpg) and Microreact files of pairwise distance matrices and/or phylogenetic trees.
π${timestamp}-${taxa}-${cluster}-core-snps\_{ML,NJ}.{gubbins,snippy}.jpg
π${timestamp}-${taxa}-${cluster}-{accessory,core-snps}\_dist.{poppunk,gubbins,snippy}.jpg
Note
Static images: new samples are shown in bold, black text. Historic samples are shown in grey, plain text. Asterisks on nodes indicate bootstrap values < 70%.
Tar files containing all the info needed to perform SNP analysis using snippy-core
.
${timestamp}-${species}-${sample}.tar.gz
Sample statistics for snippy and gubbins.
${timestamp}-${species}-${sample}.{gubbins,snippy}.stats
Maximum likelihood or neighbor joining trees generated from Snippy or Gubbins core SNP alignments.
${timestamp}-${taxa}-${cluster}-core-snps\_{ML,NJ}.{gubbins,snippy}.nwk \
Snapshot of your BigBacter database. This is only generated if --db_info
is true
.
${timestamp}-db-info.csv \