Skip to content

FASTA Files

ikb6 edited this page Sep 15, 2020 · 4 revisions

A FASTA file represents one or more DNA sequences. It is a simple text format, composed of a greater than sign (>) followed by an identifier, followed by a nucleotide string (actg) on a new line.

For example:

> ID_of_person_A
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAARATAGGRGGGCA
ACTAAAGGAAGCTCTACTAGATACAGGAGCAGATGATACAGTATTAGAAGAAC
TRAGTTTACCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTT
ATCAAAGTAAGACAGTATGATCAGGTAKCCATAGAAATCTGTGGGCATAAAGC
TGTAGGTACAGTATTAGTAGGACCTACACCAGTCAACATAATTGG
> ID_of_person_B
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAARATAGGRGGGCA
ACTAAAGGAAGCTCTACTAGATACAGGAGCAGATGATACAGTATTAGAAGAAC
TRAGTTTACCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTT
ATCAAAGTAAGACAGTATGATCAGGTAKCCATAGAAATCTGTGGGCATAAAGC
TGTAGGTACAGTATTAGTAGGACCTACACCAGTCAACATAATTGG

The file extension varies (.FASTA, .FAS, .FA). Any file that doesn't have a csv file extension will be parsed as a FASTA file.

Clone this wiki locally