Phylip

From BITS wiki
Jump to: navigation, search


Phylip format is a format to store sequence alignments. Phylip stands for PHYLogeny Inference Package. A package of programs for various computer platforms to infer phylogenies or evolutionary trees, freely available from the Web.

The basic phylip format data file consists of a file where the first line contains two numbers, the number of taxa, and the number of characters. There may also be some additional characters on this first line, which are used to control the behavior of the program (for example, if the data are interleaved, this must be indicated on the first line). Following the first line is the data matrix. The first ten characters (exactly!) of each line are the taxon name -- if the name is less than ten characters just add spaces. The taxon names are followed by data, i.e., nucleotide of amino acid sequences.

Example

7 50 
thermotogaATGGCGAAGGAAAAATTTGTGAGAACAAAACCGCATGTTAACGTTGGAAC
TthermophiATGGCGAAGGGCGAGTTTGTTCGGACGAAGCCTCACGTGAACGTGGGGAC  
TaquaticusATGGCGAAGGGCGAGTTTATCCGGACGAAGCCCCACGTGAACGTGGGGAC 
deinonema-ATGGCTAAGGGAACGTTTGAACGCACCAAACCCCACGTGAACGTGGGCAC  
ChlamydiaBATGTCAAAAGAAACTTTTCAACGTAATAAGCCTCATATCAACATAGGGGC 
flexistipsATGTCCAAGCAAAAGTACGAAAGGAAGAAACCTCACGTAAACGTAGGCAC 
borrelia-bATGGCAAAAGAAGTTTTTCAAAGAACAAAGCCGCACATGAATGTTGGAAC