From BITS wiki
Jump to: navigation, search

The GenBank sequence format is a rich format for storing sequences and associated annotations. It shares a feature table vocabulary and format with the EMBL and DDJB formats. NCBI provide a more detailed example.

other formats


LOCUS       CAA89576                 109 aa            linear   PLN 11-AUG-1997
DEFINITION  CYC1 [Saccharomyces cerevisiae].
VERSION     CAA89576.1  GI:1015707
DBSOURCE    embl locus SCYJR048W, accession Z49548.1
SOURCE      Saccharomyces cerevisiae (baker's yeast)
  ORGANISM  Saccharomyces cerevisiae
            Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes;
            Saccharomycetales; Saccharomycetaceae; Saccharomyces.
REFERENCE   1  (residues 1 to 109)
  AUTHORS   Huang,M.E., Chuat,J.C. and Galibert,F.
  JOURNAL   Unpublished
REFERENCE   2  (residues 1 to 109)
  TITLE     Direct Submission
  JOURNAL   Submitted (25-SEP-1995) Data collected by MIPS on behalf of the
            European yeast chromosome X sequencing project. MIPS at the
            Max-Planck-Institut fuer Biochemie, Am Klopferspitz 18a D-82152
            Martinsried, FRG; E-mail: Mewes@mips.embnet.org
FEATURES             Location/Qualifiers
     source          1..109
                     /organism="Saccharomyces cerevisiae"
     Protein         1..109
     CDS             1..109
                     /note="ORF YJR048w"
        1 mtefkagsak kgatlfktrc lqchtvekgg phkvgpnlhg ifgrhsgqae gysytdanik
       61 knvlwdennm seyltnpkky ipgtkmafgg lkkekdrndl itylkkace

This file format can be parsed by the BioPerl Bio::SeqIO system using the Bio::SeqIO::genbank module.