NGS-Var2017 Exercise.6

From BITS wiki
Jump to: navigation, search


[ Main_Page | Hands-on_introduction_to_NGS_variant_analysis-2017 | NGS-Var2017 Exercise.5 | NGS-Var2017 Exercise.7 ]


Annotate and filter VCF variant lists with SnpEff and SNPSift


SnpEff_logo.png
SnpSift_logo.png

Choose the right tool to enrich your VCF data

A growing number of tools are available to annotate and select from VCF files. The choice of the best tool for your application depends on several factors.

  • when you need the job done and do not worry about the flexibility, we advise to use SnpEff and the companion SnpSift which are both easy to use java programs.
  • if you wish to add annotations from third-party databases that are not present in the other tools, or if you work on a organism absent from the above tool, you may consider using Annovar that was included in our former training session ([1]).
  • when you only need to annotate a few VCF rows, you are welcome to use public servers like:
    • the EnsEMBL VEP server (introduced in our related Wiki page [2])
    • the UCSC VAI ([3])
    • the SeatleSeq server ([4])

Technical.png Submitting 'patentable' information to the WEB infringes the novelty clause and will expose patient information to the internet, and the size of input is limited to few 100's lines

  • other tools have been used with success like vcfCodingSnps ([5])

 

Annotate your variants with SnpEff

  • start the SnpEff module and link to the intersect VCF file with varscan as first set
ex6_01.png
  • run the tool and wait for results
ex6_02.png
  • open the result file ending with .html
ex6_03.png
  • read through the extensive report and get the idea
ex6_04.png
ex6_05.png
ex6_06.png

Filter and select relevant data from a VCF file with SnpSift

SnpEff has added a large number of annotations and scores which allow us filter the data and find loci of interrest based on our assumptions. Filtering is done using SnpSift, the companion tool of SnpEff

  • start the SnpSift module and connect the annotated VCF file
ex6_07.png
  • type a filtering query and run
ex6_08.png
  • review the results by opening the resulting VCF (works only if small enough, else will be downloaded to your computer)

Find out more about the syntax for queries on the SnpSift page

example queries:

  • ANN[0].EFFECT has 'missense_variant'
ex6_09.png
  • (ANN[*].EFFECT has 'missense_variant') && (ANN[*].GENE = 'TPTE')
ex6_10.png
  • ANN[*].IMPACT = 'HIGH'
ex6_11.png
  • ( CHROM = 'chr21' ) && ( POS > 1234567 ) && ( POS < 1235567 )"
ex6_12.png

Handicon.png Many more and diverse operations (and complex combinations thereof) can be done using this tool

 

download exercise files

Download exercise files here

Use the right application to open the files present in ex6-files

References:

[ Main_Page | Hands-on_introduction_to_NGS_variant_analysis-2017 | NGS-Var2017 Exercise.5 | NGS-Var2017 Exercise.7 ]