NGS-Var2017 Exercise.1

From BITS wiki
Jump to: navigation, search


[ Main_Page | Hands-on_introduction_to_NGS_variant_analysis-2017 | NGS-Var2017 Exercise.2 ]


QC paired end reads using fastQC


ex01_wf.png

You should have learned how to use FastQC during the Introductory training session. We now apply it to the sample data for the sake of time and to build this as a reflex in your Trainee's mind.

Apply FastQC to the FastQ data

You usually receive FastQ data from your sequence provider. It is always good to inspect it prior to starting analyses as it may be biased by artefacts and would in that case need cleaning.

What if you got a unique fastq file containing interleaved paired reads?

You can easily convert the unique fastQ file to two files, one for each read of a pair by using a number of tools found on the web. We recommend a simple bash tool developed by Nathan S. Watson-Haigh that will do the job very fast and can be found HERE; Just copy paste the code to a unix terminal and give it your fastq as input.

Handicon.png We use the fastQC GUI version to do this and will return to GenePattern after that

review fastQC results

The FastQC tool stores all results in a html file and linked pictures. You can review these results after decompressing the zip files created above. The QC results for the first read in pair data ares used to exemplify this.

You should get something like this (read1, read2).


shuffled_PE_NA18507_GAIIx_100_chr21_2_1_fastqc.png

 

Handicon.png Have a look at https://sequencing.qcfail.com/software/fastqc/ to find examples of QC failures

download exercise files

Download exercise files here

Use the right application to open the files present in ex1-files

References:

[ Main_Page | Hands-on_introduction_to_NGS_variant_analysis-2017 | NGS-Var2017 Exercise.2 ]