Exercises on primer design

From BITS wiki
Jump to: navigation, search
Go to parent QPCR analysis using qbasePLUS

The following exercise will make you familiar with the Primer3Plus software for designing primers for PCR. Primer3Plus is the user-friendly version of Primer3, the standard software for primer design.

Criteria for qPCR primers

Primers for qPCR have to follow all the gudelines for regular primers is and an additional set of rules specific for qPCR primers:

  • qPCR products are small: 80-160 bp
  • use intron or exon-exon junction spanning primers to detect genomic DNA contamination in the RNA samples.
    Primers of intron spanning primer pairs are located at both sides of an intron and will therefore generate a larger product on genomic DNA (containing the intron).
    Primer pairs containing an exon-exon junction spanning primer will not generate a PCR product on genomic DNA since the exon-exon junction only exist in the cDNA.
  • primer length between 9 and 30 bp with an optimum at 20 bp
  • melting temperature (Tm) of the primers between 58 and 60°C with an optimum at 59°C
  • maximum Tm difference between the primers of a pair: 2°C
  • GC content of the primers between 30 and 80% with an optimum at 50%
  • the 5 nucleotides at the 3' end of the primers should have no more than 2 G or C bases
  • avoid runs of 4 or more identical nucleotides (especially Gs)
  • primers must specifically target the region you want to amplify


There are many programs for designing primers, the most important ones:

The major downside of Primer3 and Primer3Plus is the fact that you have to check the specificity of the primers yourself. Primer3 will suggest a number of primer pairs that fulfill all of the above requirements, but Primer3 will not check the specificity of the primers. So you have use BLAST to check the specificity of the suggested primer pairs. Very often, the selected primers are not specific and you have to repeat the entire Primer3 analysis.

If you use Primer3 and do the BLAST yourself, BLAST against Refseq sequences unless they are not available for the organism you work with or you have reasons to believe that they are not complete (i.e. they do not represent the full genome). For model organisms, you can BLAST against the Refseq database. Limit the database to sequences from the organism you work with.

Additionally, it is especially important to check that the primers are specific at the 3' end because that's the site where the polymerase will attach nucleotides. So it is recommended to not use primers that contain long identical stretches (> 15nt for primers of 20nt long) to other regions in the genome, and certainly not if these stretches comprise the last nucleotide at the 3' end of the primer.

For these exercises we will use PrimerBLAST since it uses the same algorithm to pick primers as Primer3 [4] and does the specificity check for you!

Designing qPCR primers for the fruit fly tap gene

Designing qPCR primers using PrimerBLAST

The RefSeq entry NM_079400 contains the sequence of the D. melanogaster mRNA coding for tap, the target of Poxn. Tap encodes a bHLH protein expressed in larval chemosensory organs and involved in the response to sugar and salt. We wish to amplify the region encoding the Helix-loop-helix domain. In the sequence of the RefSeq record, the domain is located between position +577 and +745.

We want to design qPCR primers for measuring the expression level of the hlh domain using SYBR green. Remember that it is advised to design intron/exon-exon junction spanning primers for qPCR experiments that are based on fluorescent labels to detect/avoid amplification of contaminating genomic DNA.

Next, we will design primers to measure the expression of the hlh domain.

Since you want to measure the expression of the hlh domain you want primers that are located inside the domain.

The PrimerBLAST automatically decides to check primer specificity in the Drosophila (organism ID: 7227) RefSeq mRNA database which is exactly what you want. For the qPCR you are going to use RNA samples from fruitfly. This means that the primers will only come into contact with Drosophila mRNAs so you only have to check their specifity in this database.

The PrimerBLAST gives you a set of 9 primer pairs that are specific (according to the criteria that you have specified) and that fulfill all other requirements that you have defined. Look at the detailed report of the first primer pair:


PrimerBLAST11.png

All parameters are quite self-explanatory except for the Self complementary and Self 3'complementarity scores.

  • The first score represents the local alignment score when aligning a primer to itself. The scoring system gives 1.00 for a match, -1.00 for a mismatch. This means that the lower the score (the more mismatches), the less likely that the primer binds to itself.
  • The second score represents the global alignment score when aligning a primer to itself. Here again, the lower the score, the better.
The scores are followed by information on the specificity of the primer: alignments of the two primers to all target sequences from the database that match the criteria that you specified. In these alignments dots represent matching nucleotides while letters represent mismatches. A specific primer pair will have two alignments (one for each primer): both perfect alignments (all dots) to the sequence you want to amplify.

Analyzing primer characteristics using OligoAnalyzer

OligoAnalyzer is a tool implemented by ID&T (who sell primers) to check the characteristics of your primers. Take the first primer that is suggested by Primer-BLAST, the pair resulting in a product of 100bp.

As you can see the predicted melting temperature is 63.9 ºC, which is slightly different from the prediction made by BLAST. There are many different methods to predict Tm and each method will give a different result. Assumed concentrations of primers and ions have an enormous impact on the Tm prediction. So don't worry about these differences: these are theoretical calculations anyway, the only way to determine Tm values is by doing actual PCR. As long as the difference in Tm between the two primers is not too large, everything is fine.

Remember that the second primer had a large Self complementarity score according to PrimerBLAST.

You see that the highest scoring alignment indeed has 6 matches, giving a score of 6 as predicted by PrimerBLAST.


OligoAnalyzer5.png

It is advised to review all possible interactions between primers so both Self-Dimer (primers binding to themselves) and Hetero-Dimer (primers binding to each other) interactions between primers are examined.

Take a look at the second primer pair that was suggested by PrimerBLAST.


Designing qPCR primers for the human F9 gene

Designing qPCR primers using PrimerBLAST

The RefSeq entry NM_000133.3 contains the sequence of the human mRNA coding for coagulation factor F9. The gene contains 8 coding exons and gives rise to a transcript of 2780 bp encoding a protein of 461 amino acids.

Next, we want to design primers to measure the expression of the F9 gene.

Go to the RefSeq record of this transcript to study its structure. When you scroll down to the features section you see that the CDS is located from position 40 to position 1415.

Since RNA degradation starts at the 5'end of transcripts, we don't want to pick primers at the 5'end. On the other hand, we don't want to pick primers in the long 3'UTR either because it doesn't contain any introns (the exons are all coding) and we want to design exon-exon junction or intron spanning primers.

Let's try to find exon-exon junction spanning primers between position 400 and 1600, with optimal anneal temperature = 60.

Exclude predicted sequences in the database to search in .

The PrimerBLAST gives you a set of 10 primer pairs. Look at the detailed report of the first primer pair:


qPCRprimer3c.png

As you can see the primers are not specific: they can bind to various other targets albeit with lower affinity because of the mismatches .
The best option seems to be primer pair 7, which binds to both F9 transcript variants and potentially to one unintended target, but as you can see the last nucleotide at the 3' end of both primers are specific.

In silico PCR in the UCSC Browser

We will proceed using the third primer pair Primer-BLAST suggests.

You can visualize the PCR product (and additional annotation) in the UCSC Genome Browser using UCSC's In Silico PCR tool.

Select the most recent version of the human genome and paste the sequences of forward and reverse primers in their respective boxes. Click submit

Normally, this returns the location and the sequence of the PCR product but our primer pair doesn't return a match. When you think about this was to be expected since we are working with exon-exon junction spanning primers that are not able to match the genome sequence. So checking SNPs is not so straight-forward in the case of exon-exon junction spanning primers.

We will repeat the primer search now searching for intron-spanning primers to show you how to use the in silico PCR tool. Taking into account the fact that the results for the exon-exon junction spanning primers were so messy we will make the search more stringent this time:

  • We will the minimum number of mismatches to 4
  • and at least 3 mismatches in the last 3 bps at the 3'end

Primer-BLAST returns 10 primer pairs. Again the seventh primer pair is the specific one.





References:
  1. http://frodo.wi.mit.edu/
  2. http://primer3plus.com/cgi-bin/dev/primer3plus.cgi
  3. http://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome
  4. http://www.ncbi.nlm.nih.gov/tools/primer-blast/primerinfo.html