Primer Design

From BITS wiki
Jump to: navigation, search

Selecting primers for PCR

*Designing regular PCR primers using Primer3

The RefSeq entry NM_079400 contains the sequence of the D. melanogaster mRNA coding for tap, the target of Poxn. Tap encodes a bHLH protein expressed in larval chemosensory organs and involved in the response to sugar and salt. We wish to amplify by PCR the region coding for the HLH motif.

This location applies to the protein sequence so it's expressed in amino acids.

The Primer3 program can help to select suitable primers.

Go to Primer3 [1] or use the nicer interface at Primer3Plus [2] (in the solution below Primer3Plus was used).

Handicon.png Note the many configurable parameters of Primer3!

Note the Product Size Ranges box: given these settings Primer3 will first try to find primer pairs that amplify a product with length between 500 and 600 and if this is not possible it will go for a larger product etc...
This is, however, not what we want: we want to amplify the hlh region only so we aim for a much smaller product.

Click Pick Primers


Primer3e.png

In the results page you can see the following:
The program has selected 5 primer pairs, with the best at the top.
For each primer you can find:

  • the location on the sequence where the first base of the primer binds (for the forward/left primer this is on the complementary strand)
  • its length
  • its melting temperature
  • its %GC
  • the score of an alignment of the primer with the other primer and with itself

The program first considers all possible primers and retains those that satisfy a series of criteria, then considers all possible pairs of these primers and retains those who satisfy a series of criteria. At the bottom of the page you can see information about how many primers were considered and rejected/retained.

This leads to a form for further evaluating the primers.

The primers are sent to a blastn form allowing you to iniate a BLAST search to find potential targets of the primers. By default the nr database is selected. Of course you are only interested in targets in the organism you are working with. Hits in other organisms will not influence your PCR.

Look at the 9 highest scoring hits for the forward primer.

So it's always a better idea to Blast against Refseq sequences unless they are not available for the organism you work with or you have reasons to believe that they are not complete (i.e. they do not represent the full genome). However, for model organisms, full chromosome sequences are available so you can use Refseq for BLAST.

To go to the results of the reverse primer change Results for at the top of the page


BLAST34.png

When evaluating the specificity of the primers, it is especially important to check that the primers are specific at the 3' end because that's the site where the polymerase will attach nucleotides.
So generally, it is recommended to not use primers that contain long identical stretches (>15 nt for primers of 20 nt long) to other regions in the genome apart from the one you want to amplify/measure, and certainly not if these stretches comprise the last nucleotide at the 3' end of the primer.

*Designing regular PCR primers using PrimerBLAST

It's a lot of work to design primers using Primer3/CLC Main Workbench especially if you discover during the BLAST that the suggested primers are not specific and you have to start all over again.

Therefore, we will design primers using the PrimerBLAST tool [3]. PrimerBLAST uses the Primer3 algorithm to find primers allowing you to set all the parameters related to effciency of the primers (e.g. secondary structure and dimer formation, Tm, difference in Tm...). On top of this, PrimerBLAST uses the BLAST algorithm to find specific primers, that will only bind to the intended target. It is the only tool that incorporates BLAST in the primer search.

For some applications, it is vital to have specific primers. This is for instance the case when you use PCR for quantification (qPCR) or when you want to amplify a commonly occurring sequence from the genome.

As an example, we will design primers to amplify a region of RefSeq entry NM_079400, which represents the D. melanogaster mRNA coding for tap. Tap encodes a bHLH protein expressed in larval chemosensory organs and involved in the response to sugar and salt. We wish to amplify the region encoding the Helix-loop-helix domain. In the sequence of the RefSeq record, the domain is located between position +577 and +745.

So, let's go to the PrimerBLAST page.

Remember that PrimerBLAST also allows you to set the parameters related to specificity of the primers.

Only primers that fulfill the specificity criteria that you have set will be shown.

Expand the Advanced parameters section:

PrimerBLAST4b.png

This shows a set of parameters concerning primer specificity. I normally keep these at their default settings. If you want a more stringent search you can decrease the Blast word size. This is the minimum number of consecutive nucleotides that have to exactly match between the primer and a sequence from the database to consider it a hit. The lower you set it, the faster BLAST will consider a sequence from the database a hit.

After that follow the Primer parameters. At the top you have a set of general parameters that are self-explanatory.


PrimerBLAST4c.png

Then parameters concerning the likelihood that the primers will form hairpins and dimers. By default Primer-BLAST uses secondary structure alignment to predict these characteristics but you can choose more advanced and accurate thermodynamic models. Calculations will be slow when you choose these models. Basically, instead of looking at individual nucleotide matches you use the stability of nucleotide matches, e.g. the stability of CT hybridized to GA is different from that of CA hybridized to GT.

The scores for self and pair complementarity in the default secondary structure model are local alignment scores: the scoring system uses 1.00 for complementary bases, -1.00 for a mismatch, and -2.00 for a gap. So a score of 8 corresponds to 8 consecutive matches or 9 aligned nucleotides with 8 matches and 1 mismatch...


PrimerBLAST4d.png

The tool will first ask you to indicate the intended target. Thereafter it will return 5 specific high quality primer pairs:


PrimerBLAST3b.png

Knowing that PrimerBLAST uses the same algorithm to pick primers as Primer3 [4], PrimerBLAST will save you a lot of work and time when you have to design primers.

Designing primers for amplifying microbial genomes for sequencing using Primer Blast

In Ugene get 20 complete norovirus genomes from NCBI
Tip:

  • try searching the “nucleotide” database, using norovirus in the Organism field AND “complete genome” in All Fields
  • select all 20 results and Download in fasta format
  • select all sequences (the shift key is your best friend), right click and export as norovirus.fa

In Ugene make a MSA of these sequences (they're long so ClustalO might be a good choice) and save as norovirus.aln
select the consensus, right click and Copy consensus with gaps

Go to Primer-Blast and paste the consensus in the PCR template box. We want to see 10 primer combinations that generate a product of 1000-2000 bp.

Search in the Genomes (chromosomes from all organisms) database and make sure the primers do not amplify human DNA


Designing primers for qPCR

Exercises on qPCR primer design can be found on our primer design wiki page.

Analyzing primer characteristics using OligoAnalyzer

OligoAnalyzer is a tool implemented by ID&T (who sell primers) to check the characteristics of your primers.

Predicted melting temperature might be slightly different depending on the tool used. There are many methods to predict Tm and each method will give a different result. Concentrations of primers and ions have an enormous impact on the Tm prediction. Don't worry about the differences: the only way to determine Tm values is by doing actual PCR. Just make sure the difference in Tm between the two primers is small.

Complementarity will only cause problems when located at the 3' end of the primer. Since polymerases add bases at the 3’ end of the primer, the primer dimer can only be extended when it occurs at the 3'end of the primer giving rise to aspecific products. ID&T recommends to avoid complementary stretches of more than 2 bp at the 3’ end.

However, even if the primer dimer cannot be extended, it could interfere when its formation competes with the annealing of primer and target. This is only the case when the stability of the dimer is similar to the stability of a perfectly matched primer-target duplex. The stability of the perfectly matched duplex is shown as a Maximum Delta G at the top of the results. So non-extendable dimer structures that are much shorter than the intended duplex are not going to disrupt the PCR reaction.

It is advised to review all possible interactions between primers so both Self-Dimer (primers binding to themselves) and Hetero-Dimer (primers binding to each other) interactions between primers are examined.

Again, it's only problematic when you have a stretch of 3 matching nucleotides at the 3'end of one of the primers.

If you have complementarity at the 3'end of your primers you might consider taking a look at the other pairs of primers that Primer3 or PrimerBLAST suggests.


Obtaining the reverse complement of a sequence

Use this tool to obtain the reverse complement of a sequence.

In silico PCR in the UCSC Browser

You can visualize a PCR product (and additional annotation) in the UCSC Genome Browser using the In Silico PCR tool.

Select the most recent version of the human genome and paste the sequences of forward and reverse primers in their respective boxes. Click submit

This returns the location and the sequence of the PCR product:

qPCRprimer7b.png

Clicking the link of the location visualizes the PCR product in the UCSC genome browser. If necessary you can emove unnecessary tracks by right clicking the tack's name and selecting hide.


qPCRprimer8b.png

if necessary, add tracks showing relevant annotations like positions of SNPs by scrolling down to the Variation section. Select the tracks you want to visualize and click refresh.


qPCRprimer9.png

Setting the SNPs track from hide to full or dense shows the SNPs in the browser. Zoom in on the primer in the same way as in the Ensembl Genome Browser.


qPCRprimer11.png

You will often see that a primer matches SNPs but as long as they are not located near the 3'end of the primer it does not pose a problem.



References:
  1. http://bioinfo.ut.ee/primer3/
  2. http://primer3plus.com/cgi-bin/dev/primer3plus.cgi
  3. http://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome
  4. http://www.ncbi.nlm.nih.gov/tools/primer-blast/primerinfo.html