|Year : 2011 | Volume
| Issue : 1 | Page : 19-29
Mining of simple sequence repeats in the Genome of Gentianaceae
R Sathishkumar1, P T.V Lakshmi2, A Annamalai3, V Arunachalam4
1 Phytomatics Laboratory, Department of Bioinformatics, Bharathiar University, Coimbatore, Tamil Nadu, India
2 Phytomatics Laboratory, Department of Bioinformatics, Bharathiar University, Coimbatore, Tamil Nadu;Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, India
3 Plant Cell and Molecular Biology Laboratory, Department of Biotechnology, Karunya University, Coimbatore, Tamil Nadu, India
4 Molecular Biology and Bioinformatics Laboratory, Central Plantation Crops Research Institute, Kasaragod, Kerala, India
|Date of Submission||29-Jul-2010|
|Date of Decision||25-Sep-2010|
|Date of Web Publication||7-Apr-2011|
P T.V Lakshmi
Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry - 605 014
Source of Support: None, Conflict of Interest: None
| Abstract|| |
Simple sequence repeats (SSRs) or short tandem repeats are short repeat motifs that show high level of length polymorphism due to insertion or deletion mutations of one or more repeat types. Here, we present the detection and abundance of microsatellites or SSRs in nucleotide sequences of Gentianaceae family. A total of 545 SSRs were mined in 4698 nucleotide sequences downloaded from the National Center for Biotechnology Information (NCBI). Among the SSR sequences, the frequency of repeat type was about 429 -mono repeats, 99 -di repeats, 15 -tri repeats, and 2 --hexa repeats. Mononucleotide repeats were found to be abundant repeat types, about 78%, followed by dinucleotide repeats (18.16%) among the SSR sequences. An attempt was made to design primer pairs for 545 identified SSRs but these were found only for 169 sequences.
Keywords: Gentianaceae , nucleotide, simple sequence repeats
|How to cite this article:|
Sathishkumar R, Lakshmi P T, Annamalai A, Arunachalam V. Mining of simple sequence repeats in the Genome of Gentianaceae. Phcog Res 2011;3:19-29
|How to cite this URL:|
Sathishkumar R, Lakshmi P T, Annamalai A, Arunachalam V. Mining of simple sequence repeats in the Genome of Gentianaceae. Phcog Res [serial online] 2011 [cited 2020 Jan 17];3:19-29. Available from: http://www.phcogres.com/text.asp?2011/3/1/19/79111
| Introduction|| |
Gentianaceae , or the Gentian family, is a family of flowering plants of 87 genera and over 1650 species.  Plants are usually rhizomatous. These are annuals or perennials, mostly upright though a few species lie on the ground and have upright branch tips. Leaves are opposite or whorled with entire edges and bases connately attached to the stem, mostly without a petiole. Flowers have four to five sepals, petals, and stamens, but only one pistil. Sepals and petals are fused at the base, with four to five free lobes above. Stamens alternate with the corolla lobes. Ovary is superior; fruit is a capsule. Stipules is absent. Plants usually accumulate bitter iridoid substances; bicollateral bundles are present. The fruits are dehiscent septicidal capsules splitting into two halves. The Gentianaceae contains many species with interesting phytochemical properties. They have been widely used in traditional medicine and also as constituents in bitters and similar concoctions. The family consists of trees, shrubs, and herbs showing a wide range of colors and floral patterns.
Simple sequence repeats (SSRs),  or microsatellites  or short tandem repeats,  are short (1-6 bp) repeat motifs that show a high level of length polymorphism due to insertion or deletion mutations of one or more repeat types.  Studies suggest that both protein coding and noncoding regions of DNA sequences contain SSRs.  SSRs present in coding sequences are less polymorphic than those in the genomic sequences. Moreover, different taxon varies in abundance of different types of SSRs and these are present in greater abundance in noncoding regions than coding SSRs.  The SSRs are either developed conventionally  or from sequence databases.  PCR-based techniques such as AFLP and microsatellites or SSRs have also played important roles in plant DNA profiling. Primers are essential components of PCR-based systems as well as modern microarray systems which utilize appropriate probes for PCR amplification. 
In genetics, a sequence motif is a nucleotide or amino acid sequence pattern that is widespread and is believed to have, a biological significance. When a sequence motif appears in the exon of a gene, it may encode the "structural motif" of a protein, that is, a stereotypical element of the overall structure of the protein. "Noncoding" sequences are not translated into proteins. Outside of gene exons, there exist regulatory sequence motifs and motifs within the "junk," such as satellite DNA.  Robinson et al. developed a computer program to identify and design PCR primers for amplification of SSR loci based on available DNA sequence information. SSR primers have been designed using publicly available expressed sequence tags (ESTs) in barley,  almond (Prunus communis Fritsch.), and peach (P. persica (L.) Batsch.),  T. aestivum, and O. Sativa.  These SSRs are useful as molecular markers because their development is inexpensive, they represent transcribed genes, and their putative function can often be deduced by a homology search.  SSRs have been the backbone to creating molecular maps for a number of years.
The increasing number of genomic and expressed sequences in public databases provides a valuable source for bioinformatical data mining. However, there are a number of exciting application of these sequence data; used in comparative genome analysis - to trace the evolution among the related species, to study the genome structure and their gene functions. Comparative genome analysis requires the same sets of genes (i.e., cross-reference genes) to be mapped to chromosomes in the species compared. Thus, comparative maps with sets of EST-derived markers (i.e., cross-species markers) are essential for comparative genome analysis. Several studies have utilized publicly available ESTs to mine SSRs or microsatellites markers for plants, ,,, catfish,  insects,  animals,  and human.  The EST-derived SSR markers (EST-SSRs) have proved very useful for the construction of genetic and comparative maps.  The software used here is MISA, a microsatellite identifying tool which has the advantage of detecting the mono- to decamer repeats and also compound repeats. But it has the disadvantage of inability to detect above decanucleotide repeats. Riju and Arunachalam,  mined the SSRs in oil palm ESTs with five different software and have reported that MISA program has given maximum coverage of SSRs in both oil palm ESTs and Contigs.
PCR primer design in general
Understanding of primer properties is very important for primer design. The major aspects of primer properties include specificity, melting temperature (Tm ), and intraprimer or interprimer homology. Primer specificity is mostly determined by the 3'-end sequences. It was reported that single internal mismatches had no significant effect on PCR product yield while the 3'-terminal mismatches, especially the A:A, A:G, G:A, and C:C mismatches, markedly reduced overall PCR product yield.  Khabar et al. assessed the annealing specificity of primers in PCR reactions under different annealing temperatures (35°C, 40°C, and 45°C) and found perfect matches between at least eight bases at the 3'-end of the 5'-primers and the target region, whereas mispriming occurred only toward the 5'-end. Therefore it is critical to include 8-10 unique bases at the 3'-end of the primer.
Ideally the primer has a Tm in the range of 50-65°C, random nucleotide composition with a 40-60% GC-content, and 18-30 bases long. The intraprimer or interprimer homology is kept as low as possible to avoid formation of hairpin structures or primer dimmers (>3 bp complementarities between primers) which otherwise will interfere with annealing of primer to the DNA template. 
ESTs, which represent the expressed part of genome, also serve as a source of SSRs.  Detection of SSRs facilitates the development of SSR markers that are useful in the study of genetic variation, gene tagging, and linkage mapping,  and are also useful across a number of related species.  Microsatellites can be amplified for identification by the polymerase chain reaction (PCR) process, using the unique sequences of flanking regions as primers. Once the potentially useful microsatellites are determined (removing nonuseful ones such as those with random inserts within the repeat region), the flanking sequences can be used to design oligonucleotide primers which will amplify the specific microsatellite repeat in a PCR. Microsatellite loci are widely distributed throughout the genome and can be isolated from semidegraded DNA of older specimens, as all that is needed is a suitable substrate for amplification through PCR. Hence, the present study was to find out the distribution and abundance of SSRs for the development of markers and to annotate SSR-containing sequence in Gentianaceae family. Nucleotide database, which contains sequences of well-characterized genes as well as hundreds of thousands novel EST sequences, was retrieved to perform the analysis.
| Materials and Methods|| |
Retrieval of nucleotide sequences and detection of SSRs
A total of 647 nucleotide sequences of Gentianaceae were downloaded from the NCBI (http://www.ncbi.nlm.nih.gov/Nucleotide/?term=Gentianaceae) and harvested for SSRs using a perl script. The minimum length of SSR was ﬁxed at 14 bp according to the criteria used by Gupta et al..  The SSRs were deﬁned as 14-bp mononucleotide or dinucleotide repeats; 15-bp trinucleotide repeats; 16 tetranucleotide repeats; 20 pentanucleotide repeats; 18 hexanucleotide repeats. The poly A and poly T repeats were removed by using an inhouse developed perl script, as these are not considered as SSRs due to their presence at 3'-end of mRNA/cDNA sequences.
Primer designing for SSRs
A pair of primer ﬂanking each SSR was designed using FastPCR software available at www-genome.wi.mit.edu/cgi-bin/primer/primer3_www.cgi, which takes input according to user-deﬁned conditions and pick primers according to these speciﬁed parameters. Default parameters of the FastPCR, viz, the optimum primer size of 20.0 (the range was 18-28), the optimum annealing temperature of 60.0 (the range was 57.0-63.0), and the range of% GC content of 44-60, were selected for primer designing.
Detection of SSR positions with respect to open reading frames
Open reading frames (ORFs) are predicted for all the SSR-containing sequences using ORF ﬁnder available at NCBI (http://www.ncbi.nl m.nih.gov/gorf/gorf.html) using standard genetic code. Sequence fragments with maximum length uninterrupted by stop codon were taken as the primary encoding segment (ORF) of the query sequences. In all the predicted ORFs, the relative positions of SSRs were detected, that is, whether the SSR was present within the ORF, in the 5' UTR untranslated region (UTR) or in the 3' UTR
| Results|| |
Screening of Gentianaceae sequences for SSRs
In the present study, 4698 nucleotide sequences of Gentianaceae available at NCBI (http://www.ncbi.nlm.nih.gov) were searched for SSRs with a minimum length of 18 bp. A total of 545 SSRs were detected from 2889 kb of data mined, excluding poly A and poly T. Depending upon the length of the repeat unit itself (1-6 bp), the lengths of the identiﬁed SSRs varied from 14 to 48 bp, respectively.
Frequencies of classified repeat types of Gentianaceae
From a number of 4698 sequences screened, only a subset of 461 sequences contained 545 SSRs, suggesting that merely 9.83% of sequences contained SSRs. The frequencies of SSRs with mono-, di-, tri-, tetra-, and hexanucleotide repeat units showed the frequent repeat type within the nucleotide sequences of Gentiana family that were found to be in mononucleotide (84.58%) followed by dinucleotide repeats (18.16%), trinucleotide (2.75%), and hexanucleotide (0.65%), respectively [Figure 1]. Whereas, no tetranucleotide and pentanucleotide repeat was detected during the analysis.
|Figure 1: Frequency distribution of different repeat types identifi ed in nucleotide sequences of Gentianaceae|
Click here to view
The observed frequency of different repeat types comprising the SSRs is presented in [[Figure 2]a-d] and summarized in [Table 1]. SSRs were comprised of four different types of mononucleotide (A,T, C, and G), nine different types of dinucleotide (CA)n, (TG)n, (AC)n, (GA)n, (CT)n, (TA)n, (AT)n, (GC)n, (TC)n, (AG)n, (GT)n repeats, seven different types of trinucleotide (GAG)n, (ATG)n, (CTT)n, (TTA)n, (CAA)n, (AAC)n, (ACA)n repeats, and two types of hexanucleotide (CCACAC)n, (GGTCAA)n repeats.
|Table 1: Summary of in silico mining of Nucleotide sequences of Gentianaceae|
Click here to view
|Figure 2: Frequency distribution of (a) mono-, (b) di-, (c) tri-, and (d) hexanucleotide repeat motifs in the genome of Gentianaceae|
Click here to view
Designing of primers for SSRs
Out of 545 SSRs detected, the primers could be designed only for 169 (31%) SSRs and the rest 376 (69%) sequences did not produce any acceptable primers. These 169 SSRs for which primers were designed include 133 mono-, 29 di-, 7 tri-, and no hexanucleotide repeats. The details of the accession numbers of nucleotide sequences of Gentiana, repeat motif of SSRs for which primer were designed, primer sequences, GC%, product size, and annealing temperature are given in [Table 2].
Prediction of ORF in SSR-containing sequences
An attempt was made to predict the ORFs in SSR-containing sequences using ORF finder. Out of the 545 SSRs identified, the positions of 359 SSRs with respect to ORF were determined, while for the remaining 186 SSR-containing sequences, no ORF were predicted. Of these 359 SSRs, a large number of 161 (44.84%) were present in the 5' untranslated region, 129 (35.93%) SSRs occurred within ORF, and the remaining 69 (19.22%) occurred in the 3' untranslated region.
| Discussion|| |
In the present study, a large number of nucleotide sequences (4698) of Gentiana retrieved from NCBI were mined for SSRs. In the sequences that were mined the SSRs were characterized, and a subset of these SSRs was used for designing the markers. A total of 545 SSRs was detected and this was in accordance to the findings of  who reported that the abundance of different repeats varied broadly depending upon the species.
Microsatellites or SSRs are stretches of DNA containing tandem repeats of di-, tri-, tetra-, and above nucleotide units ubiquitously distributed throughout the eukaryotic genome. They are found to be abundant in plant genomes and are thought to be the major sources of genetic variation in quantitative traits. The abundance of the different repeat motifs (1-6 bp) in the SSRs as detected in Gentiana family during the present study was variable so that the SSRs with different repeat motifs were not evenly distributed. The SSRs with dinucleotide repeats (18.16%) were abundant. This is in agreement with the results of earlier studies on Arabidopsis in which the dinucleotide repeats were also found to be abundant,  perhaps because the genomic sequences of this species may include SSRs in noncoding regions too. The smaller repeat motifs were found to be predominant among SSRs identified and as the length of repeat unit increases, their occurrence decreases. We excluded poly A and poly T repeats due to which their number is under-represented. The abundance of trinucleotide SSRs may be attributed to the absence of frame shift mutations due to variation in trinucleotide repeats. 
Molecular genetic markers can be used to examine a group of individuals or populations to estimate various diversity measures and genetic distances, intergenetic structure and clustering patterns, test for Hardy-Weinberg equilibrium and multilocus equilibrium, and to test polymorphic loci for the evidence of selective neutrality. This can be useful to plant breeders, germplasm managers, or others who are interested in population genetic properties of materials that they are working with. The three most common types of markers used today are RFLP, RAPD, and microsatellites. A wide variety of methods for the construction of libraries enriched for microsatelite sequences have been reported, the most popular among those being the ones based on vectorette PCR using anchored primers. But this method is highly time-consuming and expensive, and the alternative is to use bioinformatics, that is, computational tools to screen the public database and find SSR. EST-derived molecular markers, especially SSR and SNP, are highly useful in developing linkage maps and markers assisted breeding programs. These markers are also transferable to related genera.
Molecular marker techniques are advantageous as they directly reflect variations in the DNA sequences and therefore of independence of environment. Among many molecular marker techniques currently available, microsatellites and SSRs  provide an improved technology in assessing genetic diversity and genetic relationships in plants as they are highly polymorphic, codominants, very informative, and PCR based. EST-SSRs offer the following advantages over other genome DNA-based markers: (1) they should detect variation in the expressed portion of the genome so that gene tagging should give "perfect" marker-trait associations; (2) they can be developed at no cost from the EST databases; and (3) once developed, these markers, unlike genomic SSRs, may be used across a number of related species. With the growth of sequence databases, several authors have reported an abundance of SSRs in different genomes. The Distribution of SSRs in the rice genome has also been studied on the basis of the two whole genome draft sequences released, respectively, by Syngenta and by the Beijing Genome Institute (BGI). In the draft sequence released by Syngenta, for instance, 48,351 SSRs (including di-, tri-, and tetranucleotide repeats) were available, giving a density of 8 kb per SSR in the whole genome; SSRs represented by di-, tri-, and tetranucleotide repeats accounted respectively for 24%, 59%, and 17% of the total SSRs.
SSRs are very polymorphic due to the high mutation rate affecting the number of repeat units. Such length-polymorphisms can be easily detected on high-resolution gels (e.g., sequencing gels), by running PCR-amplified fragments obtained using a unique pair of primers flanking the repeat.  Chung and Staub  developed a set of consensus chloroplast primer pairs for ccSSRs from N. tabacum chloroplast sequences. All primer pairs produced amplicons after PCR employing chloroplast DNA from members of the Cucurbitaceae (six species) and Solanaceae (four species). Sixteen, 22, and 19 of the initial 23 primer pairs were successively amplified by PCR using template DNA from species of the Apiaceae (two species), Brassicaceae (one species), and Fabaceae (two species), respectively. Twenty of the 23 primer pairs were also functional in three monocot species of the Liliaceae (onion and garlic), and the Poaceae (oat). ccSSR primers were strategically "recombined" and referred to correctly as recombined consensus chloroplast primers (RCCP) for PCR analysis of cucumber DNA such that the primers designed for the SSR-containing genus of Gentiana family would be utilized for the production of amplicons from different members of family.
Kijas et al. tested two primer sets in 10 different Citrus species and two related genera and found conservation of the sequences. Cross-species amplification has also been reported between cultivated rice and related wild species  and between Vitis species.  Provan et al. could show successful amplification of two tomato SSR primer pairs tested on potato cultivars. Weising et al,.  reported conservation of SSR flanking sites in different species of kiwifruit (Actinidia chinensis). Usually, a low percentage of markers also amplified fragments from species belonging to other genera from the same family. Within the Poaceae family, primers worked even across different genera,  but only 50% of microsatellite loci identified in wheat were also polymorphic in rye and barley cultivars. Whitton et al. tested 13 SSR loci in 25 representatives of the Asteraceae, where it was demonstrated that the regions flanking in the repeats are not highly conserved, neither in the nucleotide sequence nor in the relative position.
Indeed, in general, transferability of polymorphic markers in plants is likely to be successful mainly within genera (success rate close to 60% in eudicots and close to 40% in the reviewed monocots) rather than between genera (transfer rates are approximately 10% for eudicots) within the same family.  This transferability of polymorphic markers nature in plant generally enhances the utilization of the primers in random way. Comparative genome analysis facilitates high-throughput comparative mapping with the assistance of cross-species markers, and further facilitates gene cloning by identifying cross-reference genes. Seventeen SSR primer sets developed for Quercus petraea were tested on eight different members of the Fagaceae family.  In total 66% resulted in interpretable amplification products and most of them were really homologous to the originally cloned SSR fragment from Q. petraea. The primers could be designed successfully for a very large number (169, 31%) of SSRs [Table 2]. However, it was not possible to design the primers for remaining SSRs (376, 69%) because the sequence flanking at both ends of the SSRs was inadequate in size to design the primers. The large number of primer pairs for the SSRs that have been designed during the present study may be utilized for a variety of purposes, for example, gene tagging, genetic mapping, population studies, etc. Due to a high level of potential for length polymorphisms, SSRs have become a valuable source of genetic markers and have been broadly applied to various areas of genetic research including studies of genome variation, establishment of genetic maps, integration of physical and genetic maps, determination of evolutionary relationships, and comparative genome analyses.
| Conclusions|| |
Nucleotide sequences of Gentiana family were systematically searched for SSRs using the ''ssr_finder.pl'' perl program for the development of SSR markers. This is a valuable approach for both costs and time, given a sufficient amount of available Gentiana family sequences. The use of SSRs in genetic diversity studies is a novel tool that reveals variation in genomes.
| References|| |
|1.||Struwe L, Kadereit JW, Klackenberg J, Nilsson S, Thiv M, von Hagen KB, et al. Systematics, character evolution, and biogeography of Gentianaceae, including a new tribal and subtribal classification. In: Struwe, L. and V. A. Albert, editors, editors. Cambridge: Gentianaceae-Systematics and Natural History Cambridge University Press; 2002. p. 21-309. |
|2.||Jacob HJ, Lindpaintner K, Lincoln SE, Kusumi K, Bunker RK, Mao YP, et al. Genetic mapping of a gene causing hypertensive rat. Cell 1991;67:213-24. |
|3.||Litt M, Luty JA. A hypervariable microsatellite revealed by in vitro amplification of a dinucleotide repeat within the cardiac muscle actin gene. Am J Hum Genet 1989;44:397-401. |
|4.||Edwards A, Civitello A, Hammond HA, Caskey CT. DNA typing and genetic mapping with trimeric and tetrameric tandem repeats. Am J Hum Genet 1991;49:746-56. |
|5.||Tautz D, Renz M. Simple sequence repeats are ubiquitous repetitive components of eukaryotic genomes. Nucleic Acids Res 1984;12:4127-38. |
|6.||Katti MV, Ranjekar PK, Gupta VS. Differential distribution of simple sequence repeats in eukaryotic genome sequences. Mol Biol Evol 2001;18:1161-7. |
|7.||Hancock JM. The contribution of slippage-like processes to genome evolution. J Mol Evol 1995;41:1038-47. |
|8.||Ahmad R, Struss D, Southwick SM. Development and characterization of microsatellite markers in citrus. J Am Soc Hortic Sci 2003;128:584-90. |
|9.||Chen C, Zhou P, Choi YA, Huang S, Gmitter FG Jr. Mining and characterizing microsatellites from citrus ESTs. Theor Appl Genet 2006;112:1248-57. |
|10.||Yang X, Scheffler BE, Weston LA. Recent developments in primer design for DNA polymorphism and mRNA profiling in higher plants. Plant Met 2006;2:4. |
|11.||Witzany G. Noncoding RNAs: Persistent viral agents as modular tools for cellular needs. Ann NY Acad Sci 2009;1178:244-67. |
|12.||Robinson AJ, Love CG, Batley J, Barker G, Edwards D. Simple sequence repeat marker loci discovery using SSR primer. Bioinformatics 2004;20:1475-6. |
|13.||Thiel T, Michalek W, Varshney RK, Graner A. Exploiting EST databases for the development and characterization of genederived SSR-markers in barley (Hordeum vulgare L.). Theor Appl Genet 2003;106:411-22. |
|14.||Xu Y, Ma RC, Xie H, Liu JT, Cao MQ. Development of SSR markers for the phylogenetic analysis of almond trees from China and the Mediterranean region. Genome 2004;47:1091-104. |
|15.||Yu JK, La Rota M, Kantety RV, Sorrells ME. EST derived SSR markers for comparative mapping in wheat and rice. Mol Genet Genomics 2004;271:742-51. |
|16.||Varshney RK, Graner A, Sorrells ME. Genic microsatellite markers in plants: Features and applications. Trends Biotechnol 2005;23:48-55. |
|17.||Temnykh S, DeClerck G, Lukashova A, Lipovich L, Cartinhour S, McCouch S. Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): Frequency, length variation, transposon associations, and genetic marker potential. Genome Res 2001;11:1441-52. |
|18.||Kantety RV, La Rota M, Matthews DE, Sorrells ME. Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Mol Biol 2002;48:501-10. |
|19.||Pinto LR, Oliveira KM, Ulian EC, Garcia AA, de Souza AP. Survey in the sugarcane expressed sequence tag database (SUCEST) for simple sequence repeats. Genome 2004;47:795-804. |
|20.||Peng JH, Lapitan NL. Characterization of EST-derived microsatellites in the wheat genome and development of eSSR markers. Funct Integr Genomics 2005;5:80-96. |
|21.||Serapion J, Kucuktas H, Feng J, Liu Z. Bioinformatic mining of type I microsatellites from expressed sequence tags of channel catfish (Ictalurus punctatus). Mar Biotechnol (NY) 2004;6:364-77. |
|22.||Prasad MD, Muthulakshmi M, Madhu M, Archak S, Mita K, Nagaraju J. Survey and analysis of microsatellites in the silkworm, bombyx mori: Frequency, distribution, mutations, marker potential and their conservation in heterologous species. Genetics 2005;169:197-214. |
|23.||Dranchak PK, Chaves LD, Rowe JA, Reed KM. Turkey microsatellite loci from an embryonic cDNA library. Poult Sci 2003;82:526-31. |
|24.||Haddad LA, Parra FC, Pena SD. Characterization and mapping of four novel human expressed polymorphic trinucleotide microsatellites. Gene 1998;223:369-74. |
|25.||Fraser LG, Harvey CF, Crowhurst RN, De Silva HN. EST-derived microsatellites from Actinidia species and their potential for mapping. Theor Appl Genet 2003;108:1010-6. |
|26.||Riju A, Arunachalam V. Data mining for simple sequence repeats in oil palm expressed sequence tags. Nature Proceedings 2009. |
|27.||Kwok S, Kellogg DE, McKinney N, Spasic D, Goda L, Levenson C, et al. Effects of primer-template mismatches on the polymerase chain reaction: Human immunodeficiency virus type 1 model studies. Nucleic Acids Res 1990;18:999-1005. |
|28.||Khabar KS, Dhalla M, Bakheet T, Sy C, al-Haj L. An integrated computational and laboratory approach for selective amplification of mRNAs containing the adenylate uridylate-rich element consensus sequence. Genome Res 2002;12:985-95. |
|29.||Sharrocks AD. The design of primers for PCR. In: Griffin HG, Griffin AM, editors. PCR technology: Current innovations. London: CRC Press; 1994. p. 5-11. |
|30.||Ramsay L, Macaulay M, degli Ivannissevich S, MacLean K, Cardle L, Fuller J, et al. Simple sequence repeat-based linkage map of barley. Genetics 2000;156:1997-2005. |
|31.||Gupta PK, Rustgi S, Sharma S, Singh R, Kumar N, Balyan HS. EST-SSRs for transferability, polymorphism and genetic diversity in bread wheat. Mol Genet Genomics 2003;270:315-23. |
|32.||Toth G, Gaspari Z, Jurka J. Microsatellites in different eukaryotic genome, survey and analysis. Genome Res 2000;10:1967-81. |
|33.||Cardle L, Ramsay L, Milborne D, Macaulay M, Marshall D, Waugh R. Computational and experimental characterization of physically clustered simple sequence repeats in plants. Genetics 2000;156:847-54. |
|34.||Metzgar D, Bytof J, Wills C. Selection against frameshift mutations limits microsatellite expansion in coding DNA. Genome Res 2000;10:72-80. |
|35.||Powell W, Machray GC, Provan J. Polymorphism revealed by simple sequence repeats. Trends Plant Sci 1996;1:215-22. |
|36.||Weber, May. Abundant class of human DNA polymorphism which can be typed using the polymerase chain reaction. Am J Hum Genet 1989;44:388-96. |
|37.||Chung SM, Staub JE. The development and evaluation of consensus chloroplast primer pairs that possess highly variable sequence regions in a diverse array of plant taxa. Theor Appl Genet 2003;107:757-67. |
|38.||Kijas JM, Fowler JC, Thomas MR, Scott NS. An evaluation of sequence tagged microsatellite site markers for genetic analysis within citrus and related species. Genome 1995;38:349-55. |
|39.||Wu K, Tanksley SD. Abundance, polymorphism and genetic mapping of microsatellites in rice. Mole Gen Genet 1993;241:225-35. |
|40.||Thomas MR, Scott NS. Microsatellite repeats in grapevine reveal DNA polymorphisms when analysed as sequence-tagged sites. Theor Appl Genet 1993;86:985-90. |
|41.||Provan J, Waugh R, Powell W. Microsatellite analysis of relationships within cultivated potato (Solanum tuberosum). Theoretical and Applied Genetics 1996;92:1076-84. |
|42.||Weising K, Fung RW, Keeling DJ, Atkinson RG, Gardner RC. Characterisation of microsatellites from Actinidia chinensis. Mole Breed 1997;3:159-60. |
|43.||Röder MS, Plaschke JS, König U, Börner A, Sorrells ME. Abundance, variability and chromosomal location of microsatellites in wheat. Mole Gen Genet 1995;246:327-33. |
|44.||Whitton J, Rieseberg LH, Ungerer MC. Microsatellite loci are not conserved across Asteraceae. Mole Biol Evol 1997;14:204-9. |
|45.||Barbará T, Palma-Silva C, Paggi GM, Bered F, Fay MF, Lexer C. Cross-species transfer of nuclear microsatellite markers: Potential and limitations. Mole Ecol 2007;16:3759-67. |
|46.||Steinkellner H, Lexer C, Turetschek E, Glössl J. Conservation of (GA)n microsatellite loci between Quercus species. Mole Ecol 1997;6:1189-94. |
[Figure 1], [Figure 2]
[Table 1], [Table 2]