Evaluation of Genetic Structure of Amaranth Accessions from the United States

  • ABSTRACT

    Amaranths (Amaranthus sp.), an endemic American crop, are now grown widely across the world. This study used 14 simple sequence repeat (SSR) markers to analyze the genetic diversity of 74 amaranth accessions from the United States, with eight accessions from Australia as controls. One hundred twenty-two alleles, averaging eight alleles per locus, were observed. The average major allele frequency, expected heterozygosity, and polymorphism information content (PIC) were 0.44, 0.69, and 0.65, respectively. The structure analysis based on genetic distance classified 77 accessions (94%) into three clusters, while five accessions (6%) were admixtures. Among the three clusters, Cluster 3 had the highest allele number and PIC values, while Cluster 2 had the lowest. The lowest FST was between Clusters 1 and 3, indicating that these two clusters have higher gene flow between them compared to the others. This finding was reasonable because Cluster 2 included most of the Australian accessions. These results indicated satisfactory genetic diversity among U.S. amaranths. These findings can be used to design effective breeding programs involving different plant characteristics.


  • KEYWORDS

    Amaranths (Amaranthus sp.), Genetic Diversity, Population Structure, SSR

  • Introduction

    Amaranths (Amaranthus sp.) belong to the family Amaranthaceae, which originated in the Americas and Europe. Dating back to the Mayan civilization of South and Central America, amaranth has been cultivated for more than 8,000 years. An estimated 87 species belong to the genus Amaranthus, 40 of which are considered native to the Americas, including cultivated grains, vegetable crops, and wild species (Chan and Sun, 1997; Mujica and Jacobsen, 2003). The United States covers a large part of the American continent and, as one of the places where amaranths originated, holds a diverse set of amaranths (Wetzel et al., 1999). Analyzing the genetic structure of amaranths from this region is therefore of great significance.

    Analyses of genetic diversity and population structure are important, not just for amaranths, but for many crops, and such studies have direct benefits in research on evolution and plant breeding (Chung and Park, 2010). Many molecular markers have been used to analyze diversity, such as restriction fragment length polymorphisms (RFLPs), amplified fragment length polymorphisms (AFLPs), simple sequence repeats (SSRs), and single nucleotide polymorphisms (SNPs; Bao et al., 2006; Cheng et al., 2011; Feltus et al., 2004; Jin et al., 2010; Li et al., 2012; Liang et al., 1994; Nagaraju et al., 2002; Zhao et al., 2009). Different marker systems have been used to investigate genetic diversity (Tam et al., 2005), and random amplified polymorphic DNA (RAPD) markers and SSRs have been applied to study the genetic diversity and phylogenetic relationships among Amaranthus species (Khaing et al., 2013; Lee et al., 2008; Wassom and Tranel, 2005; Xu and Sun, 2001).

    Amaranths have superior nutrition, drought tolerance, disease and pest resistance, and production yield, making these native American crops more attractive for cultivation in developing countries and increasing their rate of consumption in recent years (Ray and Roy, 2009). Varying amounts of outcrossing and frequent interspecific and intervarietal hybridization of amaranths have resulted in a large variety of amaranth genotypes (Ray and Roy, 2009). Due to their complex genetic background, amaranths show tremendous adaptability to different ecogeographic situations (Lee et al., 2008) and have evolved many characteristics adapted to different environments, such as cold, drought, and salinity resistance.

    Understanding the genetic diversity and polymorphism of Amaranthus is important. In particular, a detailed SSR analysis of the genetic diversity and population structure of U.S. amaranth accessions would make a significant contribution, as the United States has played a major role in the development of amaranths (Ray and Roy, 2009). Therefore, we used a model-based structure analysis to elucidate the genetic diversity and structure of U.S. amaranth germplasm.

  • Materials and Methods

      >  Plant materials

    Eighty-two accessions belonging to 29 species were genotyped using 14 SSRs (Table 1). All plant materials, comprising 74 accessions from the United States and 8 accessions from Australia, were obtained from the National Genebank of the Rural Development Administration, Republic of Korea (RDA-Genebank).

      >  SSR genotyping

    Total DNA was extracted from all accessions using a DNA extraction kit (Qiagen, Seoul, Republic of Korea). Fourteen polymorphic SSR markers developed by Lee et al. (2008) were used in this study. The M13-tail polymerase chain reaction (PCR) method (Schuelke, 2000) was used to measure the size of the PCR products, as described previously (Lee et al., 2008). Using GeneScan 3.7 (Applied Biosystems, Foster City, CA, USA), the SSR alleles were resolved on an ABI Prism 3100 DNA sequencer (Applied Biosystems) and sized precisely using GeneScan 500 ROX (6-carbon-X-rhodamine) molecular size standards (35–500 bp; Applied Biosystems).

      >  Data analysis

    The data were analyzed statistically using the PowerMarker V3.23 genetic analysis package (Liu and Muse, 2005) to measure the diversity at each microsatellite locus, including the total number of alleles (NA), allele frequency, major allele frequency (the frequency of the most common allele), accession-specific alleles, and polymorphism information content (PIC). Genetic distances between each pair of accessions were determined by calculating the shared allele frequencies using PowerMarker V3.23. Unweighted pair group method with arithmetic mean (UPGMA) and neighbor-joining (NJ) trees were constructed from the shared allele frequencies using MEGA 4.0 embedded in PowerMarker.
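    The per-locus statistics named above can be illustrated with a minimal sketch. It uses the commonly cited textbook definitions of expected heterozygosity and PIC and assumes a flat list of allele calls per locus; the function name and input layout are hypothetical, and the exact estimators implemented in PowerMarker may differ (for example, in sample-size corrections).

```python
from collections import Counter


def locus_diversity(alleles):
    """Summarize one SSR locus from a flat list of allele calls (sizes in bp).

    Missing calls are expected as None and are ignored.
    """
    counts = Counter(a for a in alleles if a is not None)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no allele calls at this locus")

    # Allele frequencies, most common allele first.
    freqs = sorted((c / total for c in counts.values()), reverse=True)

    # Expected heterozygosity (gene diversity): 1 - sum(p_i^2).
    he = 1.0 - sum(p * p for p in freqs)

    # PIC: He - sum over i<j of 2 * p_i^2 * p_j^2.
    cross = sum(
        2.0 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )

    return {
        "n_alleles": len(freqs),
        "major_allele_freq": freqs[0],
        "expected_heterozygosity": he,
        "pic": he - cross,
    }
```

    For a locus with two equally frequent alleles, this sketch gives an expected heterozygosity of 0.5 and a PIC of 0.375, which shows why PIC is always somewhat lower than expected heterozygosity.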

    Population structure and the identification of admixed individuals were determined using the model-based software program Structure (Pritchard et al., 2000). In this model, a number of populations (K) are assumed to be present, with each population characterized by a set of allele frequencies at each locus. Individuals in the sample are then assigned to populations (clusters), or jointly to two or more populations if their genotypes indicate that they are admixed.
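    The assignment step can be sketched as follows, assuming a Q matrix in which each row holds one accession's estimated membership fractions across the K clusters, and using the 80% membership cutoff applied in this study. The function name and the example values are hypothetical and are not output from Structure.

```python
def assign_clusters(q_matrix, threshold=0.80):
    """Label each accession with a 1-based cluster number or 'admixed'.

    q_matrix: list of rows, each a list of K membership fractions.
    An accession is assigned to a cluster only when its largest
    membership fraction exceeds the threshold.
    """
    labels = []
    for q in q_matrix:
        best = max(range(len(q)), key=lambda k: q[k])
        labels.append(best + 1 if q[best] > threshold else "admixed")
    return labels


# Hypothetical membership fractions for three accessions at K = 3.
example_q = [
    [0.95, 0.03, 0.02],
    [0.50, 0.30, 0.20],
    [0.10, 0.85, 0.05],
]
labels = assign_clusters(example_q)
```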

    All loci are assumed to be independent, and each of the K populations is assumed to be in Hardy–Weinberg equilibrium. The posterior probabilities were estimated using the Markov chain Monte Carlo (MCMC) method. The MCMC chains were run with a 100,000-iteration burn-in period followed by 200,000 iterations using a model allowing for admixture and correlated allele frequencies. At least three runs of Structure were performed for each value of K from 1 to 10, and the average likelihood value, L(K), across all runs was calculated for each K. The most probable value of K was chosen using ΔK, an ad hoc quantity related to the second-order rate of change of the log probability of the data with respect to the number of clusters inferred by Structure (Evanno et al., 2005). An accession with more than 80% of its genome fraction assigned to a single cluster was placed in that cluster; the remaining accessions were regarded as admixtures.
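    The ΔK criterion can be sketched in a few lines. The version below is one common way of computing it: the absolute second-order difference of the mean L(K) divided by the standard deviation of L(K) across replicate runs. The function name and the log-likelihood values are hypothetical and are not the values obtained in this study.

```python
from statistics import mean, stdev


def evanno_delta_k(lnp_by_k):
    """Compute delta K for every K that has both neighbors K-1 and K+1.

    lnp_by_k: dict mapping K to a list of ln P(D) values, one per run.
    """
    means = {k: mean(v) for k, v in lnp_by_k.items()}
    delta = {}
    for k in sorted(lnp_by_k):
        if k - 1 not in means or k + 1 not in means:
            continue  # the second-order difference needs both neighbors
        sd = stdev(lnp_by_k[k])
        if sd == 0:
            continue  # delta K is undefined when the runs do not vary
        second_diff = abs(means[k + 1] - 2 * means[k] + means[k - 1])
        delta[k] = second_diff / sd
    return delta


# Hypothetical ln P(D) values from three runs per K.
example_runs = {
    1: [-3000, -3010, -2990],
    2: [-2500, -2510, -2490],
    3: [-2000, -2005, -1995],
    4: [-1950, -1960, -1940],
    5: [-1910, -1920, -1900],
}
delta_k = evanno_delta_k(example_runs)
best_k = max(delta_k, key=delta_k.get)
```

    Because the second-order difference needs both neighbors, ΔK cannot be computed for the smallest or largest K tested, which is one reason to run a range of K wider than the values of interest.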

    The value of FST was calculated using an analysis of molecular variance (AMOVA) approach in Arlequin 3.11 (Excoffier et al., 2005; Schneider and Excoffier, 1999).
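    To give a feel for what a pairwise FST measures, the sketch below uses a simpler heterozygosity-based estimator, (HT - HS) / HT, summed over loci. This is a swapped-in illustration and not the AMOVA procedure implemented in Arlequin, so its values would not be expected to match those reported here. The function name and the input layout (one allele-frequency dictionary per locus per cluster) are assumptions.

```python
def heterozygosity_fst(freqs_pop1, freqs_pop2):
    """Pairwise FST-like statistic from per-locus allele frequencies.

    freqs_popX: list with one dict per locus mapping allele -> frequency.
    Both lists must describe the same loci in the same order.
    """
    hs_sum = 0.0  # mean within-population expected heterozygosity
    ht_sum = 0.0  # expected heterozygosity of the pooled populations
    for f1, f2 in zip(freqs_pop1, freqs_pop2):
        h1 = 1.0 - sum(p * p for p in f1.values())
        h2 = 1.0 - sum(p * p for p in f2.values())
        hs_sum += (h1 + h2) / 2.0

        alleles = set(f1) | set(f2)
        pooled = ((f1.get(a, 0.0) + f2.get(a, 0.0)) / 2.0 for a in alleles)
        ht_sum += 1.0 - sum(p * p for p in pooled)

    # With no variation at all, differentiation is reported as zero.
    return (ht_sum - hs_sum) / ht_sum if ht_sum > 0 else 0.0
```

    Two clusters with identical allele frequencies give 0, and two clusters fixed for different alleles give 1, so lower values indicate less differentiation between clusters.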

  • Results and Discussion

      >  SSR polymorphisms

    In total, 122 alleles were observed among the 82 amaranth accessions at 14 SSR loci, ranging from 4 (78N) to 14 (104H and 99N) alleles per locus, with an average of eight alleles per locus. The database of allele frequencies showed that rare alleles (frequency < 0.05) comprised 51.6% of all detected alleles, whereas intermediate (frequency 0.05–0.50) and abundant (frequency > 0.50) alleles comprised 44.3% and 4.1%, respectively (Table 2, Fig. 1). The average major allele frequency was 0.44, ranging from 0.16 in 99N to 0.93 in 78N, and the average expected heterozygosity was 0.69, ranging from 0.11 in 78N to 0.90 in 99N. The average PIC was 0.65, which indicated that the 14 SSR markers exhibit good polymorphism across the accessions (Table 2).
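    The three frequency classes used above can be expressed as a small classification routine. This is a sketch only; the function names and the example frequencies are hypothetical.

```python
from collections import Counter


def frequency_class(freq):
    """Classify one allele frequency using the thresholds in the text."""
    if freq < 0.05:
        return "rare"
    if freq <= 0.50:
        return "intermediate"
    return "abundant"


def class_shares(freqs):
    """Return the share of alleles that falls into each frequency class."""
    counts = Counter(frequency_class(f) for f in freqs)
    return {
        name: counts[name] / len(freqs)
        for name in ("rare", "intermediate", "abundant")
    }


# Hypothetical allele frequencies pooled across loci.
shares = class_shares([0.01, 0.02, 0.30, 0.93])
```

    Applied to the reported totals, shares of 51.6%, 44.3%, and 4.1% of 122 alleles correspond to roughly 63, 54, and 5 alleles, respectively.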

      >  Genetic diversity and population structure analysis

    Previously, Pritchard et al. (2000) used a model-based method to analyze the population structure and identify admixed individuals. Unfortunately, the estimated likelihood values do not indicate the exact value of K using this model (Fig. 2). Therefore, an ad hoc quantity (ΔK) was used to overcome the difficulty interpreting real K values (Evanno et al., 2005). Using this approach, an identifiable peak indicated the true value of K based on ΔK. For the 82 accessions, the highest value of ΔK was K = 3 (Fig. 2); therefore, we used K = 3 for the final analysis. When alpha is near zero, most individuals are essentially from one population. Conversely, when alpha is greater than 1, most individuals are admixed (Evanno et al., 2005; Ostrowski et al., 2006). The relatively small value of alpha (α = 0.0345) indicated that most accessions originated from one primary ancestor (Ostrowski et al., 2006).

    The genetic diversity analysis of the 82 amaranth accessions indicated an average of 8.64 alleles per locus in accessions from the United States and 2.64 in those from Australia, with an average of 5.64 across the two countries (Table 3). The major allele frequency was higher in the Australian accessions than in the U.S. accessions, whereas the PIC was lower.

    Based on the Structure results, most of the 82 accessions were clearly classified into three subpopulations. Clusters 1–3 included 20, 23, and 34 accessions, respectively. Only five accessions were admixtures: three from Australia and two from the United States. Of the three subpopulations, Cluster 3 had the highest allele numbers and PIC values, while Cluster 2 had the lowest. The FST was 0.4221, 0.2209, and 0.4274 between Clusters 1 and 2, Clusters 1 and 3, and Clusters 2 and 3, respectively (Table 4).

    A genetic distance-based analysis was performed by calculating the shared allele frequencies among the 82 accessions. An unrooted phylogram was computed using MEGA 4 (Tamura et al., 2007) embedded in the PowerMarker program (Liu and Muse, 2005). The NJ tree clustered all accessions into three main groups with a few exceptions. As shown in Fig. 4, the 82 amaranth accessions were distributed among three groups, which was consistent with the results of Structure. Admixtures are marked in black. Most of the accessions from the same species were clustered into the same group.
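    A distance based on shared alleles can be sketched as one minus the proportion of alleles two accessions share, averaged over loci. This follows one common definition and assumes genotypes stored as a tuple of allele sizes per locus; the function name and example genotypes are hypothetical, and the exact estimator used by PowerMarker is not reproduced here.

```python
from collections import Counter


def shared_allele_distance(geno_a, geno_b):
    """Distance between two accessions from per-locus allele tuples.

    geno_x: list with one tuple of allele sizes per locus, e.g. (150, 152).
    Loci with a missing genotype (empty tuple or None) are skipped.
    """
    shared = 0.0
    n_loci = 0
    for locus_a, locus_b in zip(geno_a, geno_b):
        if not locus_a or not locus_b:
            continue
        # Within-individual allele frequencies at this locus.
        pa = {a: c / len(locus_a) for a, c in Counter(locus_a).items()}
        pb = {a: c / len(locus_b) for a, c in Counter(locus_b).items()}
        shared += sum(min(p, pb.get(a, 0.0)) for a, p in pa.items())
        n_loci += 1
    if n_loci == 0:
        raise ValueError("no locus is genotyped in both accessions")
    return 1.0 - shared / n_loci


# Hypothetical genotypes at two loci.
acc1 = [(150, 152), (201, 201)]
acc2 = [(150, 150), (203, 205)]
distance = shared_allele_distance(acc1, acc2)
```

    Computing this value for every pair of accessions yields the kind of distance matrix from which a UPGMA or NJ tree can be built.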

    Generally, a narrow genetic base and low genetic diversity are detrimental to a breeding program (Wolfe, 1985). Although only 74 accessions from the United States were evaluated in this study, 122 alleles were detected and the PIC was high. Therefore, we concluded that the United States, which is near the center of origin of Amaranthus, exhibits rich genetic polymorphism, and this finding can be used to design effective breeding programs involving different plant characteristics aimed at meeting societal demands.

  • 1. Bao J.S., Corke H., Sun M. 2006 Analysis of genetic diversity and relationship in waxy rice (Oryza sativa L.) using AFLP and ISSR marker [Genet. Resour. Crop Ev.] Vol.53 P.323-330
  • 2. Chan K.F., Sun M. 1997 Genetic diversity and relationships detected by isozyme and RAPD analysis of crop and wild species of Amaranthus [Theor. Appl. Genet.] Vol.95 P.865-873
  • 3. Cheng Y., Kim C.H., Shin D.I., Kim S.M. 2011 Development of simple sequence repeat (SSR) markers to study diversity in the herbaceous peony (Paeonia lactiflora) [J. Med. Plants Res.] Vol.5 P.6744-6751
  • 4. Chung J.W., Park Y.J. 2010 Population structure analysis reveals the maintenance of isolated sub-populations of weedy rice [Weed Res.] Vol.50 P.606-620
  • 5. Costea M., Weaver S.E., Tardif F.J. 2004 The biology of Canadian weeds. 130. Amaranthus retroflexus L., A. powellii S. Watson and A. hybridus L [Can. J. Plant Sci.] Vol.84 P.631-668
  • 6. Evanno G., Regnaut S., Goudet J. 2005 Detecting the number of clusters of individuals using the software structure: a simulation study [Mol. Ecol.] Vol.14 P.2611-2620
  • 7. Excoffier L., Laval G., Schneider S. 2005 Arlequin (version 3.0): an integrated software package for population genetics data analysis [Evol. Bioinform.] Vol.1 P.47
  • 8. Feltus F.A., Wan J., Schulze S.R., Estill J.C. 2004 An SNP resource for rice genetics and breeding based on subspecies indica and japonica genome alignments [Genome Res.] Vol.14 P.1812-1819
  • 9. Jin L., Lu Y., Xiao P., Sun M., Corke H. 2010 Genetic diversity and population structure of a diverse set of rice germplasm for association mapping [Theor. Appl. Genet.] Vol.121 P.475-487
  • 10. Khaing A.A., Moe K.T., Chung J.W., Baek H.J. 2013 Genetic diversity and population structure of the selected core set in Amaranthus using SSR markers [Plant Breeding] Vol.132 P.165-173
  • 11. Lee J.R., Hong G.Y., Dixit A., Chung J.W. 2008 Characterization of microsatellite loci developed for Amaranthus hypochondriacus and their cross-amplification in wild species [Conserv. Genet.] Vol.9 P.243-246
  • 12. Li G., Kwon S.W., Park Y.J. 2012 Updates and perspectives on the utilization of molecular makers of complex traits in rice [Genet. Mol. Res.] Vol.11 P.4157-4168
  • 13. Liang C.Z., Gu M.H., Pan X.B., Liang G.H. 1994 RFLP tagging of a new semidwarfing gene in rice [Theor. Appl. Genet.] Vol.88 P.898-900
  • 14. Liu K., Muse S.V. 2005 PowerMarker: an integrated analysis environment for genetic marker analysis [Bioinformatics] Vol.21 P.2128-2129
  • 15. Mujica A., Jacobsen S.E. 2003 The genetic resources of Andean grain amaranths (Amaranthus caudatus L., A. cruentus L. and A. hypochondriacus L.) in America [Plant Genet. Resour. Newsl.] Vol.133 P.41-44
  • 16. Nagaraju J., Kathirvel M., Kumar R.R., Siddiq E., Hasnain S.E. 2002 Genetic analysis of traditional and evolved Basmati and non-Basmati rice varieties by using fluorescence-based ISSR-PCR and SSR markers [P. Natl. Acad. Sci. USA] Vol.99 P.5836-5841
  • 17. Ostrowski M.F., David J., Santoni S., Mckhann H. 2006 Evidence for a large-scale population structure among accessions of Arabidopsis thaliana: possible causes and consequences for the distribution of linkage disequilibrium [Mol. Ecol.] Vol.15 P.1507-1517
  • 18. Pritchard J.K., Stephens M., Falush D. 2000 Inference of Population Structure Using Multilocus Genotype Data: Linked Loci and Correlated Allele Frequencies [Genetics] Vol.155 P.945-959
  • 19. Ray T., Roy S.C. 2009 Genetic diversity of Amaranthus species from the Indo-Gangetic Plains revealed by RAPD analysis leading to the development of ecotype-specific SCAR marker [J. Hered.] Vol.100 P.338-347
  • 20. Sauer J.D. 1967 The grain amaranths and their relatives: a revised taxonomic and geographic survey [Ann. Missouri. Bot. Gard.] Vol.54 P.103-137
  • 21. Schneider S., Excoffier L. 1999 Estimation of past demographic parameters from the distribution of pairwise differences when the mutation rates vary among Sites: Application to human mitochondrial DNA [Genetics] Vol.152 P.1079-1089
  • 22. Schuelke M. 2000 An economic method for the fluorescent labeling of PCR products [Nat. Biotechnol.] Vol.18 P.233-234
  • 23. Tam S.M., Mhiri C., Vogelaar A., Kerkveld M., Pearce S.R. 2005 Comparative analyses of genetic diversities within tomato and pepper collections detected by retrotransposon-based SSAP, AFLP and SSR [Theor. Appl. Genet.] Vol.110 P.819-831
  • 24. Tamura K., Dudley J., Nei M., Kumar S. 2007 MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0 [Mol. Biol. Evol.] Vol.24 P.1596-1599
  • 25. Wassom J.J., Tranel P.J. 2005 Amplified fragment length polymorphism based genetic relationships among weedy Amaranthus species [J. Hered.] Vol.96 P.410-416
  • 26. Wetzel D., Michael K., Horak J., Skinner D.J. 1999 Use of PCR-based molecular markers to identify weedy Amaranthus species [Weed Science] Vol.7 P.518-523
  • 27. Wolfe M.S. 1985 The current status and prospects of multiline cultivars and variety mixtures for disease resistance [Annu. Rev. Phytopathol.] Vol.23 P.251-273
  • 28. Xu F., Sun M. 2001 Comparative analysis of phylogenetic relationships of grain amaranths and their wild relatives (Amaranthus; Amaranthaceae) using internal transcribed spacer, amplified fragment length polymorphism, and double-primer fluorescent intersimple sequence repeat markers [Mol. Phylogenet. Evol.] Vol.21 P.372-387
  • 29. Zhao W., Chung J.W., Ma K.H., Kim T.S. 2009 Analysis of genetic diversity and population structure of rice cultivars from Korea, China and Japan using SSR markers [Genes Genom.] Vol.31 P.283-292
  • [Table 1.] The 82 amaranth accessions used in this study
  • [Table 2.] Size range, number of alleles, number of rare alleles, major allele frequency, expected heterozygosity, and polymorphism information content index for 14 simple sequence repeat loci in 82 accessions, including eight Australian accessions.
  • [Fig. 1.] Histogram of allele frequencies for all 122 alleles in the 82 amaranth accessions.
  • [Fig. 2.] Determination of the K value in the Structure analysis. The red line shows the log-likelihood of the data (n = 82), L(K), as a function of K (the number of groups used to stratify the sample). The blue line shows the values of ΔK, the model value used to detect the true K (K = 3).
  • [Table 3.] Characterization of polymorphism for each country.
  • [Table 4.] The diversity information and FST values of the three clusters.
  • [Fig. 3.] Model-based clustering for each of the 82 amaranth accessions examined based on the 14 SSR markers used to build the Q matrix.
  • [Fig. 4.] NJ dendrogram based on a genetic distance matrix among 82 amaranth accessions. The branch colors correspond to the model-based clusters revealed by Structure analysis. Different shapes reflect different countries.