Abstract
Background: Also known as Simple Sequence Repetitions (SSRs), microsatellites are profoundly informative molecular markers and powerful tools in genetics and ecology studies on plants.
Objective: This research presents a workflow for developing microsatellite markers using genome skimming.
Methods: The pipeline was proposed in several stages that must be performed sequentially: obtaining DNA sequences, identifying microsatellite regions, designing primers, and selecting candidate microsatellite regions to develop the markers. Our pipeline efficiency was analyzed using Illumina sequencing data from the non-model tree species Pterodon emarginatus Vog.
Results: The pipeline revealed 4,382 microsatellite regions and drew 7,411 pairs of primers for P. emarginatus. However, a much larger number of microsatellite regions with the potential to develop markers were discovered from our pipeline. We selected 50 microsatellite regions with high potential for developing markers and organized 29 microsatellite regions in sets for multiplex PCR.
Conclusion: The proposed pipeline is a powerful tool for fast and efficient development of microsatellite markers on a large scale in several species, especially nonmodel plant species.
Keywords: Bioinformatics, genome skimming, PCR multiplex, primer design, Pterodon emarginatus, microsatellite markers.
Graphical Abstract
[http://dx.doi.org/10.3390/molecules23020399] [PMID: 29438290]
[http://dx.doi.org/10.1590/1678-4685-GMB-2016-0027] [PMID: 27561112]
[http://dx.doi.org/10.1038/srep32224] [PMID: 27561724]
[http://dx.doi.org/10.1016/j.cpb.2020.100167]
[http://dx.doi.org/10.1371/journal.pone.0237538] [PMID: 32804981]
[http://dx.doi.org/10.3390/ijms141122499] [PMID: 24240810]
[http://dx.doi.org/10.1371/journal.pone.0159825] [PMID: 27459734]
[http://dx.doi.org/10.4238/gmr16029639] [PMID: 28453176]
[http://dx.doi.org/10.1146/annurev.genom.9.081307.164359] [PMID: 18576944]
[http://dx.doi.org/10.1038/nrg2626] [PMID: 19997069]
[http://dx.doi.org/10.1016/j.gpb.2014.07.003] [PMID: 25132181]
[http://dx.doi.org/10.4236/ojgen.2012.24022]
[http://dx.doi.org/10.1590/S1984-82502015000100013]
[http://dx.doi.org/10.24863/rib.v8i1.32]
[http://dx.doi.org/10.1590/S1984-82502010000400002]
[http://dx.doi.org/10.1093/bioinformatics/btu170] [PMID: 24695404]
[http://dx.doi.org/10.1186/gb-2009-10-3-r25] [PMID: 19261174]
[http://dx.doi.org/10.1093/bioinformatics/btt476] [PMID: 23990416]
[http://dx.doi.org/10.1016/S0022-2836(05)80360-2] [PMID: 2231712]
[http://dx.doi.org/10.1093/bioinformatics/btp670] [PMID: 20007741]
[http://dx.doi.org/10.1016/j.ygeno.2017.05.005] [PMID: 28502701]
[http://dx.doi.org/10.4238/gmr18463]
[http://dx.doi.org/10.1186/s12864-017-3881-5] [PMID: 28732469]
[http://dx.doi.org/10.3732/apps.1500025] [PMID: 26312192]
[http://dx.doi.org/10.1186/s12864-018-4702-1] [PMID: 29695227]