Study of Molecular phylogenetic and matk protein structure in "Susan -e Chelcheragh" (Lilium ledebourii [Baker] Boiss)

Document Type : scientific research article

Authors

Dept. of Horticultural Sciences, University of Tehran, Karaj, Iran

Abstract

Background and objectives: Chloroplastic matk gene is one of the best genes for a plant barcode and study of phylogenic/evolutionary relationships between plant species and families, possesses a high evolutionary rate at a nucleotide/amino acid level. Lilium, with ~100 species, is the most important genus in the Liliaceae that have been classified. The classification of Liliaceae is continuously studied morphologically, cariologically, and phylogenetically. New molecular phylogenic approaches have lead in fundamental changes in this classification. Lilium ledebourii is a valuable species endemic to Iran, but it is seriously endangered. Notwithstanding its valuable characteristics, Lilium ledebourii has never been phylogenetically studied together with the other members of the Liliaceae. Using matk sequence, we studied the taxonomic position and evolutionary relationships of Lilium ledebourii and 41 other Lilium species.
Materials and methods: First, DNA was extracted by using Qiagen DNEasy. After verifying DNA quality and quantity, real-time sequencing (SMRT) was performed using the next generation PacBio platform and a complete matK sequence was obtained using bioinformatics tools. Finally, it was registered in NCBI (ID: MN557236 - Pending publishing). For the study of the phylogenetic relationships, matk sequences of the 41 species were downloaded from NCBI. The sequences were aligned by Mega7 software, following the ClustalW method. Phylogenetic tree was drawn according to the maximum likelihood method. Genetic distance was calculated by the K2P method. The maximum composite likelihood of the nucleotide substitution pattern was estimated using the substitution matrix. By using FFPred, InterProScan, TargetP, and Phyre2, secondary structures such as Alpha helix, Beta turn, and random coil, as well as the tertiary structure of the protein, were studied.
Findings: According to the phylogenetic tree, the Lilium species were classified into 4 clusters: A, B, C, and D. The Iranian species was placed in cluster B. According to phylogenic topology as well as to the study of genetic distance, Lilium ledebourii had the highest similarities to L. pyrenaicum, L. ciliatum, and L. candidum with 97% bootstrap. In bioinformatic analysis, a high level of conservation was observed in the alignment of this protein. Lilium ledebourii contained arginine at position #271, which was similar only to L. pyrenaicum, L. ciliatum, and L. candidum. The other species had lysine at this position. The species displayed many differences in the positions of amino acids, such as at #317, #346, #363, and #417. The study of the similarities of the secondary structure of matk in L. ledebourii showed this protein to have 233 amino acids at α helix (45.51%), 24 amino acids at β turn (4.69%), and 156 amino acids at random coil (30.74%).
Conclusions: The present study analyzed matk in Lilium ledebourii for the first time, revealing its secondary structure, the position of α- helix and β- turn. Additionally, the Tertiary structure of this protein was proposed for the first time. According to phylogenic topology as well as to the study of genetic distance, Lilium ledebourii had the highest similarities to L. pyrenaicum, L. ciliatum, and L. candidum. Overall, the molecular relationship (phylogenetics and protein structure) of Lilium ledebourii with other Lilium species was studied.

Keywords


1.Amandita, F.Y., Rembold, K., Vornam, B., Rahayu, S., Siregar, I.Z., Kreft, H. and Finkeldey, R. 2019. DNA barcoding of flowering plants in Sumatra, Indonesia. Ecol. Evol. 9: 4. 1858-1868.
2.APG. 2009. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Bot. J. Linn. Soc. 161: 105-121.
3.Bakker, F.T., Culham, A., Gomez-Martinez, R., Carvalho, J., Compton, J., Dawtrey, R. and Gibby, M. 2000. Patterns of nucleotide substitution in angiosperm cpDNA trnL (UAA)–trnF (GAA) regions. Mol. Biol. Evol. 17: 8. 1146-1155.
4.Combet, C., Blanchet, C., Geourjon, C. and Deleage, G. 2000. NPS@: network protein sequence analysis. Trends Biochem. Sci. 25: 3. 147-150.
5.Cozzetto, D., Minneci, F., Currant, H. and Jones, D.T. 2016. FFPred 3: feature-based function prediction for all Gene Ontology domains. Sci. Rep. 6: 31865.
6.Darabi, M., Masoudi-Nejad, A. and Nemat-Zadeh, G. 2012. Bioinformatics study of the 3-hydroxy-3-methylglotaryl-coenzyme A reductase (HMGR) genein Gramineae. Mol. Boil. Rep.39: 9. 8925-8935.
7.Hayashi, K. and Kawano, S. 2000. Molecular systematics of Lilium and allied genera (Liliaceae): phylogeneticrelationships among Lilium and related genera based on the rbcL and matkgene sequence data. Plant Spec. Biol.15: 1. 73-93.
8.Hollingsworth, P.M., Graham, S.W.and Little, D.P. 2011. Choosing and using a plant DNA barcode. PloS One.6: 5. e19254.
9.Ikinci, N. 2011. Molecular phylogeny and divergence times estimates of Lilium section Liriotypus (Liliaceae) based on plastid and nuclear ribosomal ITSDNA sequence data. Turk. J. Bot.35: 4. 319-330.
10.Kelley, L.A., Mezulis, S., Yates, C.M., Wass, M.N. and Sternberg, M.J. 2015. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Prot. 10: 6. 845.
11.Kim, J.S. and Kim, J.H. 2018. Updated molecular phylogenetic analysis, dating and biogeographical history of the lily family (Liliaceae: Liliales). Bot. J. Linn. Soc. 187: 4. 579-593.
12.Lee, C.S., Kim, S.C., Yeau, S.H. and Lee, N.S. 2011. Major lineages of the genus Lilium (Liliaceae) based on nrDNA ITS sequences, with special emphasis on the Korean species. J. Plant Biol. 54: 3. 159-171.
13.Muratovic, E., Hidalgo, O., Garnatje,T. and Siljak-Yakovlev, S. 2010. Molecular phylogeny and genome size in European lilies (Genus Lilium, Liliaceae). Adv. Sci. Lett. 3: 2. 180-189.
14.Nuka, G., Fraser, M., Mitchell, A., Potter, S., Yong, S.Y. and Finn,R.D. 2017. InterProScan: Protein sequence analysis and classification. F1000 Research. 6.
15.Pang, X., Song, J., Zhu, Y., Xu,H., Huang, L. and Chen, S.2011. Applying plant DNA barcodesfor Rosaceae species identification. Cladistics. 27: 2. 165-170.
16.Picoult-Newberg, L., Ideker, T.E., Pohl, M.G., Taylor, S.L., Donaldson, M.A., Nickerson, D.A. and Boyce-Jacino,
M. 1999. Mining SNPs from EST databases. Genom. Res. 9: 2. 167-174.
17.Sanders, E.R., Karol, K.G. and McCourt, R.M. 2003. Occurrenceof matK in a trnK group II intronin charophyte green algae and phylogeny of the Characeae. Am. J.Bot. 90: 4. 628-633.
18.Stevens, P.S. 2015. Angiosperm phylogeny website. http://www.mobot.org/ MOBOT/research/APweb/welcome.html.
19.Yu, J., Xue, J.H. and Zhou, S.L. 2011. New universal matk primers for DNA barcoding angiosperms. J. Syst. Evol. 49: 3. 176-181.