9.9
CiteScore
7.1
Impact Factor
Turn off MathJax
Article Contents

Assembly of a high-quality reference genome and characterization of a chemical-mutagenized library of an elite soybean cultivar Tianlong 1

doi: 10.1016/j.jgg.2025.08.006
Funds:

This work was supported by the National Natural Science Foundation of China (grant number 31970344) and Joint Funds of the Natural Science Foundation of Hainan Province (grant number 2021JJLH0065).

  • Received Date: 2025-06-01
  • Accepted Date: 2025-08-13
  • Rev Recd Date: 2025-08-12
  • Available Online: 2025-08-26
  • Soybean (Glycine max L.) is a globally vital crop for oil production and food security. High-quality genomic resources are instrumental for both functional genomics and breeding. Here, we report a near-complete, high-quality genome assembly of the elite cultivar Tianlong 1 (TL1), featuring fully resolved telomeres and centromeres, as well as a gap-free assembly of 14 of its 20 chromosomes. On the basis of the genome assembly, we generate an ethyl methanesulfonate (EMS)-mutagenized population comprising 2,555 M7 plants. Whole-genome re-sequencing of 288 EMS mutants uncovers 1,163,869 high-confidence single-nucleotide polymorphisms (SNPs) and 542,709 insertions/deletions (InDels), achieving 91.89% coverage of predicted protein-coding genes. Phenotypic screening demonstrates robust genotype–phenotype associations, with two nonsynonymous mutants displaying pronounced defects in seed and leaf development. Collectively, the chromosome-scale TL1 genome assembly and the extensively characterized mutant population establish valuable resources for functional genomics and precision breeding in soybean and related legume species.
  • loading
  • Allen, G.C., Flores-Vergara, M.A., Krasynanski, S., Kumar, S., Thompson, W.F., 2006. A modified protocol for rapid DNA isolation from plant tissues using cetyltrimethylammonium bromide. Nat. Protoc. 1, 2320-2325.
    Bolger, A.M., Lohse, M., Usadel, B., 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114-2120.
    Campbell, M.S., Law, M., Holt, C., Stein, J.C., Moghe, G.D., Hufnagel, D.E., Lei, J., Achawanantakun, R., Jiao, D., Lawrence, C.J., et al., 2014. MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations. Plant Physiol. 164, 513-524.
    Cheng, H., Concepcion, G.T., Feng, X., Zhang, H., Li, H., 2021. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170-175.
    Chen, Q., Yang, C., Zhang, G., Wu, D., 2024. GCI: a continuity inspector for complete genome assembly. Bioinformatics 40, btae633.
    Danecek, P., Bonfield, J.K., Liddle, J., Marshall, J., Ohan, V., Pollard, M.O., Whitwham, A., Keane, T., McCarthy, S.A., Davies, R.M., et al., 2021. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008.
    Deng, Y., Liu, S., Zhang, Y., Tan, J., Li, X., Chu, X., Xu, B., Tian, Y., Sun, Y., Li, B., et al., 2022. A telomere-to-telomere gap-free reference genome of watermelon and its mutation library provide important resources for gene discovery and breeding. Mol. Plant 15, 1268-1284.
    Espina, M.J., Lovell, J.T., Jenkins, J., Shu, S., Sreedasyam, A., Jordan, B.D., Webber, J., Boston, L., Bruna, T., Talag, J., et al., 2024. Assembly, comparative analysis, and utilization of a single haplotype reference genome for soybean. Plant J. 120, 1221-1235.
    Fang, C., Li, C., Li, W., Wang, Z., Zhou, Z., Shen, Y., Wu, M., Wu, Y., Li, G., Kong, L.A., et al., 2014. Concerted evolution of D1 and D2 to regulate chlorophyll degradation in soybean. Plant J. 77, 700-712.
    Ge, F., Zheng, N., Zhang, L., Huang, W., Peng, D., Liu, S., 2018. Chemical mutagenesis and soybean mutants potential for identification of novel genes conferring resistance to soybean cyst nematode. J. Integr. Agric. 17, 2734-2744.
    Gill, N., Findley, S., Walling, J.G., Hans, C., Ma, J., Doyle, J., Stacey, G., Jackson, S.A., 2009. Molecular and chromosomal evidence for allopolyploidy in soybean. Plant Physiol. 151, 1167-1174.
    Goel, M., Sun, H., Jiao, W.B., Schneeberger, K., 2019. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277.
    Goettel, W., Zhang, H., Li, Y., Qiao, Z., Jiang, H., Hou, D., Song, Q., Pantalone, V.R., Song, B. H., Yu, D., et al., 2022. POWR1 is a domestication gene pleiotropically regulating seed quality and yield in soybean. Nat. Commun. 13, 3051.
    Greene, E.A., Codomo, C.A., Taylor, N.E., Henikoff, J.G., Till, B.J., Reynolds, S.H., Enns, L.C., Burtner, C., Johnson, J.E., Odden, A.R., et al., 2003. Spectrum of chemically induced mutations from a large-scale reverse-genetic screen in Arabidopsis. Genetics 164, 731-740.
    Guo, G., Xu, K., Zhang, X., Zhu, J., Lu, M., Chen, F., Liu, L., Xi, Z., Bachmair, A., Chen, Q., et al., 2015. Extensive analysis of GmFTL and GmCOL expression in northern soybean cultivars in field conditions. PLoS One 10, e0136601.
    He, W., Yang, J., Jing, Y., Xu, L., Yu, K., Fang, X., 2023. NGenomeSyn: an easy-to-use and flexible tool for publication-ready visualization of syntenic relationships across multiple genomes. Bioinformatics 39, btad121.
    Huang, Y., Koo, D.H., Mao, Y., Herman, E.M., Zhang, J., Schmidt, M.A., 2024. A complete reference genome for the soybean cv. Jack. Plant Commun. 5, 100765.
    Jia, J., Ji, R., Li, Z., Yu, Y., Nakano, M., Long, Y., Feng, L., Qin, C., Lu, D., Zhan, J., et al., 2020. Soybean DICER-LIKE2 regulates seed coat color via production of primary 22-nucleotide small Interfering RNAs from long inverted repeats. Plant Cell 32, 3662-3673.
    Kolmogorov, M., Bickhart, D.M., Behsaz, B., Gurevich, A., Rayko, M., Shin, S.B., Kuhn, K., Yuan, J., Polevikov, E., Smith, T.P.L., et al., 2020. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat. Methods 17, 1103-1110.
    Koren, S., Walenz, B.P., Berlin, K., Miller, J.R., Bergman, N.H., Phillippy, A.M., 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722-736.
    Li, H., Durbin, R., 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754-1760.
    Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., Durbin, R., 2009. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078-2079.
    Li, K., Xu, P., Wang, J., Yi, X., Jiao, Y., 2023. Identification of errors in draft genome assemblies at single-nucleotide resolution for quality assessment and improvement. Nat. Commun. 14, 6556.
    Li, Z., Jiang, L., Ma, Y., Wei, Z., Hong, H., Liu, Z., Lei, J., Liu, Y., Guan, R., Guo, Y., et al., 2017. Development and utilization of a new chemically-induced soybean library with a high mutation density. J. Integr. Plant Biol. 59, 60-74.
    Liang, S., Duan, Z., He, X., Yang, X., Yuan, Y., Liang, Q., Pan, Y., Zhou, G., Zhang, M., Liu, S., et al., 2024. Natural variation in GmSW17 controls seed size in soybean. Nat. Commun. 15, 7417.
    Mach, J., 2017. Saddle up, soybean seed pigments: Argonaute5 in spatially regulated silencing of chalcone synthase genes. Plant Cell 29, 604.
    Marcais, G., Delcher, A.L., Phillippy, A.M., Coston, R., Salzberg, S.L., Zimin, A., 2018. MUMmer4: a fast and versatile genome alignment system. PLoS Comput. Biol. 14, e1005944.
    McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., et al., 2010. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297-1303.
    Pendleton, M., Sebra, R., Pang, A.W., Ummat, A., Franzen, O., Rausch, T., Stutz, A.M., Stedman, W., Anantharaman, T., Hastie, A., et al., 2015. Assembly and diploid architecture of an individual human genome via single-molecule technologies. Nat. Methods 12, 780-786.
    Ruan, J., Li, H., 2020. Fast and accurate long-read assembly with wtdbg2. Nat. Methods 17, 155-158.
    Schmutz, J., Cannon, S.B., Schlueter, J., Ma, J., Mitros, T., Nelson, W., Hyten, D.L., Song, Q., Thelen, J.J., Cheng, J., et al., 2010. Genome sequence of the palaeopolyploid soybean. Nature 463, 178-183.
    Stanke, M., Diekhans, M., Baertsch, R., Haussler, D., 2008. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 24, 637-644.
    Sun, X., Li, X., Lu, Y., Wang, S., Zhang, X., Zhang, K., Su, X., Liu, M., Feng, D., Luo, S., et al., 2022. Construction of a high-density mutant population of Chinese cabbage facilitates the genetic dissection of agronomic traits. Mol. Plant 15, 913-924.
    Takagi, H., Tamiru, M., Abe, A., Yoshida, K., Uemura, A., Yaegashi, H., Obara, T., Oikawa, K., Utsushi, H., Kanzaki, E., et al., 2015. MutMap accelerates breeding of a salt-tolerant rice cultivar. Nat. Biotechnol. 33, 445-449.
    Tarasov, A., Vilella, A.J., Cuppen, E., Nijman, I.J., Prins, P., 2015. Sambamba: fast processing of NGS alignment formats. Bioinformatics 31, 2032-2034.
    Tek, A.L., Kashihara, K., Murata, M., Nagaki, K., 2010. Functional centromeres in soybean include two distinct tandem repeats and a retrotransposon. Chromosome Res. 18, 337-347.
    Tian, Z., Nepomuceno, A.L., Song, Q., Stupar, R.M., Liu, B., Kong, F., Ma, J., Lee, S.H., Jackson, S.A., 2025. Soybean2035: a decadal vision for soybean functional genomics and breeding. Mol. Plant 18, 245-271.
    Valliyodan, B., Cannon, S.B., Bayer, P.E., Shu, S., Brown, A.V., Ren, L., Jenkins, J., Chung, C.Y., Chan, T.F., Daum, C.G., et al., 2019. Construction and comparison of three reference-quality genome assemblies for soybean. Plant J. 100, 1066-1082.
    Wang, D., Li, Y., Wang, H., Xu, Y., Yang, Y., Zhou, Y., Chen, Z., Zhou, Y., Gui, L., Guo, Y., et al., 2023a. Boosting wheat functional genomics via an indexed EMS mutant library of KN9204. Plant Commun. 4, 100593.
    Wang, K., Li, M., Hakonarson, H., 2010. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164.
    Wang, L., Zhang, M., Li, M., Jiang, X., Jiao, W., Song, Q., 2023b. A telomere-to-telomere gap-free assembly of soybean genome. Mol. Plant 16, 1711-1714.
    Waterhouse, R.M., Seppey, M., Simao, F.A., Manni, M., Ioannidis, P., Klioutchnikov, G., Kriventseva, E.V., Zdobnov, E.M., 2018. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 35, 543-548.
    Xiao, C., Chen, Y., Xie, S., Chen, K., Wang, Y., Han, Y., Luo, F., Xie, Z., 2017. MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads. Nat. Methods 14, 1072-1074.
    Yuan, X., Jiang, X., Zhang, M., Wang, L., Jiao, W., Chen, H., Mao, J., Ye, W., Song, Q., 2024. Integrative omics analysis elucidates the genetic basis underlying seed weight and oil content in soybean. Plant Cell 36, 2160-2175.
    Zahra, S., Hussain, M., Zulfiqar, S., Ishfaq, S., Shaheen, T., Akhtar, M., Mehboob ur, R., 2022. EMS-based mutants are useful for enhancing drought tolerance in spring wheat. Cereal Res. Commun. 50, 767-778.
    Zhang, C., Xie, L., Yu, H., Wang, J., Chen, Q., Wang, H., 2023. The T2T genome assembly of soybean cultivar ZH13 and its epigenetic landscapes. Mol. Plant 16, 1715-1718.
    Zhang, J., Kudrna, D., Mu, T., Li, W., Copetti, D., Yu, Y., Goicoechea, J.L., Lei, Y., Wing, R.A., 2016. Genome puzzle master (GPM): an integrated pipeline for building and editing pseudomolecules from fragmented sequences. Bioinformatics 32, 3058-3064.
    Zhang, M., Zhang, X., Jiang, X., Qiu, L., Jia, G., Wang, L., Ye, W., Song, Q., 2022. iSoybean: a database for the mutational fingerprints of soybean. Plant Biotechnol. J. 20, 1435-1437. https://doi.org/10.1111/pbi.13844.
    Zhang, Y., Liu, S., Liang, X., Zheng, J., Lu, X., Zhao, J., Li, H., Zhan, Y., Teng, W., Li, H., et al., 2025. GmFER1, a soybean ferritin, enhances tolerance to salt stress and root rot disease and improves soybean yield. Plant Biotechnol. J. 23, 3094-3112.
    Zhong, X., Wang, J., Shi, X., Bai, M., Yuan, C., Cai, C., Wang, N., Zhu, X., Kuang, H., Wang, X., et al., 2024. Genetically optimizing soybean nodulation improves yield and protein content. Nat. Plants 10, 736-742.
    Zhou, Y., Zhang, T., Wang, X., Wu, W., Xing, J., Li, Z., Qiao, X., Zhang, C., Wang, X., Wang, G., et al., 2023. A maize epimerase modulates cell wall synthesis and glycosylation during stomatal morphogenesis. Nat. Comm. 14, 4384.
    Zhou, Z., Yu, Z., Huang, X., Liu, J., Guo, Y., Chen, L., Song, J., 2022. GenomeSyn: a bioinformatics tool for visualizing genome synteny and structural variations. J. Genet. Genomics 49, 1174-1176.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (3) PDF downloads (0) Cited by ()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return