Committee Chair
Wang, Yingfeng
Committee Member
Liang, Yu; Jain, Hemant (Hemant K.); Qin, Hong; Wu, Dalei
College
College of Engineering and Computer Science
Publisher
University of Tennessee at Chattanooga
Place of Publication
Chattanooga (Tenn.)
Abstract
Tandem mass spectrometry (MS/MS) is a modern technique for measuring metabolites. MS/MS spectra represent molecules by the fragment patterns of compounds that contain structural features of the precursor molecules. The database-searching strategy is the most popular for metabolite identification among its peers. It matches the query MS/MS spectrum to a database of molecule candidates, identifying the metabolite that best matches the query spectrum. This study uses the database-searching strategy and focuses on developing a novel machine learning identification tool. This tool applies autoencoders to map metabolite structures and MS/MS spectra to latent spaces separately. Then, we train a classifier to identify real metabolite-spectrum matches based on the latent space features of metabolites and spectra. Further, we build a generative adversarial network (GAN) to optimize the classifier as the discriminator. A large number of experiments are conducted. The experimental results verify the effectiveness of our tool.
Acknowledgments
I would like to express my heartfelt gratitude to my supervisor, Dr. Yingfeng Wang, his unwavering support, insightful guidance, and invaluable feedback throughout the course of my research. His expertise and mentorship have been instrumental in shaping this dissertation. I would also like to express my heartfelt gratitude to my parents, Gang-Hwang and Li-Chen, for their unwavering support and encouragement throughout my academic journey. Your love and belief in me have been a constant source of motivation.
Degree
Ph. D.; A dissertation submitted to the faculty of the University of Tennessee at Chattanooga in partial fulfillment of the requirements of the degree of Doctor of Philosophy.
Date
12-2025
Subject
Deep learning (Machine learning); Generative adversarial networks (Computer networks); Metabolites--Identification; Tandem mass spectrometry
Document Type
Doctoral dissertations
DCMI Type
Text
Extent
xiv, 88 leaves
Language
English
Rights
http://rightsstatements.org/vocab/InC/1.0/
License
http://creativecommons.org/licenses/by-sa/4.0/
Date Available
1-1-2027
Recommended Citation
Tsai, Meng Hsiu, "Advance metabolite identification from tandem mass spectra using deep generative models" (2025). Masters Theses and Doctoral Dissertations. 
https://scholar.utc.edu/theses/1030
Department
Dept. of Computer Science and Engineering