Gabriel Asher

LSM-MS2: A Foundation Model Bridging Spectral Identification and Biological Interpretation

Oct 30, 2025

Gabriel Asher, Devesh Shah, Amy A. Caudy, Luke Ferro, Lea Amar, Ana S. H. Costa, Thomas Patton, Niall O'Connor, Jennifer M. Campbell, Jack Geremia

Abstract: A vast majority of mass spectrometry data remains uncharacterized, leaving much of its biological and chemical information untapped. Recent advances in machine learning have begun to address this gap, particularly for tasks such as spectral identification in tandem mass spectrometry data. Here, we present the latest generation of LSM-MS2, a large-scale deep learning foundation model trained on millions of spectra to learn a semantic chemical space. LSM-MS2 achieves state-of-the-art performance in spectral identification, improving on existing methods by 30% in accuracy of identifying challenging isomeric compounds, yielding 42% more correct identifications in complex biological samples, and maintaining robustness under low-concentration conditions. Furthermore, LSM-MS2 produces rich spectral embeddings that enable direct biological interpretation from minimal downstream data, successfully differentiating disease states and predicting clinical outcomes across diverse translational applications.

Not cool, calm or collected: Using emotional language to detect COVID-19 misinformation

Mar 27, 2023

Gabriel Asher, Phil Bohlman, Karsten Kleyensteuber

Abstract: COVID-19 misinformation on social media platforms such as Twitter is a threat to effective pandemic management. Prior work on COVID-19 misinformation in tweets neglects the role of semantic features common to Twitter, such as charged emotions. Thus, we present a novel COVID-19 misinformation model, which uses both a tweet emotion encoder and a COVID-19 misinformation encoder to predict whether a tweet contains COVID-19 misinformation. Our emotion encoder was fine-tuned on a novel annotated dataset, and our COVID-19 misinformation encoder was fine-tuned on a subset of the COVID-HeRA dataset. Experiments show that combining the emotion and misinformation encoders outperforms a misinformation classifier alone. Furthermore, an extensive analysis of the results highlights low-quality labels and mismatched label distributions as key limitations of our study.
