Abstract
Background: Automated annotation of MS spectra remains highly challenging. Therefore, the Critical Assessment of Small Molecule Identification (CASMI) Contest (http://casmi-contest.org/) represents a unique opportunity to blindly evaluate an annotation workflow.
Objective: The 2014 CASMI Contest consisted of 42 MS and MS2 spectra (from “molecules detected in mammalian biofluids and tissues”, “natural products of plant, fungal or bacterial origin”, “synthetic or semi-synthetic molecules”), which had to be identified (molecular formulae and structures). Methods: An R script based on the Rdisop and RMassBank packages was devised for the automated annotation of the provided spectra. Searching within various databases yielded the assignment of structures. Further discrimination between annotations was achieved using phylogenetic information and in silico fragmentation by competitive fragmentation modelling using CFM-ID. Results: Successes and failures of the proposed script were investigated after release of the CASMI Contest solutions. Conclusion: The CASMI Contest allowed identifying key points for the successful annotation of MS spectra. As an example, this study pointed out the importance of taking a large number of adducts into consideration (such as [M+H]+, [M+H-H2O]+, [M+Na]+, [M+K]+ and [M]+ in positive ionization) to avoid failed annotations.Keywords: Annotation, automated method, CASMI contest, dereplication, mass spectrometry, natural products.
Graphical Abstract