Molecular Structure Elucidation Using Ant Colony Optimization: A Preliminary Study

Identifying the structure of unknown molecules is an important activity in the pharmaceutical industry where it underpins the production of new drugs and the analysis of complex biological samples. We present here a new method for automatically identifying the structure of an unknown molecule from its nuclear magnetic resonance (NMR) spectrum. In the technique, an ant colony optimization algorithm is used to search iteratively the highly-constrained space of feasible molecular structures, evaluating each one by reference to NMR information on known molecules stored (in a raw form) in a database. Unlike existing structure elucidation systems, ours: does not need prior training or use spectrum prediction; does not rely on expert rules; and avoids enumeration of all possible candidate structures. We describe the important elements of the system here and include results on a preliminarytest set of molecules. Whilst the results are currently too limited to allow parameter studies or comparison to other methods, they nevertheless indicate the system is working acceptably and shows considerable promise.

[1]  Dennis H. Smith,et al.  Applications of artificial intelligence for chemical inference. XVII. Approach to computer-assisted elucidation of molecular structure , 1975 .

[2]  Antony J. Williams,et al.  Application of a new expert system for the structure elucidation of natural products from their 1D and 2D NMR data. , 2002, Journal of natural products.

[3]  Thomas Stützle,et al.  The MAX–MIN Ant System and Local Search for Combinatorial Optimization Problems: Towards Adaptive Tools for Global Optimization , 1997 .

[4]  Dennis H. Smith,et al.  Applications of artificial intelligence for chemical inference. 37. GENOA: a computer program for structure elucidation utilizing overlapping and alternative substructures , 1981 .

[5]  J. Munkres Algorithms for the Assignment and Transportation Problems , 1957 .

[6]  Dennis H. Smith,et al.  Applications of artificial intelligence for chemical inference—XXI: Chemical studies of marine invertebrates—XVII. The computer-assisted identification of [+]-palustrol in the marine organism Cespitularia sp., aff. subviridis☆☆☆ , 1976 .

[7]  M. Elyashberg,et al.  X-PERT: a user-friendly expert system for molecular structure elucidation by spectral methods , 1997 .

[8]  Thomas Stützle,et al.  Ant Colony Optimization Theory , 2004 .

[9]  D. Kell Systems biology, metabolic modelling and metabolomics in drug discovery and development. , 2006, Drug discovery today.

[10]  N. S. Sridharan,et al.  Applications of Artificial Intelligence for Chemical Inference. XVI. Computer Generation of Vertex-Graphs and Ring Systems , 1975, J. Chem. Inf. Comput. Sci..

[11]  Douglas B Kell,et al.  Metabolomic biomarkers: search, discovery and validation , 2007, Expert review of molecular diagnostics.

[12]  J. Meiler,et al.  Genius: a genetic algorithm for automated structure elucidation from 13C NMR spectra. , 2002, Journal of the American Chemical Society.

[13]  Thomas Stützle,et al.  Ant Colony Optimization , 2009, EMO.

[14]  David W. Aha,et al.  Lazy Learning , 1997, Springer Netherlands.

[15]  Silvano Martello,et al.  Meta-Heuristics: Advances and Trends in Local Search Paradigms for Optimization , 2012 .

[16]  J. Bright,et al.  Towards the automatic analysis of 1H NMR spectra: Part 3. Confirmation of postulated chemical structure , 2002 .

[17]  Thomas Stützle,et al.  MAX-MIN Ant System , 2000, Future Gener. Comput. Syst..

[18]  Luca Maria Gambardella,et al.  An Ant Colony System Hybridized with a New Local Search for the Sequential Ordering Problem , 2000, INFORMS J. Comput..

[19]  Yoshihiro Kudo,et al.  Structure elucidation system using structural information from multisources: CHEMICS , 1985, J. Chem. Inf. Comput. Sci..