MSnet: A Neural Network which Classifies Mass Spectra

Abstract. We have designed a feed-forward neural network to classify low-resolution mass spectra of unknown compounds according to the presence or absence of 100 organic substructures. The neural network, MSnet, was trained to compute a maximum-likelihood estimate of the probability that each substructure is present. We discuss some design considerations and statistical properties of neural network classifiers, and the effect of various training regimes on generalization behavior. MSnet classifies mass spectra more reliably than other methods reported in the literature, and has other desirable properties.
