论文信息 - Supernova Recognition Using Support Vector Machines

Supernova Recognition Using Support Vector Machines

We introduce a novel application of support vector machines (SVMs) to the problem of identifying potential supernovae using photometric and geometric features computed from astronomical imagery. The challenges of this supervised learning application are significant: 1) noisy and corrupt imagery resulting in high levels of feature uncertainty, 2) features with heavy-tailed, peaked distributions, 3) extremely imbalanced and overlapping positive and negative data sets, and 4) the need to reach high positive classification rates, i.e. to find all potential supernovae, while reducing the burdensome workload of manually examining false positives. High accuracy is achieved via a sign-preserving, shifted log transform applied to features with peaked, heavy-tailed distributions. The imbalanced data problem is handled by oversampling positive examples, selectively sampling misclassified negative examples, and iteratively training multiple SVMs for improved supernova recognition on unseen test data. We present cross-validation results and demonstrate the impact on a large-scale supernova survey that currently uses the SVM decision value to rank-order 600,000 potential supernovae each night

[1] Vladimir Vapnik,et al. Statistical learning theory , 1998 .

[2] M. Phillips,et al. Observational Evidence from Supernovae for an Accelerating Universe and a Cosmological Constant , 1998, astro-ph/9805201.

[3] R. Ellis,et al. Measurements of $\Omega$ and $\Lambda$ from 42 high redshift supernovae , 1998, astro-ph/9812133.

[4] Chris H. Q. Ding,et al. Multi-class protein fold recognition using support vector machines and neural networks , 2001, Bioinform..

[5] R. Bacon,et al. Overview of the Nearby Supernova Factory , 2002, SPIE Astronomical Telescopes + Instrumentation.

[6] Lei Xu,et al. Improved system for object detection and star/galaxy classification via local subspace analysis , 2003, Neural Networks.

[7] Yanxia Zhang,et al. Automated clustering algorithms for classification of astronomical objects , 2004, astro-ph/0403431.

[8] W. Wood-Vasey. Rates and progenitors of type Ia supernovae , 2004, astro-ph/0505604.

[9] Hwanjo Yu,et al. SVM selective sampling for ranking with application to data retrieval , 2005, KDD '05.

[10] Zeki Erdem,et al. Ensemble of SVMs for Incremental Learning , 2005, Multiple Classifier Systems.

[11] Xiangji Huang,et al. Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles , 2006, PAKDD.

[12] Sungzoon Cho,et al. EUS SVMs: Ensemble of Under-Sampled SVMs for Data Imbalance Problems , 2006, ICONIP.