论文信息 - Intelligence and Security Informatics

Intelligence and Security Informatics

Information extraction (IE) is of great importance in many applications including web intelligence, search engines, text understanding, etc. To extract information from text documents, most IE systems rely on a set of extraction patterns. Each extraction pattern is defined based on the syntactic and/or semantic constraints on the positions of desired entities within natural language sentences. The IE systems also provide a set of pattern templates that determines the kind of syntactic and semantic constraints to be considered. In this paper, we argue that such pattern templates restricts the kind of extraction patterns that can be learned by IE systems. To allow a wider range of context information to be considered in learning extraction patterns, we first propose to model the content and context information of a candidate entity to be extracted as a set of features. A classification model is then built for each category of entities using Support Vector Machines (SVM). We have conducted IE experiments to evaluate our proposed method on a text collection in the terrorism domain. From the preliminary experimental results, we conclude that our proposed method can deliver reasonable accuracies.

[1] Edsger W. Dijkstra,et al. A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[2] Richard L. Francis,et al. Network models for building evacuation , 1982 .

[3] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[4] Éva Tardos,et al. “The quickest transshipment problem” , 1995, SODA '95.

[5] Rajeev Motwani,et al. Randomized Algorithms , 1995, SIGA.

[6] Roberto Battiti,et al. Reactive Local Search for the Maximum Clique Problem1 , 2001, Algorithmica.

[7] C. McDiarmid. SIMULATED ANNEALING AND BOLTZMANN MACHINES A Stochastic Approach to Combinatorial Optimization and Neural Computing , 1991 .

[8] Sean R. Eddy,et al. Biological sequence analysis: Preface , 1998 .

[9] R. K. Shyamasundar,et al. Introduction to algorithms , 1996 .

[10] S. Wasserman,et al. Social Network Analysis: Computer Programs , 1994 .

[11] Butler,et al. The Dynamics of Cyberspace: Examining and Modelling Online Social Structure , 1999 .

[12] T. Snijders. The statistical evaluation of social network dynamics , 2001 .

[13] D. Watts,et al. Small Worlds: The Dynamics of Networks between Order and Randomness , 2001 .

[14] Éva Tardos,et al. Polynomial time algorithms for some evacuation problems , 1994, SODA '94.

[15] Richard L. Francis,et al. EVACNET+: A computer program to determine optimal building evacuation plans , 1985 .