Cognitive-Based Multi-document Summarization Approach

Automatic text summarization is an important and useful research area in natural language processing and information retrieval. Most of current approaches for text summarization do not make full use of human reading process. This paper proposes a multi-document scanning mechanism by simulating human reading process. The mechanism simulates human memory of words, association between words and three cognitive processes invoked when reading. Changes of human memory of topic words in reading process are used to denote sentences' significance, based on which sentences are then ordered and extracted to form a summary. Experiments on DUC2007 test data show that our proposing method is efficient and outperforms two baseline methods.

[1]  Hai Zhuge,et al.  Interactive semantics , 2010, Artif. Intell..

[2]  Hai Zhuge,et al.  Semantic linking through spaces for cyber-physical-socio intelligence: A methodology , 2011, Artif. Intell..

[3]  Jerry R. Hobbs,et al.  Learning by Reading: A Prototype System, Performance Baseline and Lessons Learned , 2007, AAAI.

[4]  Desney S. Tan,et al.  Learning to Learn: Algorithmic Inspirations from Human Problem Solving , 2012, AAAI.

[5]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies , 2000, ArXiv.

[6]  Hua Li,et al.  Document Summarization Using Conditional Random Fields , 2007, IJCAI.

[7]  Wai Lam,et al.  MEAD - A Platform for Multidocument Multilingual Text Summarization , 2004, LREC.

[8]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[9]  Hai Zhuge,et al.  Probabilistic Resource Space Model for Managing Resources in Cyber-Physical Society , 2012, IEEE Transactions on Services Computing.

[10]  Regina Barzilay,et al.  Sentence Fusion for Multidocument News Summarization , 2005, CL.

[11]  Kam-Fai Wong,et al.  Extractive Summarization Using Supervised and Semi-Supervised Learning , 2008, COLING.

[12]  Daniel Marcu,et al.  Summarization beyond sentence extraction: A probabilistic approach to sentence compression , 2002, Artif. Intell..

[13]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[14]  Regina Barzilay,et al.  Information Fusion in the Context of Multi-Document Summarization , 1999, ACL.

[15]  Anthony J. Hornof,et al.  Cognitive strategies and eye movements for searching hierarchical computer displays , 2003, CHI '03.

[16]  Jacob Beal,et al.  Self-Managing Associative Memory for Dynamic Acquisition of Expertise in High-Level Domains , 2009, IJCAI.

[17]  Hongyan Jing,et al.  Sentence Reduction for Automatic Text Summarization , 2000, ANLP.

[18]  Hai Zhuge,et al.  The Web Resource Space Model , 2008 .

[19]  David Reitter,et al.  Social Cognition: Memory Decay and Adaptive Information Filtering for Robust Information Maintenance , 2012, AAAI.

[20]  Hai Zhuge,et al.  Resource space model, OWL and database: Mapping and integration , 2008, TOIT.

[21]  Hai Zhuge,et al.  Communities and Emerging Semantics in Semantic Link Network: Discovery and Learning , 2009, IEEE Transactions on Knowledge and Data Engineering.

[22]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[23]  Eduard H. Hovy,et al.  The Automated Acquisition of Topic Signatures for Text Summarization , 2000, COLING.

[24]  Bei Xu,et al.  A Text Scanning Mechanism Simulating Human Reading Process , 2013, IJCAI.

[25]  Hai Zhuge,et al.  The Knowledge Grid:Toward Cyber-Physical Society , 2012 .

[26]  Allison Woodruff,et al.  Enhancing a digital book with a reading recommender , 2000, CHI.

[27]  Robert Dale,et al.  Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics , 1999 .

[28]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.