Parallel Sequential Pattern Mining of Massive Trajectory Data

Trajectory pattern mining has recently attracted much attention owing to the rapid development of location-acquisition technologies, and parallel computing offers an alternative way to handle the problem. This study addresses the parallel mining of trajectory sequential patterns, building on newly proposed concepts in trajectory pattern mining. We propose an efficient and effective parallel sequential pattern mining algorithm (plute) that combines three techniques: prefix projection, a data parallel formulation, and a task parallel formulation. First, prefix projection decomposes the search space and greatly reduces the number of candidate trajectory sequences. Second, the data parallel formulation decomposes the computations associated with counting the support of trajectory patterns. Third, the task parallel formulation employs the MapReduce programming model to assign the computations across a set of machin...
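To make the three techniques concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes each trajectory has already been discretized into a sequence of region identifiers, and it stands in for MapReduce with an in-process map and reduce over partitions. All names (`project`, `local_support`, `global_support`, `mine`) and the parameter `n_parts` are hypothetical, chosen only for illustration.

```python
from collections import Counter
from functools import reduce


def project(database, item):
    """Prefix projection: keep the suffix after the first occurrence of item."""
    projected = []
    for seq in database:
        if item in seq:
            suffix = seq[seq.index(item) + 1:]
            if suffix:  # empty suffixes cannot extend the pattern
                projected.append(suffix)
    return projected


def local_support(partition):
    """Map step: per-partition count of sequences containing each item."""
    counts = Counter()
    for seq in partition:
        counts.update(set(seq))  # count each item once per sequence
    return counts


def global_support(database, n_parts):
    """Split the data, count each part independently, then merge (reduce step)."""
    parts = [database[i::n_parts] for i in range(n_parts)]
    return reduce(lambda a, b: a + b, map(local_support, parts), Counter())


def mine(database, min_support, n_parts=2, prefix=()):
    """Grow frequent sequential patterns recursively from projected databases."""
    patterns = {}
    for item, count in global_support(database, n_parts).items():
        if count >= min_support:
            new_prefix = prefix + (item,)
            patterns[new_prefix] = count
            # Each projected database is an independent subtask; in a
            # distributed setting these could be assigned to different machines.
            patterns.update(
                mine(project(database, item), min_support, n_parts, new_prefix)
            )
    return patterns


if __name__ == "__main__":
    # Hypothetical trajectories, already mapped to region identifiers.
    trajectories = [("A", "B", "C"), ("A", "C"), ("B", "C"), ("A", "B")]
    for pattern, support in sorted(mine(trajectories, min_support=2).items()):
        print(pattern, support)
```

In this sketch the support counts are merged by simple addition, so the result does not depend on how the database is partitioned; the recursion over projected databases is where a task parallel scheme would distribute work.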
