Privacy Preservation in Streaming Data Collection

Big data management and analysis has become a hot topic in academic and industrial research. In fact, a large portion of big data in service today are initially streaming data. To preserve the privacy of such data that are collected from data streams, the most efficient way is to control the process of data collection according to corresponding privacy polices. In this paper, we design a framework to support data stream management with privacy-preserving capabilities. In particular, we focus on two premier principles of data privacy, limited disclosure and limited collection. With these two principles guaranteed, the archived data will not necessarily be checked for privacy protection, before analysis and other operations can be done.

[1]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[2]  Kian-Lee Tan,et al.  ACStream: Enforcing Access Control over Data Streams , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[3]  Ramakrishnan Srikant,et al.  Hippocratic Databases , 2002, VLDB.

[4]  Jörg Meier,et al.  Securing the Borealis Data Stream Engine , 2006, 2006 10th International Database Engineering and Applications Symposium (IDEAS'06).

[5]  David J. DeWitt,et al.  Limiting Disclosure in Hippocratic Databases , 2004, VLDB.

[6]  Dennis Shasha,et al.  StatStream: Statistical Monitoring of Thousands of Data Streams in Real Time , 2002, VLDB.

[7]  Kian-Lee Tan,et al.  A framework to enforce access control over data streams , 2010, TSEC.

[8]  Elisa Bertino,et al.  A Security Punctuation Framework for Enforcing Access Control on Streaming Data , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[9]  Elisa Bertino,et al.  Hippocratic Data Streams-Concepts, Architectures and Issues , 2005 .

[10]  Elisa Bertino,et al.  FENCE: Continuous access control enforcement in dynamic data stream environments , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[11]  Tony Fountain,et al.  The Ring Buffer Network Bus (RBNB) DataTurbine Streaming Data Middleware for Environmental Observing Systems , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).