Computational Cost of Querying for Related Entities in Different Ontologies

The computational cost of querying for similar entities across ontologies is high since, in the worst case, every pair of entities will have to be considered. Therefore, links discovered during ontology alignment have been used to speed up querying across ontologies by following relatedness links to discover similar entities. We derive the computational complexity of querying across ontologies using the ontology alignment links discovered using the Unified Fuzzy Ontology Matching (UFOM) framework. We consider querying for related entities by following either a single alignment link or by following multiple alignment links. These methods have different computational complexity and produce different query results. We also study the impact of the specific implementation approach on query time. We consider implementations based on multiple accesses of the triplestore using a high-level procedural language and by execution of a single SPARQL graph query on the ontology server. These approaches were evaluated using ontologies derived from an enterprise-scale dataset. Experimental results show that an implementation using nested for-loops in a procedural language outperformed by nearly 2× an implementation based on a single SPARQL query.

[1]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[2]  Masaki Aono,et al.  Resolving scalability issue to ontology instance matching in Semantic Web , 2012, 2012 15th International Conference on Computer and Information Technology (ICCIT).

[3]  Masaki Aono,et al.  Anchor-Flood: Results for OAEI 2009 , 2009, OM.

[4]  Andreas Thor,et al.  Instance-Based Matching of Large Life Science Ontologies , 2007, DILS.

[5]  Felix Naumann,et al.  A Machine Learning Approach to Foreign Key Discovery , 2009, WebDB.

[6]  M. Larsen Record Linkage Modeling in Federal Statistical Databases , 2010 .

[7]  Jérôme Euzenat,et al.  Ontology Matching: State of the Art and Future Challenges , 2013, IEEE Transactions on Knowledge and Data Engineering.

[8]  Erhard Rahm,et al.  Similarity flooding: a versatile graph matching algorithm and its application to schema matching , 2002, Proceedings 18th International Conference on Data Engineering.

[9]  Beng Chin Ooi,et al.  On multi-column foreign key discovery , 2010, Proc. VLDB Endow..

[10]  Hyoil Han,et al.  A survey on ontology mapping , 2006, SGMD.

[11]  Felix Naumann,et al.  Efficiently Detecting Inclusion Dependencies , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[12]  Viktor K. Prasanna,et al.  UFOM: Unified fuzzy ontology matching , 2014, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014).

[13]  Viviana Mascardi,et al.  Automatic Ontology Matching via Upper Ontologies: A Systematic Evaluation , 2010, IEEE Transactions on Knowledge and Data Engineering.