Semantic, Fuzzy Set and Fuzzy Measure Similarity for the Gene Ontology

Innovations in computational biology such as high throughput genomic technologies have resulted in numerous large databases containing genomic information which are being annotated with terms from the gene ontology to ensure consistency of the referenced biological concepts. These annotations are being used in determining the similarity between genes and gene products, an important task in post-genomics study. This paper presents an investigation of similarity measures between annotated objects with a focus on the domain of computational biology and the gene ontology. A framework for experimenting with and developing new similarity measures between annotated objects in other domains is discussed in the context of the querying with ontological terminologies and their annotations (QUOTA) system currently being developed.

[1]  Valerie V. Cross,et al.  Geometric compatibility modification , 1996, Fuzzy Sets Syst..

[2]  Mi Zhou,et al.  GeneInfoViz: Constructing and visualizing gene relation networks , 2004, Silico Biol..

[3]  Valerie V. Cross,et al.  Semantic Relatedness Measures in Ontologies Using Information Content and Fuzzy Set Theory , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..

[4]  Cliff Joslyn,et al.  The Gene Ontology Categorizer , 2004, ISMB/ECCB.

[5]  Kaizhong Zhang,et al.  Algorithms for Approximate Graph Matching , 1995, Inf. Sci..

[6]  Kuo-Chen Chou,et al.  Predicting protein localization in budding Yeast , 2005, Bioinform..

[7]  Tony Veale,et al.  An Intrinsic Information Content Metric for Semantic Similarity in WordNet , 2004, ECAI.

[8]  J.C. Bezdek,et al.  Functional summarization of gene product clusters using Gene Ontology similarity measures , 2004, Proceedings of the 2004 Intelligent Sensors, Sensor Networks and Information Processing Conference, 2004..

[9]  Ying Xu,et al.  Prediction of functional modules based on comparative genome analysis and Gene Ontology application , 2005, Nucleic acids research.

[10]  Carole A. Goble,et al.  Investigating Semantic Similarity Measures Across the Gene Ontology: The Relationship Between Sequence and Annotation , 2003, Bioinform..

[11]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[12]  W. Pedrycz,et al.  An introduction to fuzzy sets : analysis and design , 1998 .

[13]  M. Sugeno,et al.  An interpretation of fuzzy measures and the Choquet integral as an integral with respect to a fuzzy , 1989 .

[14]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[15]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[16]  A. Tversky Features of Similarity , 1977 .

[17]  Operational approach to general fuzzy measures , 2000 .

[18]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[19]  Thomas Sudkamp,et al.  Similarity and Compatibility in Fuzzy Set Theory , 2002 .

[20]  Jennifer Widom,et al.  Exploiting hierarchical domain structure to compute similarity , 2003, TOIS.

[21]  James M. Keller,et al.  Fuzzy Measures on the Gene Ontology for Gene Product Similarity , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[22]  Olivier Bodenreider,et al.  Ontology-driven similarity approaches to supporting gene func- tional assessment , 2005 .

[23]  Hai Hu,et al.  Semantic similarity-based validation of human protein-protein interactions , 2005, 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05).