APCluster: an R package for affinity propagation clustering

SUMMARY Affinity propagation (AP) clustering has recently gained increasing popularity in bioinformatics. AP clustering has the advantage that it allows for determining typical cluster members, the so-called exemplars. We provide an R implementation of this promising new clustering technique to account for the ubiquity of R in bioinformatics. This article introduces the package and presents an application from structural biology. AVAILABILITY The R package apcluster is available via CRAN-The Comprehensive R Archive Network: http://cran.r-project.org/web/packages/apcluster CONTACT apcluster@bioinf.jku.at; bodenhofer@bioinf.jku.at.

[1]  Dirk Eddelbuettel,et al.  Rcpp: Seamless R and C++ Integration , 2011 .

[2]  C. Micchelli,et al.  Functions that preserve families of positive semidefinite matrices , 1995 .

[3]  Fan Yang,et al.  A Poisson-based adaptive affinity propagation clustering for SAGE data , 2010, Comput. Biol. Chem..

[4]  Sach Mukherjee,et al.  Temporal clustering by affinity propagation reveals transcriptional modules in Arabidopsis thaliana , 2010, Bioinform..

[5]  E. Klement,et al.  A fuzzy algorithm for pixel classification based on the discrepancy norm , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[6]  Jerzy Tiuryn,et al.  MODEVO: exploring modularity and evolution of protein interaction networks , 2010, Bioinform..

[7]  Michele Leone,et al.  Clustering by soft-constraint affinity propagation: applications to gene-expression data , 2007, Bioinform..

[8]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[9]  Roland L. Dunbrack,et al.  A new clustering of antibody CDR loop conformations. , 2011, Journal of molecular biology.

[10]  B. Baets,et al.  Metrics and T-Equalities , 2002 .

[11]  C. Micchelli Interpolation of scattered data: Distance matrices and conditionally positive definite functions , 1986 .

[12]  F Yang,et al.  Using affinity propagation combined post-processing to cluster protein sequences. , 2010, Protein and peptide letters.

[13]  Shoshana J. Wodak,et al.  Markov clustering versus affinity propagation for the partitioning of protein interaction graphs , 2009, BMC Bioinformatics.

[14]  Thomas G. Dietterich Adaptive computation and machine learning , 1998 .

[15]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[16]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[17]  H. Weyl Über die Gleichverteilung von Zahlen mod. Eins , 1916 .

[18]  Hui Liu,et al.  Detecting microarray data supported microRNA-mRNA interactions , 2010, Int. J. Data Min. Bioinform..

[19]  Jeffrey Skolnick,et al.  TASSER_low‐zsc: An approach to improve structure prediction using low z‐score–ranked templates , 2010, Proteins.

[20]  Chih-Wei Wang,et al.  iPARTS: an improved tool of pairwise alignment of RNA tertiary structures , 2010, Nucleic Acids Res..

[21]  Eleazar Eskin,et al.  The Spectrum Kernel: A String Kernel for SVM Protein Classification , 2001, Pacific Symposium on Biocomputing.

[22]  Ingrid G. Abfalter,et al.  Complex Networks Govern Coiled-Coil Oligomerization – Predicting and Profiling by Means of a Machine Learning Approach , 2011, Molecular & Cellular Proteomics.

[23]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[24]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[25]  K. Arndt,et al.  Coiled Coil Domains: Stability, Specificity, and Biological Implications , 2004, Chembiochem : a European journal of chemical biology.

[26]  Georgios A. Pavlopoulos,et al.  Arena3D: visualization of biological networks in 3D , 2008, BMC Systems Biology.