Predicting human immunodeficiency virus inhibitors using multi-dimensional Bayesian network classifiers

OBJECTIVE Our aim is to use multi-dimensional Bayesian network classifiers in order to predict the human immunodeficiency virus type 1 (HIV-1) reverse transcriptase and protease inhibitors given an input set of respective resistance mutations that an HIV patient carries. MATERIALS AND METHODS Multi-dimensional Bayesian network classifiers (MBCs) are probabilistic graphical models especially designed to solve multi-dimensional classification problems, where each input instance in the data set has to be assigned simultaneously to multiple output class variables that are not necessarily binary. In this paper, we introduce a new method, named MB-MBC, for learning MBCs from data by determining the Markov blanket around each class variable using the HITON algorithm. Our method is applied to both reverse transcriptase and protease data sets obtained from the Stanford HIV-1 database. RESULTS Regarding the prediction of antiretroviral combination therapies, the experimental study shows promising results in terms of classification accuracy compared with state-of-the-art MBC learning algorithms. For reverse transcriptase inhibitors, we get 71% and 11% in mean and global accuracy, respectively; while for protease inhibitors, we get more than 84% and 31% in mean and global accuracy, respectively. In addition, the analysis of MBC graphical structures lets us gain insight into both known and novel interactions between reverse transcriptase and protease inhibitors and their respective resistance mutations. CONCLUSION MB-MBC algorithm is a valuable tool to analyze the HIV-1 reverse transcriptase and protease inhibitors prediction problem and to discover interactions within and between these two classes of inhibitors.

[1]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[2]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[3]  Yves Moreau,et al.  Analysis of HIV-1 pol sequences using Bayesian Networks: implications for drug resistance , 2006, Bioinform..

[4]  Bryan Chan,et al.  Human Immunodeficiency Virus Reverse Transcriptase and Protease Sequence Database , 1999, Nucleic Acids Res..

[5]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[6]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[7]  Philippe Flandre,et al.  Tipranavir-Ritonavir Genotypic Resistance Score in Protease Inhibitor-Experienced Patients , 2008, Antimicrobial Agents and Chemotherapy.

[8]  D. Katzenstein,et al.  Bayesian network analyses of resistance pathways against efavirenz and nevirapine , 2008, AIDS.

[9]  Judea Pearl,et al.  Equivalence and Synthesis of Causal Models , 1990, UAI.

[10]  Constantin F. Aliferis,et al.  HITON: A Novel Markov Blanket Algorithm for Optimal Variable Selection , 2003, AMIA.

[11]  Thomas Lengauer,et al.  Improved Prediction of Response to Antiretroviral Combination Therapy using the Genetic Barrier to Drug Resistance , 2006, Antiviral therapy.

[12]  Constantin F. Aliferis,et al.  Local Causal and Markov Blanket Induction for Causal Discovery and Feature Selection for Classification Part II: Analysis and Extensions , 2010, J. Mach. Learn. Res..

[13]  Glenn Fung,et al.  Automated Heart Wall Motion Abnormality Detection from Ultrasound Images Using Bayesian Networks , 2007, IJCAI.

[14]  Luis Enrique Sucar,et al.  A Two-Step Method to Learn Multidimensional Bayesian Network Classifiers Based on Mutual Information Measures , 2011, FLAIRS.

[15]  S. Sarafianos,et al.  Structure and function of HIV-1 reverse transcriptase: molecular mechanisms of polymerization and inhibition. , 2009, Journal of molecular biology.

[16]  Philippe Flandre,et al.  Factors Associated with the Selection of Mutations Conferring Resistance to Protease Inhibitors (PIs) in PI-Experienced Patients Displaying Treatment Failure on Darunavir , 2007, Antimicrobial Agents and Chemotherapy.

[17]  Linda C. van der Gaag,et al.  Inference and Learning in Multi-dimensional Bayesian Network Classifiers , 2007, ECSQARU.

[18]  Judea Pearl,et al.  The recovery of causal poly-trees from statistical data , 1987, Int. J. Approx. Reason..

[19]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[20]  José Antonio Lozano,et al.  Multi-Objective Learning of Multi-Dimensional Bayesian Classifiers , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[21]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[22]  D. Richman,et al.  2022 update of the drug resistance mutations in HIV-1. , 2022, Topics in antiviral medicine.

[23]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[24]  T. Silander,et al.  Bayesian network analysis of resistance pathways against HIV-1 protease inhibitors. , 2007, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[25]  Colombe Chappey,et al.  Broad nucleoside reverse-transcriptase inhibitor cross-resistance in human immunodeficiency virus type 1 clinical isolates. , 2003, The Journal of infectious diseases.

[26]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[27]  Concha Bielza,et al.  Multi-dimensional classification with Bayesian networks , 2011, Int. J. Approx. Reason..

[28]  R. Shafer,et al.  HIV-1 Protease Mutations and Protease Inhibitor Cross-Resistance , 2010, Antimicrobial Agents and Chemotherapy.

[29]  D. Richman,et al.  Update of the drug resistance mutations in HIV-1: December 2010. , 2010, Topics in HIV medicine : a publication of the International AIDS Society, USA.

[30]  Wilhelm Huisinga,et al.  Drug-Class Specific Impact of Antivirals on the Reproductive Capacity of HIV , 2010, PLoS Comput. Biol..

[31]  Nir Friedman,et al.  The Bayesian Structural EM Algorithm , 1998, UAI.

[32]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[33]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[34]  P. Libin,et al.  Resistance pathways of human immunodeficiency virus type 1 against the combination of zidovudine and lamivudine. , 2010, The Journal of general virology.

[35]  Constantin F. Aliferis,et al.  Local Causal and Markov Blanket Induction for Causal Discovery and Feature Selection for Classification Part I: Algorithms and Empirical Evaluation , 2010, J. Mach. Learn. Res..

[36]  Concha Bielza,et al.  Bayesian Chain Classifiers for Multidimensional Classification , 2011, IJCAI.

[37]  Linda C. van der Gaag,et al.  Multi-dimensional Bayesian Network Classifiers , 2006, Probabilistic Graphical Models.