How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks

We consider the problem of monitoring and controlling a partially observed dynamic process that spreads over a graph. This problem arises naturally in contexts such as scheduling virus tests or quarantining individuals to curb a spreading epidemic; detecting fake news spreading on online networks by manually inspecting posted articles; and targeted marketing, where the objective is to encourage the spread of a product. Curbing the spread and limiting the fraction of the population that is infected become challenging when only a fraction of the population can be tested or quarantined. To address this challenge, we formulate the setup as a sequential decision problem over a graph. In the face of an exponential state space, a combinatorial action space, and partial observability, we design RLGN, a novel tractable Reinforcement Learning (RL) scheme that prioritizes which nodes should be tested, using Graph Neural Networks (GNNs) to rank the graph nodes. We evaluate this approach on three types of social networks: community-structured networks, preferential-attachment networks, and networks based on statistics from real cellular tracking. RLGN consistently outperforms all baselines in our experiments. The results suggest that prioritizing tests using RL on temporal graphs can increase the number of healthy people by $25\%$ and contain the epidemic $30\%$ more often than supervised approaches and $2.5\times$ more often than non-learned baselines using the same resources.
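The budgeted ranking step described above (score every node, then test only as many as the budget allows) can be illustrated with a minimal sketch. This is not the RLGN method: the learned GNN score is replaced here by a hand-crafted stand-in (the fraction of a node's neighbours already known to be infected), and the names `heuristic_scores` and `select_nodes_to_test`, the adjacency-list graph, and the budget value are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Set

# Assumed representation: adjacency list mapping each node to its neighbours.
Graph = Dict[int, List[int]]


def heuristic_scores(graph: Graph, known_infected: Set[int]) -> Dict[int, float]:
    """Hand-crafted stand-in for a learned ranking score (an assumption).

    Each node is scored by the fraction of its neighbours that are already
    known to be infected. Nodes already known to be infected, and isolated
    nodes, get a score of 0.0.
    """
    scores: Dict[int, float] = {}
    for node, neighbours in graph.items():
        if node in known_infected or not neighbours:
            scores[node] = 0.0
            continue
        hits = sum(1 for n in neighbours if n in known_infected)
        scores[node] = hits / len(neighbours)
    return scores


def select_nodes_to_test(scores: Dict[int, float], budget: int) -> List[int]:
    """Return the `budget` highest-scoring nodes, breaking ties by node id."""
    ranked = sorted(scores, key=lambda n: (-scores[n], n))
    return ranked[:budget]


# Toy example: five nodes, one known infection, budget of two tests per step.
graph: Graph = {0: [1, 2], 1: [0, 2, 3], 2: [0, 1], 3: [1, 4], 4: [3]}
known_infected = {0}
scores = heuristic_scores(graph, known_infected)
chosen = select_nodes_to_test(scores, budget=2)
print(chosen)
```

In a learned approach, the scoring function would be replaced by a model trained on the observed history of the graph, while the top-k selection under a fixed budget would stay the same.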
