论文信息 - Reinforcement Learning and Reactive Search: an adaptive MAX-SAT solver

Reinforcement Learning and Reactive Search: an adaptive MAX-SAT solver

This paper investigates Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular, a novel application of RL is proposed in the Reactive Tabu Search (RTS) scheme, where the appropriate amount of diversification in prohibition-based local search is adapted in a fast online manner to the characteristics of a task and of the local configuration. The experimental tests demonstrate promising results on Maximum Satisfiability (MAX-SAT) instances when compared with state-of-the-art SLS SAT solvers, such us AdaptNovelty, rSAPS and gNovelty.

Roberto Battiti | Paolo Campigotto

[1] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..

[2] Holger H. Hoos,et al. Scaling and Probabilistic Smoothing: Efficient Dynamic Local Search for SAT , 2002, CP.

[3] Hector J. Levesque,et al. Hard and Easy Distributions of SAT Problems , 1992, AAAI.

[4] Holger H. Hoos,et al. Novelty + and Adaptive Novelty + , 2004 .

[5] Abdul Sattar,et al. Advances in Local Search for Satisfiability , 2007, Australian Conference on Artificial Intelligence.

[6] Roberto Battiti,et al. Reactive search, a history-sensitive heuristic for MAX-SAT , 1997, JEAL.

[7] Mauro Brunato,et al. Learning While Optimizing an Unknown Fitness Surface , 2008, LION.