Learning Instance-Independent Value Functions to Enhance Local Search

Reinforcement learning methods can be used to improve the performance of local search algorithms for combinatorial optimization by learning an evaluation function that predicts the outcome of search. Such an evaluation function can guide search toward low-cost solutions more effectively than the original cost function. We describe a reinforcement learning method for enhancing local search that combines aspects of previous work by Zhang and Dietterich (1995) and Boyan and Moore (1997; Boyan 1998). In an off-line learning phase, a value function is learned that is useful for guiding search across multiple problem sizes and instances. We illustrate our technique by developing several such functions for the Dial-A-Ride Problem. Our learning-enhanced local search algorithm exhibits an improvement of more than 30% over a standard local search algorithm.
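The general idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm or its Dial-A-Ride setup. It assumes a toy open-path routing problem on random points, a pairwise-swap neighborhood, three hand-picked size-normalized features, and a simple least-mean-squares fit. All function names (`tour_cost`, `features`, `hill_climb`, `collect_samples`, `fit_linear`, `guided_search`) are hypothetical. Off-line, standard local search is run on several instance sizes, and each visited state is labeled with the normalized cost that search eventually reached. The fitted linear function then replaces the cost function as the search objective on a new instance.

```python
import itertools
import math
import random

def tour_cost(points, order):
    """Original cost function: length of the open path visiting points in `order`."""
    return sum(math.dist(points[a], points[b]) for a, b in zip(order, order[1:]))

def features(points, order):
    """Size-normalized features (an assumption of this sketch, not the paper's)."""
    edges = [math.dist(points[a], points[b]) for a, b in zip(order, order[1:])]
    return [1.0, sum(edges) / len(edges), max(edges)]

def swap_neighbors(order):
    """All orders reachable by exchanging two positions."""
    for i, j in itertools.combinations(range(len(order)), 2):
        nxt = list(order)
        nxt[i], nxt[j] = nxt[j], nxt[i]
        yield tuple(nxt)

def hill_climb(start, evaluate):
    """Greedy descent on `evaluate`; returns the list of states visited."""
    trajectory = [start]
    while True:
        current = trajectory[-1]
        best = min(swap_neighbors(current), key=evaluate)
        if evaluate(best) >= evaluate(current):
            return trajectory
        trajectory.append(best)

def random_instance(rng, n):
    """Hypothetical instance generator: n points in the unit square."""
    return [(rng.random(), rng.random()) for _ in range(n)]

def collect_samples(rng, sizes, runs_per_size):
    """Off-line phase: label each visited state with the outcome of standard search."""
    samples = []
    for n in sizes:
        for _ in range(runs_per_size):
            points = random_instance(rng, n)
            start = tuple(rng.sample(range(n), n))
            traj = hill_climb(start, lambda o: tour_cost(points, o))
            outcome = tour_cost(points, traj[-1]) / (n - 1)  # mean edge length reached
            samples.extend((features(points, o), outcome) for o in traj)
    return samples

def fit_linear(samples, epochs=200, lr=0.05):
    """Least-mean-squares fit of weights w so that w . features approximates the outcome."""
    w = [0.0] * len(samples[0][0])
    for _ in range(epochs):
        for feats, target in samples:
            err = sum(wi * fi for wi, fi in zip(w, feats)) - target
            w = [wi - lr * err * fi for wi, fi in zip(w, feats)]
    return w

def guided_search(points, start, weights):
    """Descend on the learned value, then report the best true-cost state visited."""
    value = lambda o: sum(w * f for w, f in zip(weights, features(points, o)))
    trajectory = hill_climb(start, value)
    return min(trajectory, key=lambda o: tour_cost(points, o))

rng = random.Random(0)
weights = fit_linear(collect_samples(rng, sizes=(5, 6, 7), runs_per_size=10))
test_points = random_instance(rng, 8)  # a size not seen during training
result = guided_search(test_points, tuple(range(8)), weights)
print(tour_cost(test_points, result))
```

Because the features are normalized by instance size, the same weight vector can be applied to an instance larger than any used for training, which is the sense of "instance-independent" this sketch tries to convey. Whether the learned function outperforms the plain cost function depends on the features and training data; the sketch makes no claim about the 30% figure reported above.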

[1] David M. Stein et al. An Asymptotic, Probabilistic Analysis of a Routing Problem, 1978, Math. Oper. Res.

[2] Harilaos N. Psaraftis et al. k-Interchange procedures for local search in a precedence-constrained routing problem, 1983.

[3] P. Healy et al. A new extension of local search applied to the Dial-A-Ride Problem, 1995.

[4] Wei Zhang et al. A Reinforcement Learning Approach to Job-Shop Scheduling, 1995, IJCAI.

[5] John N. Tsitsiklis et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.

[6] Andrew W. Moore et al. Learning evaluation functions for global optimization, 1998.