Article ID: | iaor19992799 |
Country: | Netherlands |
Volume: | 108 |
Issue: | 2 |
Start Page Number: | 306 |
End Page Number: | 318 |
Publication Date: | Jul 1998 |
Journal: | European Journal of Operational Research |
Authors: | Stafylopatis A., Blekas K. |
Keywords: | vehicle routing & scheduling |
Reinforcement learning schemes perform direct on-line search in control space. This makes them appropiate for modifying control rules to obtain improvements in the performance of a system. The effectiveness of a reinforcement learning strategy is studied here through the training of a learning classifier system (LCS) that controls the movement of an autonomous vehicle in simulated paths including left and right turns. The LCS comprises a set of condition–action rules (classifiers) that compete to control the system and evolve by means of a genetic algorithm (GA). Evolution and operation of classifiers depend upon an appropriate credit assignment mechanism based on reinforcement learning. Different design options and the role of various parameters have been investigated experimentally. The performance of vehicle movement under the proposed evolutionary approach is superior compared with that of other (neural) approaches based on reinforcement learning that have been applied previously to the same benchmark problem.