A Cross Entropy Multiagent Learning Algorithm for Solving Vehicle Routing Problems with Time Windows

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The vehicle routing problem with time windows (VRPTW) has been the subject of intensive study because of its importance in real applications. In this paper, we propose a cross entropy multiagent learning algorithm, which considers an optimum solution as a rare event to be learned. The routing policy is node-distributed, controlled by a set of parameterized probability distribution functions. Based on the performance of experienced tours of vehicle agents, these parameters are updated iteratively by minimizing Kullback-Leibler cross entropy in order to generate better solutions in next iterations. When applying the proposed algorithm on Solomon’s 100-customer problem set, it shows outperforming results in comparison with the classical cross entropy approach. Moreover, this method needs only very small number of parameter settings. Its implementation is also relatively simple and flexible to solve other vehicle routing problems under various dynamic scenarios.
Original languageEnglish
Title of host publicationComputational Logistics
Subtitle of host publicationInternational Conference on Computational Logistics ICCL 2011
Editors J.W. Böse, H. Hu , C. Jahn , X. Shi , R. Stahlbock , R. Stahlbock , S. Voß
PublisherSpringer
Number of pages2
ISBN (Electronic)978-3-642-24264-9
ISBN (Print)978-3-642-24263-2
DOIs
Publication statusPublished - 2011
Externally publishedYes

Publication series

NameLecture Notes in Computer Science book series (LNCS)
Volume6971

Keywords

  • Local Search
  • Vehicle Route Problem
  • Local Search Algorithm
  • Local Search Procedure
  • Cross Entropy

Cite this