Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture
This paper take a novel perspective on model architecture to enhance the generalization of neural VRP solvers, specifically propose an Entropy-based Scaling Factor (ESF) and a Distribution-Specific (DS) decoder to enhance the size and distribution generalization, respectively.
1. Implementation of Entropy-based Scaling Factor:- Given a fix size-trained (
$n_{tr}$ ) model (e.g., POMO), just need apply$\log_{n_{tr}}n_{te}$ within each attention module when solving VRPs of size$n_{te}$ ; - Given an unfixed size-trained model (e.g., OMNI-VRP), just set a baseline
$n_{b}$ (e.g., 50), and then apply$\log_{n_{b}}n_{tr}$ and$\log_{n_{b}}n_{te}$ during training and testing, respectively.
We present the results of 2) on OMNI-VRP of solving CVRP (Just apply the ESF within each attention module). For 1), you can verify it by yourself.
2. Implementation of DS decoder:The DS decoder explicitly models VRPs of multiple training distribution patterns through multiple auxiliary light decoders.
-
We also would like to thank the following open-source repositories, which are baselines of our code:
If you find our paper and code useful, please cite our paper:
@misc{xiao2024improving,
title={Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture},
author={Yubin Xiao and Di Wang and Xuan Wu and Yuesong Wu and Boyang Li and Wei Du and Liupu Wang and You Zhou},
year={2024},
eprint={2406.06652},
archivePrefix={arXiv},
primaryClass={cs.LG}
}