Article ID: | iaor20122084 |
Volume: | 219 |
Issue: | 2 |
Start Page Number: | 335 |
End Page Number: | 346 |
Publication Date: | Jun 2012 |
Journal: | European Journal of Operational Research |
Authors: | Khmelnitsky Eugene, Bukchin Yossi, Yakuel Pini |
Keywords: | combinatorial optimization, programming: dynamic, programming: markov decision, decision theory, heuristics |
This research studies the problem of batching orders in a dynamic, finite‐horizon environment to minimize order tardiness and overtime costs of the pickers. The problem introduces the following trade‐off: at every period, the picker has to decide whether to go on a tour and pick the accumulated orders, or to wait for more orders to arrive. By waiting, the picker risks higher tardiness of existing orders on the account of lower tardiness of future orders. We use a Markov decision process (MDP) based approach to set an optimal decision making policy. In order to evaluate the potential improvement of the proposed approach in practice, we compare the optimal policy with two naïve heuristics: (1) ‘Go on tour immediately after an order arrives’, and, (2) ‘Wait as long as the current orders can be picked and supplied on time’. The optimal policy shows a considerable improvement over the naïve heuristics, in the range of 7–99%, where the specific values depend on the picking process parameters. We have found that one measure, the