Article ID: | iaor20171227 |
Volume: | 29 |
Issue: | 2 |
Start Page Number: | 301 |
End Page Number: | 317 |
Publication Date: | May 2017 |
Journal: | INFORMS Journal on Computing |
Authors: | Williams Loren, Jiang Yan, Klabjan Diego, Park Young Woong |
Keywords: | statistics: regression, programming: mathematical, heuristics, inventory, agriculture & food, marketing, forecasting: applications |
Clusterwise linear regression (CLR), a clustering problem intertwined with regression, finds clusters of entities such that the overall sum of squared errors from regressions performed over these clusters is minimized, where each cluster may have different variances. We generalize the CLR problem by allowing each entity to have more than one observation and refer to this as generalized CLR. We propose an exact mathematical programming‐based approach relying on column generation, a column generation–based heuristic algorithm that clusters predefined groups of entities, a metaheuristic genetic algorithm with adapted Lloyd’s algorithm for