Information relaxations and duality in stochastic dynamic programs

0.00 Avg rating—0 Votes

Article ID:	iaor20105581
Volume:	58
Issue:	4-Part-1
Start Page Number:	785
End Page Number:	801
Publication Date:	Jul 2010
Journal:	Operations Research
Authors:	Sun Peng, Brown David B, Smith James E
Keywords:	duality

Abstract:

We describe a general technique for determining upper bounds on maximal values (or lower bounds on minimal costs) in stochastic dynamic programs. In this approach, we relax the nonanticipativity constraints that require decisions to depend only on the information available at the time a decision is made and impose a “penalty” that punishes violations of nonanticipativity. In applications, the hope is that this relaxed version of the problem will be simpler to solve than the original dynamic program. The upper bounds provided by this dual approach complement lower bounds on values that may be found by simulating with heuristic policies. We describe the theory underlying this dual approach and establish weak duality, strong duality, and complementary slackness results that are analogous to the duality results of linear programming. We also study properties of good penalties. Finally, we demonstrate the use of this dual approach in an adaptive inventory control problem with an unknown and changing demand distribution and in valuing options with stochastic volatilities and interest rates. These are complex problems of significant practical interest that are quite difficult to solve to optimality. In these examples, our dual approach requires relatively little additional computation and leads to tight bounds on the optimal values.

Reviews

Required fields are marked *. Your email address will not be published.