Article ID: iaor20117593
Volume: 96
Issue: 10
Start Page Number: 1292
End Page Number: 1310
Publication Date: Oct 2011
Journal: Reliability Engineering and System Safety
Authors: Flandoli F, Giorgi E, Aspinall W P, Neri A
Keywords: estimation, performance evaluation, ranking, uncertainty
The problem of ranking and weighting experts' performances when quantitative judgments are being elicited for decision support is considered. A new scoring model, the Expected Relative Frequency (ERF) model, is presented, based on the closeness between the central values provided by the expert and the known values used for calibration. Using responses from experts in five different elicitation datasets, a cross-validation technique is used to compare this new approach with the Cooke Classical Model, the Equal Weights model, and individual experts. The analysis is performed using alternative reward schemes designed to capture proficiency either in quantifying uncertainty or in estimating true central values. Results show that, although there is only a limited probability that one approach is consistently better than another, the Cooke Classical Model is generally the most suitable for assessing uncertainties, whereas the new ERF model should be preferred if the goal is accuracy in central value estimation.
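The sketch below is a minimal illustration, not the paper's exact ERF formulation: it assumes a simple closeness measure (a bounded transform of the relative absolute error between each expert's central estimates and the known calibration values) and normalizes the resulting scores into weights. All function names, the closeness transform, and the toy data are illustrative assumptions.

```python
import numpy as np

def closeness_scores(expert_medians, true_values, scale=None):
    """Score each expert by how close their central estimates fall to the
    known calibration values (illustrative measure, not the paper's exact
    ERF definition).

    expert_medians : array of shape (n_experts, n_seed_questions)
    true_values    : array of shape (n_seed_questions,)
    """
    expert_medians = np.asarray(expert_medians, dtype=float)
    true_values = np.asarray(true_values, dtype=float)
    if scale is None:
        # Normalize errors by the spread of the true values so that
        # seed questions on different scales contribute comparably.
        scale = np.std(true_values) or 1.0
    # Scaled absolute error per expert per seed question
    rel_err = np.abs(expert_medians - true_values) / scale
    # Map errors to (0, 1] closeness scores and average over questions
    return np.mean(1.0 / (1.0 + rel_err), axis=1)

def performance_weights(scores):
    """Turn raw closeness scores into weights that sum to one."""
    scores = np.asarray(scores, dtype=float)
    return scores / scores.sum()

# Toy example: three experts, four calibration (seed) questions
medians = [[10.2, 5.1,  99.0, 0.48],
           [12.0, 4.0, 120.0, 0.60],
           [10.0, 5.0, 100.0, 0.50]]
truth = [10.0, 5.0, 100.0, 0.50]

weights = performance_weights(closeness_scores(medians, truth))
print(weights)  # the expert closest to the truth receives the largest weight
```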