Article ID: | iaor20061446 |
Country: | South Africa |
Volume: | 19 |
Issue: | 1/2 |
Start Page Number: | 75 |
End Page Number: | 85 |
Publication Date: | Jan 2003 |
Journal: | Orion |
Authors: | Jenkins Larry |
Keywords: | finance & banking |
In many studies where data are collected on several variables, it is natural to ask whether fewer variables would provide almost as much information. The variance of a variable about its mean is the common statistical measure of information content, and it is the measure used here. We are interested in whether the variability in one variable is so strongly correlated with that in one or more of the other variables that the first variable is redundant. We wish to find one or more ‘principal variables’ that sufficiently reflect the information content of all the original variables. The paper explains the method of principal variables and reports experiments using the technique to see whether just a few variables are sufficient to reflect the information in 11 socioeconomic variables on 130 countries from a World Bank (WB) database. While the method of principal variables is highly successful in a statistical sense, the WB data vary greatly from year to year, demonstrating that fewer variables would be inadequate for these data.
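The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one common way to pick principal variables: a greedy forward selection on the sample covariance matrix that, at each step, adds the variable whose inclusion explains the most remaining total variance. The function name `select_principal_variables`, the `threshold` parameter, and the greedy strategy itself are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def select_principal_variables(X, threshold=0.9):
    """Greedy sketch (an assumption, not the paper's algorithm).

    X is an (observations x variables) array. Variables are added one at a
    time until the chosen subset explains at least `threshold` of the total
    variance (trace of the sample covariance matrix).
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                 # centre each variable
    S = Xc.T @ Xc / (X.shape[0] - 1)        # sample covariance matrix
    total = np.trace(S)
    p = S.shape[0]

    selected = []
    residual = S.copy()                     # covariance left unexplained so far
    explained = 0.0

    while explained / total < threshold and len(selected) < p:
        diag = np.diag(residual)
        gains = np.full(p, -np.inf)
        for j in range(p):
            # Skip variables already chosen or already (numerically) explained.
            if j not in selected and diag[j] > 1e-12:
                # Variance removed from all variables by regressing on variable j.
                gains[j] = np.sum(residual[:, j] ** 2) / diag[j]
        best = int(np.argmax(gains))
        if not np.isfinite(gains[best]):
            break                           # nothing left worth adding
        selected.append(best)
        col = residual[:, best].copy()
        # Partial covariance of all variables given the newly selected one.
        residual = residual - np.outer(col, col) / col[best]
        explained = total - np.trace(residual)

    return selected, explained / total
```

Called on a data matrix with one row per country and one column per variable, the function returns the column indices chosen and the fraction of total variance they explain. Because variance is scale dependent, variables measured in very different units would usually be standardised first; that choice is left to the caller in this sketch.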