Article ID: | iaor201110911 |
Volume: | 52 |
Issue: | 1 |
Start Page Number: | 189 |
End Page Number: | 198 |
Publication Date: | Dec 2011 |
Journal: | Decision Support Systems |
Authors: | Chan Samuel W K, Franklin James |
Keywords: | datamining, statistics: decision, decision |
Although most quantitative financial data are analyzed using traditional statistical, artificial intelligence or data mining techniques, the abundance of online electronic financial news articles has opened up new possibilities for intelligent systems that can extract and organize relevant knowledge automatically in a usable format. Most information extraction systems require a hand‐built dictionary of templates and thus need continual modification to accommodate new patterns that are observed in the text. In this research, we propose a novel text‐based decision support system (DSS) that (i) extracts event sequences from shallow text patterns, and (ii) predicts the likelihood of the occurrence of events using a classifier‐based inference engine. The prediction relies on two major, but complementary, feature sets: adjacent events and a set of information‐theoretic functions. In contrast to other approaches, the proposed text‐based DSS gives explanatory hypotheses about its predictions from a coalition of intimations learned from the inference engine, while preserving robustness and without indulging in formalism. We investigate more than 2000 financial reports with 28,000 sentences. Experiments show that the prediction accuracy of our model outperforms similar statistical models by 7% for the seen data while significantly improving the prediction accuracy for the unseen data. Further comparisons substantiate the experimental findings.