Article ID: | iaor2006916 |
Country: | Netherlands |
Volume: | 41 |
Issue: | 1 |
Start Page Number: | 205 |
End Page Number: | 227 |
Publication Date: | Nov 2005 |
Journal: | Decision Support Systems |
Authors: | Lee Jae Kyu, Kang Juyoung |
Keywords: | internet |
The Web holds oceans of documents in the form of natural-language text and tables. To extract rules from Web pages and keep the extracted rules consistent with their source pages, we have developed the XRML (eXtensible Rule Markup Language) framework. XRML allows rules to be identified on Web pages and generates the identified rules automatically. For this purpose, we have designed the Rule Identification Markup Language (RIML), which is similar to the formal Rule Structure Markup Language (RSML); both are parts of XRML. RIML 2.0 is designed to identify rules not only in text but also in tables on Web pages, and to transform them automatically into formal rules in RSML syntax. In designing RIML 2.0, we considered three features: variables and values shared among rules, omitted terms, and synonyms.