P-top-k queries in a probabilistic framework from information extraction models

P-top-k queries in a probabilistic framework from information extraction models

0.00 Avg rating0 Votes
Article ID: iaor20119573
Volume: 62
Issue: 7
Start Page Number: 2755
End Page Number: 2769
Publication Date: Oct 2011
Journal: Computers and Mathematics with Applications
Authors: ,
Keywords: computers: information, datamining, statistics: empirical
Abstract:

Many applications today need to manage uncertain data, such as information extraction (IE), data integration, sensor RFID networks, and scientific experiments. Top‐ k equ1 queries are often natural and useful in analyzing uncertain data in those applications. In this paper, we study the problem of answering top‐ k equ2 queries in a probabilistic framework from a state‐of‐the‐art statistical IE model–semi‐conditional random fields (CRFs)–in the setting of probabilistic databases that treat statistical models as first‐class data objects. We investigate the problem of ranking the answers to probabilistic database queries. We present an efficient algorithm for finding the best approximating parameters in such a framework for efficiently retrieving the top‐ k equ3 ranked results. An empirical study using real data sets demonstrates the effectiveness of probabilistic top‐ k equ4 queries and the efficiency of our method.

Reviews

Required fields are marked *. Your email address will not be published.