On the semantics of top-k ranking for objects with uncertain data

0.00 Avg rating—0 Votes

Article ID:	iaor20119611
Volume:	62
Issue:	7
Start Page Number:	2812
End Page Number:	2823
Publication Date:	Oct 2011
Journal:	Computers and Mathematics with Applications
Authors:	Wang Chonghai, Yuan Li Yan, You Jia-Huai
Keywords:	datamining

Abstract:

The goal of top‐ $k$ equ1 ranking for objects is to rank the objects so that the best $k$ equ2 of them can be determined. In this paper we consider an object to be an entity which consists of a number of attributes whose roles in the object are determined by an aggregation function. The problem of top‐ranking in this case is conceptually simple for data that are complete and certain – the aggregation value of an object represents its strength and therefore its rank. For uncertain data, the semantic basis of top‐ $k$ equ3 objects becomes unclear. In this paper, we formulate a semantics of top‐ $k$ equ4 ranking for objects modeled by uncertain data, where the values of an object’s attributes are expressed by probability distributions and constrained by some stated conditions. Under this setting, we present a theory of top‐ $k$ equ5 ranking for objects so that their strengths can be determined in the presence of uncertain data. We present our theory in three stages. The first deals with discrete domains, which is extended to include continuous domains. We show that top‐ $k$ equ6 ranking for objects in this context is closely related to high‐dimensional space studied in mathematics. In particular, the computation of the volumes of a high‐dimensional polyhedron represented by a system of linear inequations is a special case of top‐ $k$ equ7 ranking under our theory. We further extend this theory to add weights to objects’ positions and aggregation values in determining ranking results. We show that a number of previous proposals for top‐ $k$ equ8 ranking are special cases of our theory.

Reviews

Required fields are marked *. Your email address will not be published.