Truth selection for truth discovery models exploiting ordering relationship among values

Abstract : Data veracity is one of the main issues regarding Web data. Truth Discovery models can be used to assess it by estimating value confidence and source trustworthiness through analysis of claims on the same real-world entities provided by different sources. Many studies have been conducted in this domain. True values selected by most models have the highest confidence estimation. This naive strategy cannot be applied to identify true values when there is a partial order among values that is considered to enhance the final performance. Indeed, in this case, the resulting estimations monotonically increase with respect to the partial order of values. The highest confidence is always assigned to the most general value that is implicitly supported by all the others. Thus, using the highest confidence as criterion to select the true values is not appropriate because it will always return the most general values. To address this problem, we propose a post-processing procedure that, leveraging the partial order among values and their monotonic confidence estimations, is able to identify the expected true value. Experimental results on synthetic datasets show the effectiveness of our approach.
Complete list of metadatas
Contributor : Sylvie Ranwez <>
Submitted on : Tuesday, November 6, 2018 - 2:26:03 PM
Last modification on : Friday, May 17, 2019 - 1:19:15 AM
Long-term archiving on : Thursday, February 7, 2019 - 3:34:10 PM


Files produced by the author(s)



Valentina Beretta, Sébastien Harispe, Sylvie Ranwez, Isabelle Mougenot. Truth selection for truth discovery models exploiting ordering relationship among values. Knowledge-Based Systems, Elsevier, 2018, 159, pp.298-308. ⟨10.1016/j.knosys.2018.06.023⟩. ⟨hal-01912290⟩



Record views


Files downloads