Towards Expertise Modelling for Routing Data Cleaning Tasks within a Community of Knowledge Workers (bibtex)
by Umair ul Hassan, Sean O'Riain, Edward Curry
Abstract:
Applications consuming data have to deal with variety of data quality issues such as missing values, duplication, incorrect values, etc. Although automatic approaches can be utilized for data cleaning the results can remain uncertain. Therefore updates suggested by automatic data cleaning algorithms require further human verification. This paper presents an approach for generating tasks for uncertain updates and routing these tasks to appropriate workers based on their expertise. Specifically the paper tackles the problem of modelling the expertise of knowledge workers for the purpose of routing tasks within collaborative data quality management. The proposed expertise model represents the profile of a worker against a set of concepts describing the data. A simple routing algorithm is employed for leveraging the expertise profiles for matching data cleaning tasks with workers. The proposed approach is evaluated on a real world dataset using human workers. The results demonstrate the effectiveness of using concepts described the data for modelling expertise, in terms of likelihood of receiving responses to tasks routed to workers.
Reference:
Umair ul Hassan, Sean O'Riain, Edward Curry, "Towards Expertise Modelling for Routing Data Cleaning Tasks within a Community of Knowledge Workers", In 17th International Conference on Information Quality (ICIQ 2012), Paris, France, pp. 58-69, 2012.
Bibtex Entry:
@inproceedings{UlHassan,
abstract = {Applications consuming data have to deal with variety of data quality issues such as missing values, duplication, incorrect values, etc. Although automatic approaches can be utilized for data cleaning the results can remain uncertain. Therefore updates suggested by automatic data cleaning algorithms require further human verification. This paper presents an approach for generating tasks for uncertain updates and routing these tasks to appropriate workers based on their expertise. Specifically the paper tackles the problem of modelling the expertise of knowledge workers for the purpose of routing tasks within collaborative data quality management. The proposed expertise model represents the profile of a worker against a set of concepts describing the data. A simple routing algorithm is employed for leveraging the expertise profiles for matching data cleaning tasks with workers. The proposed approach is evaluated on a real world dataset using human workers. The results demonstrate the effectiveness of using concepts described the data for modelling expertise, in terms of likelihood of receiving responses to tasks routed to workers.},
address = {Paris, France},
author = {ul Hassan, Umair and O'Riain, Sean and Curry, Edward},
booktitle = {17th International Conference on Information Quality (ICIQ 2012)},
file = {:Users/ed/Library/Application Support/Mendeley Desktop/Downloaded/ul Hassan, O'Riain, Curry - 2012 - Towards Expertise Modelling for Routing Data Cleaning Tasks within a Community of Knowledge Workers.pdf:pdf},
keywords = {CAMEE,LEIdataspace,crowd sourcing,data cleaning,linked data,web 2.0},
mendeley-tags = {CAMEE,LEIdataspace},
pages = {58--69},
title = {{Towards Expertise Modelling for Routing Data Cleaning Tasks within a Community of Knowledge Workers}},
url = {http://www.edwardcurry.org/publications/ulHassan_iciq2012.pdf},
year = {2012}
}
Powered by bibtexbrowser