[论文笔记] Task Matching in Crowdsourcing (iThings & CPSCom, 2011)

Time: 2.5 hours
Timespan: Mar 28 – Mar 30, 2012
Yuen Man-Ching, King Irwin, Leung Kwong-Sak. Task Matching in Crowdsourcing. Proceedings - 2011 IEEE International Conferences on Internet of Things and Cyber, Physical and Social Computing, iThings/CPSCom 2011, p 409-412, 2011

[背景和问题]
    纵包网站上罗列的任务数量已经越来越多了，比如在2011年2月份，mTurk上的日均任务数量：80000个HIT（针对qualified worker）， 2000个HIT（针对unqualified worker)。
    那么对于worker来说，就面临着一个问题：如何快速有效的找到一个合适的任务？（要知道纵包网站上任务的粒度通常都是很小的，有时候找到一个合适的任务要比完成这个任务花更多的时间）
    有研究指出，worker通常只会查看前几页的任务(文献：Task search in a human computation market)：
         (1) "mostly at the first page of the most recently posted tasks"
         (2) "the first two pages of the tasks with the most available instances"
         (3) "in the categories the position on the result page is unimportant to workers"
         (4) "some workers searching by almost all the possible categories and looking more than 10 pages deep"
    目前还少有这方面的研究。

[主要贡献]
提出了一种为worker推荐合适任务的方法，主要基于该worker过去的任务偏好和表现。

本文提出的方法有如下价值：

激励不同背景的worker长期参与纵包任务
提高工作质量

以下是相关的论文笔记：

1. 本文的技术细节在第三部分，提出了模型和算法。符号虽多，但不难理解。
文中提出了TaskRank指标，用来度量某个类别的某个任务，针对某个worker的适合程度，主要考虑了这些因素：

任务的类别(task category)
该worker在该类别的任务中收到过的酬金(reward)
该worker在该类别的任务中花费过的时间(time alloted),
该worker针对该类别任务的提交被接收的比例(acceptance rate)。

具体的符号定义和算法可见原文，不是很难懂。