Unsupervised classification for uncertain varying responses: The wisdom-in-the-crowd (WICRO) algorithm

Nir Ratner, Eugene Kagan, Parteek Kumar, Irad Ben-Gal

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

This paper addresses the problem classification of instances/questions based on the opinions (classes) provided by anonymous agents. The solution aggregates the agents’ classifications, aiming to obtain as close as possible to an unknown correct classification. However, the agents’ fields or domains of competence and their levels of expertise are unknown and can vary extensively. Many popular classification algorithms address such a problem by following a “wisdom-of-the-crowd” approach while using different voting methods and expectation–maximization techniques. These algorithms lead to correct classifications when the majority of the agents are experts, thus classifying the instances correctly. However, they often result in erroneous classification when only a small subset of the agents are indeed correct. Moreover, these algorithms often assume a fixed set of classes for all instances. This study presents a fast (one-pass) classification algorithm that can estimate the unknown agents’ expertise level and aggregates their classifications accordingly, even when these are obtained from different questionnaires; thus, when the instances are not necessarily classified to a fixed set of classes. The proposed algorithm finds the experts and the nonexpert agents for each question by analyzing the distance between them. The algorithm identifies the expert agents for each instance and then classifies them accordingly. The suggested algorithm is validated and compared against known methods by using both simulated datasets and real-world datasets collected from various sources. The obtained results clearly demonstrate the effectiveness and advantages of the proposed method.

Original languageEnglish
Article number110551
JournalKnowledge-Based Systems
Volume272
DOIs
StatePublished - 19 Jul 2023

Keywords

  • Data analysis
  • Expert voting
  • Online decision-making
  • Unsupervised classification
  • Wisdom-of-the-crowd

Fingerprint

Dive into the research topics of 'Unsupervised classification for uncertain varying responses: The wisdom-in-the-crowd (WICRO) algorithm'. Together they form a unique fingerprint.

Cite this