TU Darmstadt / ULB / TUprints

Scalable Bayesian preference learning for crowds

Simpson, Edwin ; Gurevych, Iryna (2024)
Scalable Bayesian preference learning for crowds.
In: Machine Learning, 2020, 109 (4)
doi: 10.26083/tuprints-00023931
Article, Secondary publication, Publisher's Version

[img] Text
s10994-019-05867-2.pdf
Copyright Information: CC BY 4.0 International - Creative Commons, Attribution.

Download (3MB)
Item Type: Article
Type of entry: Secondary publication
Title: Scalable Bayesian preference learning for crowds
Language: English
Date: 17 December 2024
Place of Publication: Darmstadt
Year of primary publication: April 2020
Place of primary publication: Dordrecht
Publisher: Springer Science
Journal or Publication Title: Machine Learning
Volume of the journal: 109
Issue Number: 4
DOI: 10.26083/tuprints-00023931
Corresponding Links:
Origin: Secondary publication DeepGreen
Abstract:

We propose a scalable Bayesian preference learning method for jointly predicting the preferences of individuals as well as the consensus of a crowd from pairwise labels. Peoples’ opinions often differ greatly, making it difficult to predict their preferences from small amounts of personal data. Individual biases also make it harder to infer the consensus of a crowd when there are few labels per item. We address these challenges by combining matrix factorisation with Gaussian processes, using a Bayesian approach to account for uncertainty arising from noisy and sparse data. Our method exploits input features, such as text embeddings and user metadata, to predict preferences for new items and users that are not in the training set. As previous solutions based on Gaussian processes do not scale to large numbers of users, items or pairwise labels, we propose a stochastic variational inference approach that limits computational and memory costs. Our experiments on a recommendation task show that our method is competitive with previous approaches despite our scalable inference approximation. We demonstrate the method’s scalability on a natural language processing task with thousands of users and items, and show improvements over the state of the art on this task. We make our software publicly available for future work (https://github.com/UKPLab/tacl2018-preference-convincing/tree/crowdGPPL).

Uncontrolled Keywords: Artificial Intelligence, Control, Robotics, Mechatronics, Artificial Intelligence, Simulation and Modeling, Natural Language Processing (NLP)
Status: Publisher's Version
URN: urn:nbn:de:tuda-tuprints-239314
Classification DDC: 000 Generalities, computers, information > 004 Computer science
Divisions: 20 Department of Computer Science > Ubiquitous Knowledge Processing
Date Deposited: 17 Dec 2024 12:56
Last Modified: 17 Dec 2024 12:56
SWORD Depositor: Deep Green
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/23931
PPN:
Export:
Actions (login required)
View Item View Item