Simpson, Edwin ; Gurevych, Iryna (2024)
Scalable Bayesian preference learning for crowds.
In: Machine Learning, 2020, 109 (4)
doi: 10.26083/tuprints-00023931
Article, Secondary publication, Publisher's Version
Text
s10994-019-05867-2.pdf Copyright Information: CC BY 4.0 International - Creative Commons, Attribution. Download (3MB) |
Item Type: | Article |
---|---|
Type of entry: | Secondary publication |
Title: | Scalable Bayesian preference learning for crowds |
Language: | English |
Date: | 17 December 2024 |
Place of Publication: | Darmstadt |
Year of primary publication: | April 2020 |
Place of primary publication: | Dordrecht |
Publisher: | Springer Science |
Journal or Publication Title: | Machine Learning |
Volume of the journal: | 109 |
Issue Number: | 4 |
DOI: | 10.26083/tuprints-00023931 |
Corresponding Links: | |
Origin: | Secondary publication DeepGreen |
Abstract: | We propose a scalable Bayesian preference learning method for jointly predicting the preferences of individuals as well as the consensus of a crowd from pairwise labels. Peoples’ opinions often differ greatly, making it difficult to predict their preferences from small amounts of personal data. Individual biases also make it harder to infer the consensus of a crowd when there are few labels per item. We address these challenges by combining matrix factorisation with Gaussian processes, using a Bayesian approach to account for uncertainty arising from noisy and sparse data. Our method exploits input features, such as text embeddings and user metadata, to predict preferences for new items and users that are not in the training set. As previous solutions based on Gaussian processes do not scale to large numbers of users, items or pairwise labels, we propose a stochastic variational inference approach that limits computational and memory costs. Our experiments on a recommendation task show that our method is competitive with previous approaches despite our scalable inference approximation. We demonstrate the method’s scalability on a natural language processing task with thousands of users and items, and show improvements over the state of the art on this task. We make our software publicly available for future work (https://github.com/UKPLab/tacl2018-preference-convincing/tree/crowdGPPL). |
Uncontrolled Keywords: | Artificial Intelligence, Control, Robotics, Mechatronics, Artificial Intelligence, Simulation and Modeling, Natural Language Processing (NLP) |
Status: | Publisher's Version |
URN: | urn:nbn:de:tuda-tuprints-239314 |
Classification DDC: | 000 Generalities, computers, information > 004 Computer science |
Divisions: | 20 Department of Computer Science > Ubiquitous Knowledge Processing |
Date Deposited: | 17 Dec 2024 12:56 |
Last Modified: | 17 Dec 2024 12:56 |
SWORD Depositor: | Deep Green |
URI: | https://tuprints.ulb.tu-darmstadt.de/id/eprint/23931 |
PPN: | |
Export: |
View Item |