Live Session
Teatro Petruzzelli
Paper
16 Oct
 
9:30
CEST
Session 7: Cold Start
Add Session to Calendar 2024-10-16 09:30 am 2024-10-16 10:25 am Europe/Rome Session 7: Cold Start Session 7: Cold Start is taking place on the RecSys Hub. Https://recsyshub.org
Main Track

A multimodal single-branch embedding network for recommendation in cold-start and missing modality scenarios

View on ACM Digital Library

Christian Ganhör (Johannes Kepler University Linz), Marta Moscati (Johannes Kepler University Linz), Shah Nawaz (Johannes Kepler University Linz), Anna Hausberger (Johannes Kepler University Linz) and Markus Schedl (Johannes Kepler University Linz)

View Paper PDFView Poster
Abstract

Most recommender systems adopt collaborative filtering (CF) and provide recommendations based on past collective interactions. Therefore, the performance of CF algorithms degrades when few or no interactions are available, a scenario referred to as cold-start. To address this issue, previous work relies on models leveraging both collaborative data and side information on the users or items. Simlar to multimodal learning, these models aim at combining collaborative and content representations in a shared embedding space. In this work we propose a novel technique for multimodal recommendation, relying on a multimodal Single-Branch embedding network for Recommendation (SiBraR). Leveraging weight-sharing, SiBraR encodes interaction data as well as multimodal side information using the same single-branch embedding network on different modalities. This makes SiBraR effective in scenarios of missing modality, including cold start. Our extensive experiments on large-scale recommendation datasets from three different recommendation domains (music, movie, and e-commerce) and providing multimodal content information (audio, text, image, labels, and interactions) show that SiBraR significantly outperforms CF as well as state-of-the-art CBRS in cold-start scenarios, and is competitive in warm scenarios. We show that SiBraR’s recommendations are accurate in missing modality scenarios, and that the model is able to map different modalities to the same region of the shared embedding space, hence reducing the modality gap.

Join the Conversation

Head to Slido and select the paper's assigned session to join the live discussion.

Conference Agenda

View Full Agenda →
No items found.