Live Session
Chamber of Commerce
Poster
15 Oct
 
8:00
CEST
Tuesday Posters
Tuesday Posters: 2024-10-15, 08:00 am to 05:30 pm (Europe/Rome). Tuesday Posters is taking place on the RecSys Hub: https://recsyshub.org
Research

Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce

Yuan Wang (Alibaba Group), Zhiyu Li (Alibaba Group), Changshuo Zhang (Gaoling School of Artificial Intelligence, Renmin University of China), Sirui Chen (School of Information, Renmin University of China), Xiao Zhang (Gaoling School of Artificial Intelligence, Renmin University of China), Jun Xu (Gaoling School of Artificial Intelligence, Renmin University of China) and Quan Lin (Alibaba Group)

Abstract

Recommender systems are widely used in e-commerce, and re-ranking models, which leverage inter-item influence and determine the final recommendation lists, play an increasingly significant role in the domain. Online learning methods keep updating a deployed model with the latest available samples to capture shifts in the underlying data distribution in e-commerce. However, they depend on the availability of real user feedback, such as item purchases, which may be delayed by hours or even days, leading to a lag in model enhancement. In this paper, we propose a novel extension of online learning methods for re-ranking modeling, which we term LAST, an acronym for Learning At Serving Time. It circumvents the requirement of user feedback by using a surrogate model to provide the instructional signal needed to steer model improvement. Upon receiving an online request, LAST finds and applies a model modification on the fly before generating a recommendation result for the request. The modification is request-specific and transient: it is tailored to, and only to, the current request in order to capture that request's specific context. After the request is served, the modification is discarded, which helps prevent error propagation and stabilizes the online learning procedure, since the predictions of the surrogate model may be inaccurate. Most importantly, as a complement to feedback-based online learning methods, LAST can be seamlessly integrated into existing online learning systems to create a more adaptive and responsive recommendation experience. Comprehensive experiments, both offline and online, affirm that LAST outperforms state-of-the-art re-ranking models.
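
The serving-time loop described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the linear scorer, the random-perturbation search, the toy surrogate, and every name below (rank, serve_with_last, make_toy_surrogate, steps, sigma) are assumptions chosen for illustration only. It shows the three properties the abstract names: the modification is found per request, it is guided by a surrogate model instead of user feedback, and it is discarded after the request.

```python
import random
from typing import Callable, List, Sequence

Item = Sequence[float]  # feature vector of one candidate item (hypothetical)


def rank(weights: Sequence[float], items: List[Item]) -> List[int]:
    """Return item indices ordered by a linear score, best first."""
    scores = [sum(w * x for w, x in zip(weights, item)) for item in items]
    return sorted(range(len(items)), key=lambda i: -scores[i])


def make_toy_surrogate(utilities: Sequence[float]) -> Callable[[List[int]], float]:
    """Toy stand-in for a surrogate model: position-discounted list value."""
    def surrogate(order: List[int]) -> float:
        return sum(utilities[i] / (pos + 1) for pos, i in enumerate(order))
    return surrogate


def serve_with_last(
    base_weights: Sequence[float],
    items: List[Item],
    surrogate: Callable[[List[int]], float],
    steps: int = 50,
    sigma: float = 0.5,
    seed: int = 0,
) -> List[int]:
    """Search for a request-specific weight modification, then rank with it.

    Assumption: simple random-perturbation hill climbing stands in for
    whatever optimization the paper actually uses.
    """
    rng = random.Random(seed)
    best_delta = [0.0] * len(base_weights)  # the transient modification
    best_order = rank(base_weights, items)
    best_value = surrogate(best_order)
    for _ in range(steps):
        delta = [d + rng.gauss(0.0, sigma) for d in best_delta]
        order = rank([w + d for w, d in zip(base_weights, delta)], items)
        value = surrogate(order)
        if value > best_value:  # keep only changes the surrogate prefers
            best_delta, best_order, best_value = delta, order, value
    # best_delta is dropped on return; base_weights is never mutated.
    return best_order
```

A usage example under the same assumptions:

```python
items = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
base_weights = [1.0, 0.0]
surrogate = make_toy_surrogate([0.1, 0.9, 0.5])
order = serve_with_last(base_weights, items, surrogate)
```

Because the search only accepts modifications that raise the surrogate's estimate, the returned list is never worse than the base model's list according to the surrogate, and the deployed weights stay untouched for the next request.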
