Live Session
Session 15: Off-policy Learning
Main Track
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Tatsuhiro Shimizu (Independent Researcher), Koichi Tanaka (Keio University), Ren Kishimoto (Tokyo Institute of Technology), Haruka Kiyohara (Cornell University), Masahiro Nomura (CyberAgent, Inc.) and Yuta Saito (Cornell University)
Abstract
We explore off-policy evaluation and learning (OPE/L) in contextual combinatorial bandits (CCB), where a policy selects a subset of the action space. For example, it might choose a set of furniture pieces (a bed and a drawer) from available items (bed, drawer, chair, etc.) for interior design sales. This setting is widespread in fields such as recommender systems and healthcare, yet OPE/L of CCB remains unexplored in the relevant literature. Standard OPE methods typically employ regression and importance sampling in the action subset space. However, they often face significant challenges due to high bias or variance, exacerbated by the exponential growth in the number of available subsets. To address these challenges, we introduce the concept of a factored action space, which allows us to decompose each subset into binary indicators. These indicators signify whether each action is included in the selected subset. This formulation enables us to distinguish between the “main effect” derived from the main actions and the “residual effect” originating from the supplemental actions, facilitating more effective OPE. Specifically, our estimator, called OPCB, leverages an importance sampling-based approach to unbiasedly estimate the main effect, while employing a regression-based approach to deal with the residual effect with low variance. OPCB achieves substantial variance reduction compared to conventional importance sampling methods and bias reduction relative to regression methods under certain conditions, as shown in our theoretical analysis. Experiments on both synthetic and real-world datasets demonstrate OPCB’s superior performance over the typical methods, particularly when navigating the complexities of a large action subset space.
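To make the idea above concrete, here is a minimal, hypothetical sketch of this style of estimator; it is not the paper's exact OPCB method. It assumes a linear reward over binary action indicators, factored (indicator-independent) logging and target policies, and an arbitrary choice of "main" actions; all names (`sample_subsets`, `main_idx`, `q_hat`, etc.) are illustrative assumptions.

```python
# Hypothetical sketch of an OPCB-style estimator for a contextual
# combinatorial bandit: importance sampling over a few "main" action
# indicators, plus a regression model for the residual effect.
# Assumptions: linear rewards, factored policies, hand-picked main actions.
import numpy as np

rng = np.random.default_rng(0)
n_actions = 5          # unary actions; subsets live in {0,1}^n_actions
main_idx = [0, 1]      # assumed "main" actions (hypothetical split)
n_samples = 10_000

def sample_subsets(probs, n):
    """Sample subsets as independent binary indicators per action."""
    return (rng.random((n, len(probs))) < probs).astype(int)

def subset_prob(subsets, probs):
    """Probability of each sampled subset under independent indicators."""
    p = probs * subsets + (1 - probs) * (1 - subsets)
    return p.prod(axis=1)

# Logging policy pi0 and target policy pi1 (both factored for simplicity).
pi0 = np.full(n_actions, 0.5)
pi1 = np.array([0.8, 0.7, 0.4, 0.3, 0.2])

# Logged data: subsets A ~ pi0, rewards r linear in the indicators + noise.
theta = np.array([1.0, 0.8, 0.3, 0.2, 0.1])
A = sample_subsets(pi0, n_samples)
r = A @ theta + rng.normal(0, 0.1, n_samples)

# Regression model q_hat(a) used for the residual effect; here a simple
# least-squares fit on the binary indicators (an assumption).
X = np.hstack([A, np.ones((n_samples, 1))])
beta, *_ = np.linalg.lstsq(X, r, rcond=None)
q_hat = lambda S: np.hstack([S, np.ones((len(S), 1))]) @ beta

# Importance weights over the *main* indicators only; these stay far
# smaller than full-subset weights, which range over 2^n_actions subsets.
w = (subset_prob(A[:, main_idx], pi1[main_idx])
     / subset_prob(A[:, main_idx], pi0[main_idx]))

# OPCB-style combination (sketch): weight the regression residual by the
# main-action importance weights, then add the model's value under pi1
# via Monte Carlo sampling.
A_new = sample_subsets(pi1, n_samples)
estimate = np.mean(w * (r - q_hat(A))) + np.mean(q_hat(A_new))

# Baselines for comparison: full-subset IPS and pure regression (DM).
w_full = subset_prob(A, pi1) / subset_prob(A, pi0)
ips = np.mean(w_full * r)
dm = np.mean(q_hat(A_new))
true_value = pi1 @ theta   # E[a_i] = pi1_i under the factored target policy

print(f"true value:      {true_value:.3f}")
print(f"OPCB-style:      {estimate:.3f}")
print(f"full-subset IPS: {ips:.3f}")
print(f"regression (DM): {dm:.3f}")
```

In this toy setup the partial importance weights `w` have much lower variance than the full-subset weights `w_full`, while the regression term absorbs the effect of the remaining indicators, which is the bias-variance trade-off the abstract describes.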