Analysis · AI Models

New paper studies offline-to-online learning in linear bandits

The paper proposes a method that combines offline data with online exploration in stochastic linear bandits. A key finding is a phase transition that depends on the offline dataset's coverage.

7 days ago