Analysis · AI Models
7 days ago
New paper studies offline-to-online learning in linear bandits
The paper proposes a method that combines offline data with online exploration in stochastic linear bandits. A key finding is a phase transition that depends on the offline dataset's coverage.