AnalysisAI Models
7 days ago
Paper argues deployed RL should be continual
The paper critiques the train-then-fix paradigm in deployed RL, where agents stop learning after initial training. It advocates for continual learning approaches to maintain performance over time.
·
7 days ago