XIPER: Cross-domain Video Prediction Reward for RL

AnalysisAI Models

8 days ago

XIPER: Cross-domain Video Prediction Reward for RL

XIPER (Cross-domain Video Prediction Reward) addresses the challenge of reinforcement learning from expert videos across visually distinct domains, where reward signals are absent and domain gaps exist. It uses a video prediction model to generate reward signals for imitation learning without requiring domain adaptation efforts. The method is validated on multiple cross-domain transfer tasks.

8 days ago