Analysis · AI Models

XIPER: Cross-domain Video Prediction Reward for RL

XIPER (Cross-domain Video Prediction Reward) targets reinforcement learning from expert videos when the expert and the agent operate in visually distinct domains and no reward signal is available. It uses a video prediction model to generate the reward signal for imitation learning, without requiring separate domain adaptation effort. The method is validated on multiple cross-domain transfer tasks.
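The summary does not say how the prediction model is turned into a reward, so the following is only a toy sketch of the general idea and not XIPER's actual method. It assumes a predictor fit on expert frames, with an agent transition scored by how closely it matches the predicted next frame. The linear predictor, the 2-D "frames", and the names `fit_linear_predictor` and `prediction_reward` are all hypothetical stand-ins for illustration; the cross-domain aspect is not modeled here.

```python
import numpy as np


def fit_linear_predictor(expert_frames, ridge=1e-6):
    """Fit W so that next_frame ~= W @ frame on expert data (ridge regression).

    A linear map stands in for a real video prediction network (assumption).
    """
    X = expert_frames[:-1]  # (T-1, D) current frames
    Y = expert_frames[1:]   # (T-1, D) next frames
    D = X.shape[1]
    # Solve (X^T X + ridge * I) W^T = X^T Y, so that Y ~= X @ W^T.
    A = X.T @ X + ridge * np.eye(D)
    W_T = np.linalg.solve(A, X.T @ Y)
    return W_T.T


def prediction_reward(W, frame, next_frame):
    """Reward is the negative mean squared error between prediction and observation."""
    predicted = W @ frame
    return -float(np.mean((predicted - next_frame) ** 2))


# Toy "expert video": a 2-D point rotating by 0.1 rad per step.
theta = 0.1
M = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
frames = [np.array([1.0, 0.0])]
for _ in range(49):
    frames.append(M @ frames[-1])
expert_frames = np.stack(frames)

W = fit_linear_predictor(expert_frames)

# A transition that follows the expert dynamics vs. one that moves the opposite way.
expert_like = prediction_reward(W, expert_frames[10], expert_frames[11])
off_expert = prediction_reward(W, expert_frames[10], -expert_frames[11])
print(expert_like, off_expert)
```

In this toy setup the expert-like transition receives a reward close to zero, while the transition that departs from the expert dynamics is penalized. An RL agent maximizing such a signal would be pushed toward expert-like behavior without any hand-written reward.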

8 days ago