AnalysisPolicyAI Models
1 day ago
New papers propose shielding methods for safe RL
Two arXiv papers introduce shielding approaches for safe reinforcement learning. 'Contract-Based Compositional Shielding' addresses multi-agent coordination under global safety constraints. 'Robust Shielding' handles unknown transition dynamics in MDPs.