Multi-agent reinforcement learning (MARL) holds significant promise across domains such as autonomous driving, warehouse logistics, intelligent rail networks, and satellite alignment. Yet deploying MARL in the real world remains difficult. Training typically requires vast amounts of interactive data, which is both costly and potentially risky, particularly in safety-critical settings where trial and error is not… Read more »









