HyPOLE: Hyperproperty-Guided Multi-Agent Reinforcement Learning under Partial Observation

Researchers introduce HyPOLE, a novel framework that guides Multi-Agent Reinforcement Learning (MARL) under partial observability using formal specifications and HyperLTL temporal logic, outperforming traditional baselines.

Computer Science > Artificial Intelligence

Title:HyPOLE: Hyperproperty-Guided Multi-Agent Reinforcement Learning under Partial Observation

View PDFAbstract:Formal specification is a powerful tool to guide the learning process and provides significant advantages over reward shaping: (1) mathematical rigor; (2) expressiveness to specify objectives and constraints, and (3) the ability to define tactics to achieve objectives. However, these benefits remain largely unexplored in the context of Multi-Agent Reinforcement Learning (MARL). This paper introduces HyPOLE, a novel framework for MARL under partial observability, where learning is guided by the expressive power of the so-called hyperproperties and, in particular, the temporal logic HyperLTL. We integrate Centralized Training for Decentralized Execution (CTDE) techniques with HyPOLE to synthesize decentralized policies, and our evaluation on SMAC, MessySMAC, and WildFire benchmark demonstrates clear advantages over baselines.

Current browse context:

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Source: arXiv cs.AI Recent

HyPOLE: Hyperproperty-Guided Multi-Agent Reinforcement Learning under Partial Observation

Computer Science > Artificial Intelligence

Title:HyPOLE: Hyperproperty-Guided Multi-Agent Reinforcement Learning under Partial Observation

Current browse context:

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

More in this category

What Drives Interactive Improvement from Feedback?

MultiUAV-Plat: An LLM-Oriented Platform, Benchmark and Framework for Multi-UAV Collaborative Task Planning

AgRefactor: Self-Evolving Agentic Workflow for HLS Compatibility and Performance

Contrastive Reflection for Iterative Prompt Optimization

Neuro-Bayesian-Symbolic Residual Attention Shallow Network: Explainable Deep Learning for Cybersecurity Risk Assessment

DDIAgents: Mechanism-Conditioned Context Flow for Drug-Drug Interaction Prediction

Discover All Categories