Abstract
This article considers the time-varying zero-sum game between two multiagent networks with different topologies. The two networks are modeled as adversarial players, where agents within each network communicate with their local neighbors while simultaneously gathering information from the opposing network through a bipartite interaction framework. The payoffs of the agents can be quantified by time-varying cost functions, and the objective is to design an online distributed algorithm to optimize the payoff for each network in the game. We use dynamic Nash equilibrium regret and duality gap as performance metrics and propose a projection-free algorithm called Online Distributed Frank-Wolfe in two-network (ODFW-TN). We assess the convergence of Algorithm ODFW-TN through regret analysis, and establish sublinear bounds for dynamic Nash equilibrium regret and duality gap of Algorithm ODFW-TN with respect to T, where T is the total number of games. Moreover, we validate the effectiveness of Algorithm ODFW-TN via a numerical experiment of a time-varying bilinear matrix game.
| Original language | English |
|---|---|
| Pages (from-to) | 183-197 |
| Number of pages | 15 |
| Journal | IEEE Transactions on Automatic Control |
| Volume | 71 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Jan 2026 |
Keywords
- Frank-Wolfe
- nash equilibrium
- no-regret learning
- online distributed optimization
- zero-sum game
Fingerprint
Dive into the research topics of 'Distributed dynamic no-regret learning in two-network zero-sum games'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver