| dc.contributor.author | Soleymani, Ashkan | |
| dc.contributor.author | Piliouras, Georgios | |
| dc.contributor.author | Farina, Gabriele | |
| dc.date.accessioned | 2026-01-22T15:27:08Z | |
| dc.date.available | 2026-01-22T15:27:08Z | |
| dc.date.issued | 2025-06-15 | |
| dc.identifier.isbn | 979-8-4007-1510-5 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/164613 | |
| dc.description | STOC ’25, Prague, Czechia | en_US |
| dc.description.abstract | We establish the first uncoupled learning algorithm that attains O(n log2 d logT) per-player regret in multi-player general-sum games, where n is the number of players, d is the number of actions available to each player, and T is the number of repetitions of the game. Our results exponentially improve the dependence on d compared to the O(n d logT) regret attainable by Log-Regularized Lifted Optimistic FTRL introduced by Farina, Anagnostides, Luo, Lee, Kroer, and Sandholm [2022], and also reduce the dependence on the number of iterations T from log4 T to logT compared to Optimistic Hedge, the previously well-studied algorithm with O(n logd log4 T) regret shown by Daskalakis, Fishelson, and Golowich [2021]. Our algorithm is obtained by combining the classic Optimistic Multiplicative Weights Update (OMWU) with an adaptive, non-monotonic learning rate that paces the learning process of the players, making them more cautious when their regret becomes too negative. | en_US |
| dc.publisher | ACM|Proceedings of the 57th Annual ACM Symposium on Theory of Computing | en_US |
| dc.relation.isversionof | https://doi.org/10.1145/3717823.3718242 | en_US |
| dc.rights | Creative Commons Attribution | en_US |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
| dc.source | Association for Computing Machinery | en_US |
| dc.title | Faster Rates for No-Regret Learning in General Games via Cautious Optimism | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Ashkan Soleymani, Georgios Piliouras, and Gabriele Farina. 2025. Faster Rates for No-Regret Learning in General Games via Cautious Optimism. In Proceedings of the 57th Annual ACM Symposium on Theory of Computing (STOC '25). Association for Computing Machinery, New York, NY, USA, 518–529. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.identifier.mitlicense | PUBLISHER_POLICY | |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2025-08-01T08:43:47Z | |
| dc.language.rfc3066 | en | |
| dc.rights.holder | The author(s) | |
| dspace.date.submission | 2025-08-01T08:43:47Z | |
| mit.license | PUBLISHER_CC | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |