
[2310.12370] No-Regret Learning in Bilateral Trade via Global Budget Balance

source link: https://arxiv.org/abs/2310.12370
[Submitted on 18 Oct 2023 (v1), last revised 27 Mar 2024 (this version, v2)]

No-Regret Learning in Bilateral Trade via Global Budget Balance


Bilateral trade models the problem of intermediating between two rational agents -- a seller and a buyer -- both characterized by a private valuation for an item they want to trade. We study the online learning version of the problem, in which at each time step a new seller and buyer arrive and the learner has to set prices for them without any knowledge about their (adversarially generated) valuations.
In this setting, known impossibility results rule out the existence of no-regret algorithms when budget balance has to be enforced at each time step. In this paper, we introduce the notion of \emph{global budget balance}, which only requires the learner to fulfill budget balance over the entire time horizon. Under this natural relaxation, we provide the first no-regret algorithms for adversarial bilateral trade under various feedback models. First, we show that in the full-feedback model, the learner can guarantee $\tilde{O}(\sqrt{T})$ regret against the best fixed prices in hindsight, and that this bound is optimal up to poly-logarithmic terms. Second, we provide a learning algorithm guaranteeing an $\tilde{O}(T^{3/4})$ regret upper bound with one-bit feedback, which we complement with an $\Omega(T^{5/7})$ lower bound that holds even in the two-bit feedback model. Finally, we introduce and analyze an alternative benchmark that is provably stronger than the best fixed prices in hindsight and is inspired by the literature on bandits with knapsacks.
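
To make the difference between per-step and global budget balance concrete, here is a minimal Python sketch. It is not taken from the paper. It assumes the usual two-price formulation of bilateral trade: the learner offers a price p to the seller and a price q to the buyer, a trade happens when the seller's valuation is at most p and the buyer's valuation is at least q, the gain from trade is the buyer's valuation minus the seller's, and the learner's profit is q - p. All names (`Round`, `trade_happens`, `profit`, and so on) are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Round:
    """One time step (illustrative names, not from the paper)."""
    seller_value: float  # seller's private valuation
    buyer_value: float   # buyer's private valuation
    seller_price: float  # price p offered to the seller
    buyer_price: float   # price q asked from the buyer


def trade_happens(r: Round) -> bool:
    # Assumption: both agents accept iff the price is weakly favorable to them.
    return r.seller_value <= r.seller_price and r.buyer_price <= r.buyer_value


def gain_from_trade(r: Round) -> float:
    # Assumption: welfare created by a trade is buyer value minus seller value.
    return r.buyer_value - r.seller_value if trade_happens(r) else 0.0


def profit(r: Round) -> float:
    # Learner collects q from the buyer and pays p to the seller.
    return r.buyer_price - r.seller_price if trade_happens(r) else 0.0


def per_round_budget_balanced(rounds: list[Round]) -> bool:
    # Budget balance enforced at every time step.
    return all(profit(r) >= 0 for r in rounds)


def globally_budget_balanced(rounds: list[Round]) -> bool:
    # Budget balance enforced only over the whole horizon.
    return sum(profit(r) for r in rounds) >= 0


rounds = [
    # Round 1: learner earns 0.75 - 0.25 = 0.5.
    Round(seller_value=0.25, buyer_value=1.0, seller_price=0.25, buyer_price=0.75),
    # Round 2: learner subsidizes the trade, losing 0.25.
    Round(seller_value=0.5, buyer_value=0.75, seller_price=0.75, buyer_price=0.5),
]

print(per_round_budget_balanced(rounds))      # False: round 2 runs a deficit
print(globally_budget_balanced(rounds))       # True: total profit is 0.25
print(sum(gain_from_trade(r) for r in rounds))  # 1.0
```

In this toy sequence the second trade is only feasible because the learner spends part of the profit banked in the first round, which is the extra freedom the global constraint allows under the assumptions above.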
Comments: Accepted at STOC 2024
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
Cite as: arXiv:2310.12370 [cs.GT]
  (or arXiv:2310.12370v2 [cs.GT] for this version)
  https://doi.org/10.48550/arXiv.2310.12370
