Panayotis Mertikopoulos

ORCID: 0000-0003-2026-9616
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Bandit Algorithms Research
  • Game Theory and Applications
  • Stochastic Gradient Optimization Techniques
  • Advanced MIMO Systems Optimization
  • Sparse and Compressive Sensing Techniques
  • Reinforcement Learning in Robotics
  • Cooperative Communication and Network Coding
  • Auction Theory and Applications
  • Evolutionary Game Theory and Cooperation
  • Experimental Behavioral Economics Studies
  • Energy Harvesting in Wireless Networks
  • Advanced Wireless Network Optimization
  • Cognitive Radio Networks and Spectrum Sensing
  • Opinion Dynamics and Social Influence
  • Economic theories and models
  • Optimization and Variational Analysis
  • Evolution and Genetic Dynamics
  • Markov Chains and Monte Carlo Methods
  • Optimization and Search Problems
  • Advanced Optimization Algorithms Research
  • Wireless Communication Networks Research
  • Complex Network Analysis Techniques
  • IoT and Edge/Fog Computing
  • Quantum Information and Cryptography
  • Wireless Communication Security Techniques

Centre National de la Recherche Scientifique
2016-2025

Laboratoire d'Informatique de Grenoble
2016-2025

Université Grenoble Alpes
2016-2025

Centre Inria de l'Université Grenoble Alpes
2015-2025

Institut polytechnique de Grenoble
2017-2025

Criteo (France)
2018-2022

Institut national de recherche en informatique et en automatique
2014-2021

Laboratoire Jean Kuntzmann
2021

Maastricht University
2018

University of Vienna
2018

We investigate a class of reinforcement learning dynamics where players adjust their strategies based on actions’ cumulative payoffs over time—specifically, by playing mixed that maximize expected payoff minus regularization term. A widely studied example is exponential learning, process induced an entropic term which leads to evolve according the replicator dynamics. However, in contrast functions used define smooth best responses models stochastic fictitious play, this paper need not be...

10.1287/moor.2016.0778 article EN Mathematics of Operations Research 2016-08-18

Telecommunication networks are converging to a massively distributed cloud infrastructure interconnected with software defined networks. In the envisioned architecture, services will be deployed flexibly and quickly as network slices. Our paper addresses major bottleneck in this context, namely challenge of computing best resource provisioning for slices robust efficient manner. With tractability mind, we propose novel optimization framework which allows fine-grained allocation both terms...

10.1109/infocom.2018.8486303 article EN IEEE INFOCOM 2022 - IEEE Conference on Computer Communications 2018-04-01

Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond. By necessity, most theoretical guarantees revolve around convex-concave (or even linear) problems; however, making inroads towards efficient GAN training depends crucially on moving beyond this classic framework. To make piecemeal progress along these lines, we analyze the behavior of mirror descent (MD) a class...

10.48550/arxiv.1807.02629 preprint EN other-oa arXiv (Cornell University) 2018-01-01

We analyze the power allocation problem for orthogonal multiple access channels by means of a non-cooperative potential game in which each user distributes his over available to him. When are static, we show that this possesses unique equilibrium; moreover, if network's users follow distributed learning scheme based on replicator dynamics evolutionary theory, then they converge equilibrium exponentially fast. On other hand, fluctuate stochastically time, associated still admits equilibrium,...

10.1109/jsac.2012.120109 article EN IEEE Journal on Selected Areas in Communications 2012-01-01

This work proposes a distributed power allocation scheme for maximizing energy efficiency in the uplink of orthogonal frequency-division multiple access (OFDMA)-based heterogeneous networks (HetNets). The user equipment (UEs) network are modeled as rational agents that engage non-cooperative game where each UE allocates its available transmit over set assigned subcarriers so to maximize individual utility (defined user's throughput per Watt power) subject minimum-rate constraints. In this...

10.1109/twc.2015.2425397 article EN IEEE Transactions on Wireless Communications 2015-04-22

To this day, the Internet of Things (IoT) continues its explosive growth. Nevertheless, with exceptional evolution traffic demand, existing infrastructures are struggling to resist. In context, Fog computing is shaping future IoT applications. It offers nearby computational, networking and storage resources respond stringent requirements these However, despite several advantages, raises new challenges which slow adoption down. Hence, there a lack practical solutions enable exploitation novel...

10.1109/ccnc.2019.8651835 preprint EN 2019-01-01

Using a large deviations approach we calculate the probability distribution of mutual information MIMO channels in limit antenna numbers. In contrast to previous methods that only focused at close its mean (thus obtaining an asymptotically Gaussian distribution), full distribution, including tails which strongly deviate from behavior near mean. The resulting interpolates seamlessly between approximation for rates $R$ ergodic value and Zheng Tse signal noise ratios $\rho$. This calculation...

10.1109/tit.2011.2112050 article EN IEEE Transactions on Information Theory 2011-03-15

Starting from a heuristic learning scheme for strategic N-person games, we derive new class of continuous-time dynamics consisting replicator-like drift adjusted by penalty term that renders the boundary game’s strategy space repelling. These penalty-regulated are equivalent to players keeping an exponentially discounted aggregate their ongoing payoffs and then using smooth best response pick action based on these performance scores. Owing this inherent duality, proposed satisfy variant folk...

10.1287/moor.2014.0687 article EN Mathematics of Operations Research 2014-11-17

We develop a new stochastic algorithm for solving pseudomonotone variational inequalities. Our method builds on Tseng’s forward-backward-forward algorithm, which is known in the deterministic literature to be valuable alternative Korpelevich’s extragradient when inequalities over convex and closed set governed by Lipschitz continuous operators. The main computational advantage of that it relies only single projection step two independent queries oracle. incorporates minibatch sampling...

10.1287/stsy.2019.0064 article EN cc-by Stochastic Systems 2021-02-25

10.1016/j.jet.2013.08.002 article EN Journal of Economic Theory 2013-08-12

Motivated by the recent applications of game-theoretical learning to design distributed control systems, we study a class problems that can be formulated as potential games with continuous action sets. We propose an actor-critic reinforcement algorithm adapts mixed strategies over spaces. To analyze algorithm, extend theory finite-dimensional two-timescale stochastic approximation Banach space setting, and prove dynamics process converge equilibrium in case games. These results combine give...

10.1109/tac.2015.2511930 article EN cc-by IEEE Transactions on Automatic Control 2015-12-23

In this paper, we examine the maximization of energy efficiency (EE) in next-generation multiuser MIMO-OFDM networks that vary dynamically over time-e.g., due to user mobility, fluctuations wireless medium, modulations users' load, etc. Contrary static/stationary regime, system may evolve an arbitrary manner, so users must adjust "on fly," without being able predict state advance. To tackle these issues, propose a simple and distributed online optimization policy leads no regret, i.e., it...

10.1109/jsac.2016.2544600 article EN IEEE Journal on Selected Areas in Communications 2016-03-21

Spurred by the enthusiasm surrounding "Big Data" paradigm, mathematical and algorithmic tools of online optimization have found widespread use in problems where trade-off between data exploration exploitation plays a predominant role. This is particular importance to several branches applications signal processing, such as mining, statistical inference, multimedia indexing wireless communications (to name but few). With this mind, aim tutorial paper provide gentle introduction learning...

10.48550/arxiv.1804.04529 preprint EN other-oa arXiv (Cornell University) 2018-01-01

This paper analyzes the trajectories of stochastic gradient descent (SGD) to help understand algorithm's convergence properties in non-convex problems. We first show that sequence iterates generated by SGD remains bounded and converges with probability $1$ under a very broad range step-size schedules. Subsequently, going beyond existing positive guarantees, we avoids strict saddle points/manifolds for entire spectrum policies considered. Finally, prove rate Hurwicz minimizers is...

10.48550/arxiv.2006.11144 preprint EN other-oa arXiv (Cornell University) 2020-01-01

In this paper, we investigate a distributed learning scheme for broad class of stochastic optimization problems and games that arise in signal processing wireless communications. The proposed algorithm relies on the method matrix exponential (MXL) only requires locally computable gradient observations are possibly imperfect and/or obsolete. To analyze it, introduce notion stable Nash equilibrium show is globally convergent to such equilibria - or when an stable. We also derive explicit...

10.1109/tsp.2017.2656847 article EN IEEE Transactions on Signal Processing 2017-01-23

We consider a family of mirror descent strategies for online optimization in continuous-time and we show that they lead to no regret. From more traditional, discrete-time viewpoint, this approach allows us derive the no-regret properties large class algorithms including as special cases exponential weights algorithm, descent, smooth fictitious play vanishingly play. In so doing, obtain unified view many classical regret bounds, can be decomposed into term stemming from considerations which...

10.3934/jdg.2017008 article EN Journal of Dynamics and Games 2017-01-01

10.1016/j.jet.2018.06.002 article EN publisher-specific-oa Journal of Economic Theory 2018-06-28

In this paper, we present a distributed learning algorithm for the optimization of signal covariance matrices in Gaussian multiple-input and multiple-output (MIMO) multiple access channel with imperfect (and possibly delayed) feedback. The is based on method matrix exponential (MXL) it has same information computation requirements as water-filling. However, unlike water-filling, proposed converges to system's optimum profile even under stochastic uncertainty Moreover, also retains its...

10.1109/tsp.2015.2477053 article EN IEEE Transactions on Signal Processing 2015-09-07

We study repeated games where players use an exponential learning scheme in order to adapt ever-changing environment. If the game's payoffs are subject random perturbations, this leads a new stochastic version of replicator dynamics that is quite different from "aggregate shocks" approach evolutionary game theory. Irrespective perturbations' magnitude, we find strategies which dominated (even iteratively) eventually become extinct and strict Nash equilibria stochastically asymptotically...

10.1214/09-aap651 article EN The Annals of Applied Probability 2010-07-20
Coming Soon ...