NFDI4DS | UHH-SEMS - Publication Details

Panayotis Mertikopoulos

ORCID: 0000-0003-2026-9616

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5018966648

Research Areas

Advanced Bandit Algorithms Research
Game Theory and Applications
Stochastic Gradient Optimization Techniques
Advanced MIMO Systems Optimization
Sparse and Compressive Sensing Techniques
Reinforcement Learning in Robotics
Cooperative Communication and Network Coding
Auction Theory and Applications
Evolutionary Game Theory and Cooperation
Experimental Behavioral Economics Studies
Energy Harvesting in Wireless Networks
Advanced Wireless Network Optimization
Cognitive Radio Networks and Spectrum Sensing
Opinion Dynamics and Social Influence
Economic theories and models
Optimization and Variational Analysis
Evolution and Genetic Dynamics
Markov Chains and Monte Carlo Methods
Optimization and Search Problems
Advanced Optimization Algorithms Research
Wireless Communication Networks Research
Complex Network Analysis Techniques
IoT and Edge/Fog Computing
Quantum Information and Cryptography
Wireless Communication Security Techniques

Centre National de la Recherche Scientifique
2016-2025

Laboratoire d'Informatique de Grenoble
2016-2025

Université Grenoble Alpes
2016-2025

Centre Inria de l'Université Grenoble Alpes
2015-2025

Institut polytechnique de Grenoble
2017-2025

Criteo (France)
2018-2022

Institut national de recherche en informatique et en automatique
2014-2021

Laboratoire Jean Kuntzmann
2021

Maastricht University
2018

University of Vienna
2018

Learning in games with continuous action sets and unknown payoff functions

OPENALEX - Publications

Panayotis Mertikopoulos Zhengyuan Zhou

10.1007/s10107-018-1254-8 article EN Mathematical Programming 2018-03-12

Learning in Games via Reinforcement and Regularization

OPENALEX - Publications

Panayotis Mertikopoulos William H. Sandholm

We investigate a class of reinforcement learning dynamics where players adjust their strategies based on actions’ cumulative payoffs over time—specifically, by playing mixed that maximize expected payoff minus regularization term. A widely studied example is exponential learning, process induced an entropic term which leads to evolve according the replicator dynamics. However, in contrast functions used define smooth best responses models stochastic fictitious play, this paper need not be...

10.1287/moor.2016.0778 article EN Mathematics of Operations Research 2016-08-18

A Resource Allocation Framework for Network Slicing

OPENALEX - Publications

Mathieu Leconte Georgios S. Paschos Panayotis Mertikopoulos Ulaş C. Kozat

Telecommunication networks are converging to a massively distributed cloud infrastructure interconnected with software defined networks. In the envisioned architecture, services will be deployed flexibly and quickly as network slices. Our paper addresses major bottleneck in this context, namely challenge of computing best resource provisioning for slices robust efficient manner. With tractability mind, we propose novel optimization framework which allows fine-grained allocation both terms...

10.1109/infocom.2018.8486303 article EN IEEE INFOCOM 2022 - IEEE Conference on Computer Communications 2018-04-01

Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

OPENALEX - Publications

Panayotis Mertikopoulos Bruno Lecouat Houssam Zenati Chuan-Sheng Foo Vijay Chandrasekhar and 1 more

Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond. By necessity, most theoretical guarantees revolve around convex-concave (or even linear) problems; however, making inroads towards efficient GAN training depends crucially on moving beyond this classic framework. To make piecemeal progress along these lines, we analyze the behavior of mirror descent (MD) a class...

10.48550/arxiv.1807.02629 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Distributed Learning Policies for Power Allocation in Multiple Access Channels

OPENALEX - Publications

Panayotis Mertikopoulos E. Veronica Belmega Aris L. Moustakas Samson Lasaulce

We analyze the power allocation problem for orthogonal multiple access channels by means of a non-cooperative potential game in which each user distributes his over available to him. When are static, we show that this possesses unique equilibrium; moreover, if network's users follow distributed learning scheme based on replicator dynamics evolutionary theory, then they converge equilibrium exponentially fast. On other hand, fluctuate stochastically time, associated still admits equilibrium,...

10.1109/jsac.2012.120109 article EN IEEE Journal on Selected Areas in Communications 2012-01-01

Energy-Aware Competitive Power Allocation for Heterogeneous Networks Under QoS Constraints

OPENALEX - Publications

Giacomo Bacci E. Veronica Belmega Panayotis Mertikopoulos Luca Sanguinetti

This work proposes a distributed power allocation scheme for maximizing energy efficiency in the uplink of orthogonal frequency-division multiple access (OFDMA)-based heterogeneous networks (HetNets). The user equipment (UEs) network are modeled as rational agents that engage non-cooperative game where each UE allocates its available transmit over set assigned subcarriers so to maximize individual utility (defined user's throughput per Watt power) subject minimum-rate constraints. In this...

10.1109/twc.2015.2425397 article EN IEEE Transactions on Wireless Communications 2015-04-22

Fog Based Framework for IoT Service Provisioning

OPENALEX - Publications

Bruno Donassolo Ilhem Fajjari Arnaud Legrand Panayotis Mertikopoulos

To this day, the Internet of Things (IoT) continues its explosive growth. Nevertheless, with exceptional evolution traffic demand, existing infrastructures are struggling to resist. In context, Fog computing is shaping future IoT applications. It offers nearby computational, networking and storage resources respond stringent requirements these However, despite several advantages, raises new challenges which slow adoption down. Hence, there a lack practical solutions enable exploitation novel...

10.1109/ccnc.2019.8651835 preprint EN 2019-01-01

Living at the Edge: A Large Deviations Approach to the Outage MIMO Capacity

OPENALEX - Publications

Pavlos Kazakopoulos Panayotis Mertikopoulos Aris L. Moustakas Giuseppe Caire

Using a large deviations approach we calculate the probability distribution of mutual information MIMO channels in limit antenna numbers. In contrast to previous methods that only focused at close its mean (thus obtaining an asymptotically Gaussian distribution), full distribution, including tails which strongly deviate from behavior near mean. The resulting interpolates seamlessly between approximation for rates $R$ ergodic value and Zheng Tse signal noise ratios $\rho$. This calculation...

10.1109/tit.2011.2112050 article EN IEEE Transactions on Information Theory 2011-03-15

Penalty-Regulated Dynamics and Robust Learning Procedures in Games

OPENALEX - Publications

Pierre Coucheney Bruno Gaujal Panayotis Mertikopoulos

Starting from a heuristic learning scheme for strategic N-person games, we derive new class of continuous-time dynamics consisting replicator-like drift adjusted by penalty term that renders the boundary game’s strategy space repelling. These penalty-regulated are equivalent to players keeping an exponentially discounted aggregate their ongoing payoffs and then using smooth best response pick action based on these performance scores. Owing this inherent duality, proposed satisfy variant folk...

10.1287/moor.2014.0687 article EN Mathematics of Operations Research 2014-11-17

Minibatch Forward-Backward-Forward Methods for Solving Stochastic Variational Inequalities

OPENALEX - Publications

Radu Ioan Boţ Panayotis Mertikopoulos Mathias Staudigl Phan Tu Vuong

We develop a new stochastic algorithm for solving pseudomonotone variational inequalities. Our method builds on Tseng’s forward-backward-forward algorithm, which is known in the deterministic literature to be valuable alternative Korpelevich’s extragradient when inequalities over convex and closed set governed by Lipschitz continuous operators. The main computational advantage of that it relies only single projection step two independent queries oracle. incorporates minibatch sampling...

10.1287/stsy.2019.0064 article EN cc-by Stochastic Systems 2021-02-25

Higher order game dynamics

OPENALEX - Publications

Rida Laraki Panayotis Mertikopoulos

10.1016/j.jet.2013.08.002 article EN Journal of Economic Theory 2013-08-12

Mixed-Strategy Learning With Continuous Action Sets

OPENALEX - Publications

Steven W. Perkins Panayotis Mertikopoulos David S. Leslie

Motivated by the recent applications of game-theoretical learning to design distributed control systems, we study a class problems that can be formulated as potential games with continuous action sets. We propose an actor-critic reinforcement algorithm adapts mixed strategies over spaces. To analyze algorithm, extend theory finite-dimensional two-timescale stochastic approximation Banach space setting, and prove dynamics process converge equilibrium in case games. These results combine give...

10.1109/tac.2015.2511930 article EN cc-by IEEE Transactions on Automatic Control 2015-12-23

Learning to Be Green: Robust Energy Efficiency Maximization in Dynamic MIMO–OFDM Systems

OPENALEX - Publications

Panayotis Mertikopoulos E. Veronica Belmega

In this paper, we examine the maximization of energy efficiency (EE) in next-generation multiuser MIMO-OFDM networks that vary dynamically over time-e.g., due to user mobility, fluctuations wireless medium, modulations users' load, etc. Contrary static/stationary regime, system may evolve an arbitrary manner, so users must adjust "on fly," without being able predict state advance. To tackle these issues, propose a simple and distributed online optimization policy leads no regret, i.e., it...

10.1109/jsac.2016.2544600 article EN IEEE Journal on Selected Areas in Communications 2016-03-21

Online convex optimization and no-regret learning: Algorithms, guarantees and applications

OPENALEX - Publications

E. Veronica Belmega Panayotis Mertikopoulos Romain Negrel Luca Sanguinetti

Spurred by the enthusiasm surrounding "Big Data" paradigm, mathematical and algorithmic tools of online optimization have found widespread use in problems where trade-off between data exploration exploitation plays a predominant role. This is particular importance to several branches applications signal processing, such as mining, statistical inference, multimedia indexing wireless communications (to name but few). With this mind, aim tutorial paper provide gentle introduction learning...

10.48550/arxiv.1804.04529 preprint EN other-oa arXiv (Cornell University) 2018-01-01

On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems

OPENALEX - Publications

Panayotis Mertikopoulos Nadav Hallak Ali Kavis Volkan Cevher

This paper analyzes the trajectories of stochastic gradient descent (SGD) to help understand algorithm's convergence properties in non-convex problems. We first show that sequence iterates generated by SGD remains bounded and converges with probability $1$ under a very broad range step-size schedules. Subsequently, going beyond existing positive guarantees, we avoids strict saddle points/manifolds for entire spectrum policies considered. Finally, prove rate Hurwicz minimizers is...

10.48550/arxiv.2006.11144 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Distributed Stochastic Optimization via Matrix Exponential Learning

OPENALEX - Publications

Panayotis Mertikopoulos E. Veronica Belmega Romain Negrel Luca Sanguinetti

In this paper, we investigate a distributed learning scheme for broad class of stochastic optimization problems and games that arise in signal processing wireless communications. The proposed algorithm relies on the method matrix exponential (MXL) only requires locally computable gradient observations are possibly imperfect and/or obsolete. To analyze it, introduce notion stable Nash equilibrium show is globally convergent to such equilibria - or when an stable. We also derive explicit...

10.1109/tsp.2017.2656847 article EN IEEE Transactions on Signal Processing 2017-01-23

A continuous-time approach to online optimization

OPENALEX - Publications

Joon Kwon Panayotis Mertikopoulos

We consider a family of mirror descent strategies for online optimization in continuous-time and we show that they lead to no regret. From more traditional, discrete-time viewpoint, this approach allows us derive the no-regret properties large class algorithms including as special cases exponential weights algorithm, descent, smooth fictitious play vanishingly play. In so doing, obtain unified view many classical regret bounds, can be decomposed into term stemming from considerations which...

10.3934/jdg.2017008 article EN Journal of Dynamics and Games 2017-01-01

Riemannian game dynamics

OPENALEX - Publications

Panayotis Mertikopoulos William H. Sandholm

10.1016/j.jet.2018.06.002 article EN publisher-specific-oa Journal of Economic Theory 2018-06-28

Learning in an Uncertain World: MIMO Covariance Matrix Optimization With Imperfect Feedback

OPENALEX - Publications

Panayotis Mertikopoulos Aris L. Moustakas

In this paper, we present a distributed learning algorithm for the optimization of signal covariance matrices in Gaussian multiple-input and multiple-output (MIMO) multiple access channel with imperfect (and possibly delayed) feedback. The is based on method matrix exponential (MXL) it has same information computation requirements as water-filling. However, unlike water-filling, proposed converges to system's optimum profile even under stochastic uncertainty Moreover, also retains its...

10.1109/tsp.2015.2477053 article EN IEEE Transactions on Signal Processing 2015-09-07

The emergence of rational behavior in the presence of stochastic perturbations

OPENALEX - Publications

Panayotis Mertikopoulos Aris L. Moustakas

We study repeated games where players use an exponential learning scheme in order to adapt ever-changing environment. If the game's payoffs are subject random perturbations, this leads a new stochastic version of replicator dynamics that is quite different from "aggregate shocks" approach evolutionary game theory. Irrespective perturbations' magnitude, we find strategies which dominated (even iteratively) eventually become extinct and strict Nash equilibria stochastically asymptotically...

10.1214/09-aap651 article EN The Annals of Applied Probability 2010-07-20

Coming Soon ...