NFDI4DS | UHH-SEMS - Publication Details

Pingzhong Tang

ORCID: 0000-0003-1330-1999

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5013558969

Research Areas

Auction Theory and Applications
Consumer Market Behavior and Pricing
Game Theory and Applications
Game Theory and Voting Systems
Economic theories and models
Experimental Behavioral Economics Studies
Optimization and Search Problems
Reinforcement Learning in Robotics
Advanced Bandit Algorithms Research
Supply Chain and Inventory Management
Logic, Reasoning, and Knowledge
Digital Platforms and Economics
Privacy-Preserving Technologies in Data
Scheduling and Optimization Algorithms
Mobile Crowdsensing and Crowdsourcing
Artificial Intelligence in Games
Transportation and Mobility Innovations
Transportation Planning and Optimization
Adaptive Dynamic Programming Control
Financial Markets and Investment Strategies
Sharing Economy and Platforms
Distributed systems and fault tolerance
Urban Transport and Accessibility
Smart Parking Systems Research
Imbalanced Data Classification Techniques

Tsinghua University
2015-2024

Carnegie Mellon University
2011-2021

New York University
2020

Institute of Computing Technology
2019

Chinese Academy of Sciences
2019

Microsoft Research Asia (China)
2016

PLA Academy of Military Science
2016

Beijing University of Technology
2016

Laboratoire d'Informatique de Paris-Nord
2012

Hong Kong University of Science and Technology
2008-2011

Warm Up Cold-start Advertisements

OPENALEX - Publications

Feiyang Pan Shuokai Li Xiang Ao Pingzhong Tang Qing He

Click-through rate (CTR) prediction has been one of the most central problems in computational advertising. Lately, embedding techniques that produce low-dimensional representations ad IDs drastically improve CTR accuracies. However, such learning are data demanding and work poorly on new ads with little logging data, which is known as cold-start problem.

10.1145/3331184.3331268 article EN 2019-07-18

A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems

OPENALEX - Publications

Ling Pan Qingpeng Cai Zhixuan Fang Pingzhong Tang Longbo Huang

Bike sharing provides an environment-friendly way for traveling and is booming all over the world. Yet, due to high similarity of user travel patterns, bike imbalance problem constantly occurs, especially dockless systems, causing significant impact on service quality company revenue. Thus, it has become a critical task operators resolve such efficiently. In this paper, we propose novel deep reinforcement learning framework incentivizing users rebalance systems. We model as Markov decision...

10.1609/aaai.v33i01.33011393 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

Reinforcement Mechanism Design for e-commerce

OPENALEX - Publications

Qingpeng Cai Aris Filos-Ratsikas Pingzhong Tang Yiwei Zhang

We study the problem of allocating impressions to sellers in e-commerce websites, such as Amazon, eBay or Taobao, aiming maximize total revenue generated by platform. employ a general framework reinforcement mechanism design, which uses deep learning design efficient algorithms, taking strategic behaviour into account. Specifically, we model impression allocation Markov decision process, where states encode history impressions, prices, transactions and actions are possible allocations each...

10.1145/3178876.3186039 article EN 2018-01-01

Computer-aided proofs of Arrow's and other impossibility theorems

OPENALEX - Publications

Pingzhong Tang Fangzhen Lin

10.1016/j.artint.2009.02.005 article EN publisher-specific-oa Artificial Intelligence 2009-03-05

Reinforcement mechanism design

OPENALEX - Publications

Pingzhong Tang

We put forward a modeling and algorithmic framework to design optimize mechanisms in dynamic industrial environments where designer can make use of the data generated process automatically improve future design. Our solution, coined reinforcement mechanism design, is rooted game theory but incorporates recent AI techniques get rid nonrealistic assumptions automated optimization feasible. instantiate our on key application scenarios Baidu Taobao, two largest mobile app companies China. For...

10.24963/ijcai.2017/739 article EN 2017-07-28

Evolutionary Cooperation in Transboundary River Basins

OPENALEX - Publications

Yang Yu Pingzhong Tang Jianshi Zhao Bo Liu Dennis McLaughlin

Abstract Cooperation in transboundary river basins can make water resources systems more efficient and benefit riparian stakeholders. However, a basin with upstream downstream stakeholders that have different interests, noncooperative outcomes often been observed. These be described by one‐shot prisoners' dilemma game where noncooperation (defection) is dominant equilibrium strategy. cooperative also observed several settings, such as the Lancang‐Mekong River Basin Asia. Such cooperation...

10.1029/2019wr025608 article EN Water Resources Research 2019-10-24

Reinforcement Mechanism Design: With Applications to Dynamic Pricing in Sponsored Search Auctions

OPENALEX - Publications

Weiran Shen Binghui Peng Hanpeng Liu Michael Zhang Ruohan Qian and 5 more

In many social systems in which individuals and organizations interact with each other, there can be no easy laws to govern the rules of environment, agents' payoffs are often influenced by other actions. We examine such a system setting sponsored search auctions tackle engine's dynamic pricing problem combining tools from both mechanism design AI domain. this setting, environment not only changes over time, but also behaves strategically. Over repeated interactions bidders, engine...

10.1609/aaai.v34i02.5600 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Policy Gradients for Contextual Recommendations

OPENALEX - Publications

Feiyang Pan Qingpeng Cai Pingzhong Tang Fuzhen Zhuang Qing He

Decision making is a challenging task in online recommender systems. The decision maker often needs to choose contextual item at each step from set of candidates. Contextual bandit algorithms have been successfully deployed such applications, for the trade-off between exploration and exploitation state-of-art performance on minimizing costs. However, applicability existing methods limited by over-simplified assumptions problem, as assuming simple form reward function or static environment...

10.1145/3308558.3313616 preprint EN 2019-05-13

Optimal mechanisms with simple menus

OPENALEX - Publications

Zihe Wang Pingzhong Tang

We consider revenue-optimal mechanism design for the case with one buyer and two items. The buyer's valuations towards items are independent additive. In this setting, optimal is unknown general valuation distributions. obtain categories of structural results that shed light on mechanisms. These can be summarized into conclusion: under certain conditions, mechanisms have simple menus.

10.1145/2600057.2602863 article EN 2014-05-30

Reinforcement Mechanism Design for Fraudulent Behaviour in e-Commerce

OPENALEX - Publications

Qingpeng Cai Aris Filos-Ratsikas Pingzhong Tang Yiwei Zhang

In large e-commerce websites, sellers have been observed to engage in fraudulent behaviour, faking historical transactions order receive favourable treatment from the platforms, specifically through allocation of additional buyer impressions which results higher revenue for them, but not system as a whole. This emergent phenomenon has attracted considerable attention, with previous approaches focusing on trying detect illicit practices and punish miscreants. this paper, we employ principles...

10.1609/aaai.v32i1.11452 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-25

Warm Up Cold-start Advertisements: Improving CTR Predictions via Learning to Learn ID Embeddings

OPENALEX - Publications

Feiyang Pan Shuokai Li Xiang Ao Pingzhong Tang Qing He

10.48550/arxiv.1904.11547 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Optimal Auctions for Spiteful Bidders

OPENALEX - Publications

Pingzhong Tang Tüomas Sandholm

Designing revenue-optimal auctions for various settings is perhaps the most important, yet sometimes elusive, problem in mechanism design. Spiteful bidders have been intensely studied recently, especially because spite occurs many applications multiagent system and electronic commerce. We derive optimal auction such (as well as that are altruistic). It a generalization of Myerson’s (1981) auction. chooses an allocation maximizes agents’ virtual valuations, but generalized definition...

10.1609/aaai.v26i1.8235 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-09-20

Non-clairvoyant Dynamic Mechanism Design

OPENALEX - Publications

Vahab Mirrokni Renato Paes Leme Pingzhong Tang Song Zuo

Despite their better revenue and welfare guarantees for repeated auctions, dynamic mechanisms have not been widely adopted in practice. This is partly due to the complexity of implementation as well unrealistic use forecasting future periods. We address these shortcomings present a new family that are simple require no distribution knowledge

10.1145/3219166.3219224 article EN 2018-06-11

Optimal dynamic mechanisms with ex-post IR via bank accounts

OPENALEX - Publications

Vahab Mirrokni Renato Paes Leme Pingzhong Tang Song Zuo

Lately, the problem of designing multi-stage dynamic mechanisms has been shown to be both theoretically challenging and practically important. In this paper, we consider revenue optimal mechanism for a setting where an auctioneer sells set items buyer in multiple stages. At each stage, there could sale but item can only appear one stage. The type at stage is thus multi-dimensional vector characterizing buyer's valuations that assumed stage-wise independent. particular, propose novel class...

10.48550/arxiv.1605.08840 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Field-aware Calibration: A Simple and Empirically Strong Method for Reliable Probabilistic Predictions

OPENALEX - Publications

Feiyang Pan Xiang Ao Pingzhong Tang Min Lu Dapeng Liu and 2 more

It is often observed that the probabilistic predictions given by a machine learning model can disagree with averaged actual outcomes on specific subsets of data, which also known as issue miscalibration. responsible for unreliability practical systems. For example, in online advertising, an ad receive click-through rate prediction 0.1 over some population users where its click 0.15. In such cases, have to be fixed before system deployed.

10.1145/3366423.3380154 preprint EN 2020-04-20

DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis

OPENALEX - Publications

Chuheng Zhang Yuanqi Li Xi Chen Y. J. Jin Pingzhong Tang and 1 more

Modern machine learning models (such as deep neural networks and boosting decision tree models) have become increasingly popular in financial market prediction, due to their superior capacity extract complex non-linear patterns. However, since datasets very low signal-to-noise ratio are non-stationary, often prone overfitting suffer from instability issues. Moreover, various data mining tools more widely used quantitative trading, many trading firms been producing an increasing number of...

10.1109/icdm50108.2020.00087 article EN 2021 IEEE International Conference on Data Mining (ICDM) 2020-11-01

Discovering theorems in game theory: Two-person games with unique pure Nash equilibrium payoffs

OPENALEX - Publications

Pingzhong Tang Fangzhen Lin

10.1016/j.artint.2011.07.001 article EN publisher-specific-oa Artificial Intelligence 2011-07-22

Egalitarian pairwise kidney exchange: fast algorithms vialinear programming and parametric flow

OPENALEX - Publications

Jian Li Yicheng Liu Lingxiao Huang Pingzhong Tang

We revisit the pairwise kidney exchange problem established by Roth Sonmez and Unver [23]. Our goal, explained in terms of graph theory, is to find a maximum fractional matching on an undirected graph, that Lorenz-dominates any other matching. The Lorenz-dominant matching, which can be implemented as lottery integral matchings, some sense fairest allocation also enjoys property being incentive compatible. original algorithm et al. runs time exponential size input. In this paper, we target at...

10.5555/2615731.2615804 article EN Adaptive Agents and Multi-Agents Systems 2014-05-05

Optimal mechanisms with simple menus

OPENALEX - Publications

Pingzhong Tang Zihe Wang

10.1016/j.jmateco.2017.01.002 article EN Journal of Mathematical Economics 2017-01-10

Learning Optimal Strategies to Commit To

OPENALEX - Publications

Binghui Peng Weiran Shen Pingzhong Tang Song Zuo

Over the past decades, various theories and algorithms have been developed under framework of Stackelberg games part these innovations fielded scenarios national security defenses wildlife protections. However, one remaining difficulties in literature is that most theoretical works assume full information payoff matrices, while applications, leader often has no prior knowledge about follower’s matrix, but may gain utility function through repeated interactions. In this paper, we study...

10.1609/aaai.v33i01.33012149 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

Non‐Clairvoyant Dynamic Mechanism Design

OPENALEX - Publications

Vahab Mirrokni Renato Paes Leme Pingzhong Tang Song Zuo

We introduce a new family of dynamic mechanisms that restricts sellers from using future distributional knowledge. Since the allocation and pricing each auction period do not depend on type distributions periods, we call this non‐clairvoyant. develop framework (bank account mechanisms) for characterizing, designing, proving lower bounds (clairvoyant or non‐clairvoyant). use same methods to compare revenue extraction power clairvoyant non‐clairvoyant mechanisms.

10.3982/ecta15530 article EN Econometrica 2020-01-01

Computational Issues in Time-Inconsistent Planning

OPENALEX - Publications

Pingzhong Tang Yifeng Teng Zihe Wang Shenke Xiao Yichong Xu

Time-inconsistency refers to a paradox in decision making where agents exhibit inconsistent behaviors over time. Examples are procrastination tend postpone easy tasks, and abandonments start plan quit the middle. To capture such quantify inefficiency caused by behaviors, Kleinberg Oren (2014) propose graph model with certain cost structure initiate study of several interesting computation problems: 1) ratio: worst ratio between actual agent optimal cost, all instances; 2) motivating...

10.1609/aaai.v31i1.11017 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2017-02-12

Coming Soon ...