NFDI4DS | UHH-SEMS - Publication Details

Yijie Peng

ORCID: 0000-0003-2584-8131

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5005503619

Research Areas

Simulation Techniques and Applications
Advanced Statistical Process Monitoring
Advanced Multi-Objective Optimization Algorithms
Statistical Methods and Inference
Stochastic processes and financial applications
Adversarial Robustness in Machine Learning
Reinforcement Learning in Robotics
Neural Networks and Applications
Financial Risk and Volatility Modeling
Optimal Experimental Design Methods
Anomaly Detection Techniques and Applications
Reservoir Engineering and Simulation Methods
Advanced Bandit Algorithms Research
Probabilistic and Robust Engineering Design
Risk and Portfolio Optimization
Healthcare Operations and Scheduling Optimization
Scheduling and Optimization Algorithms
Manufacturing Process and Optimization
Machine Learning and Algorithms
Supply Chain and Inventory Management
Auction Theory and Applications
Complex Network Analysis Techniques
Data Management and Algorithms
Monetary Policy and Economic Impact
Markov Chains and Monte Carlo Methods

Peking University
2017-2024

Beijing Academy of Artificial Intelligence
2024

Shanghai Zhangjiang Laboratory
2024

King University
2024

Institute of Computing Technology
2021

Chinese Academy of Sciences
2021

Tsinghua University
2021

Fudan University
2012-2017

George Mason University
2017

Dallas County
2013

Ranking and Selection as Stochastic Control

OPENALEX - Publications

Yijie Peng Edwin K. P. Chong Chun‐Hung Chen Michael C. Fu

Under a Bayesian framework, we formulate the fully sequential sampling and selection decision in statistical ranking as stochastic control problem, derive associated Bellman equation. Using value function approximation, an approximately optimal allocation policy. We show that this policy is not only computationally efficient but also possesses both one-step-ahead asymptotic optimality for independent normal distributions. Moreover, proposed easily generalizable approximate dynamic...

10.1109/tac.2018.2797188 article EN IEEE Transactions on Automatic Control 2018-01-23

A New Unbiased Stochastic Derivative Estimator for Discontinuous Sample Performances with Structural Parameters

OPENALEX - Publications

Yijie Peng Michael C. Fu Jian-Qiang Hu Bernd Heidergott

In this paper, we propose a new unbiased stochastic derivative estimator in framework that can handle discontinuous sample performances with structural parameters. This work extends the three most popular estimators: (1) infinitesimal perturbation analysis (IPA), (2) likelihood ratio (LR) method, and (3) weak to setting where they did not previously apply. Examples probability constraints, control charts, financial derivatives demonstrate broad applicability of proposed framework. The...

10.1287/opre.2017.1674 article EN Operations Research 2018-02-02

Dynamic Sampling Allocation and Design Selection

OPENALEX - Publications

Yijie Peng Chun-Hung Chen Michael C. Fu Jian-Qiang Hu

We formulate the statistical selection problem in a general dynamic framework comprising fully sequential sampling allocation and optimal design selection. Because traditional probability of correct measure is not sufficient to capture both aspects this more framework, we introduce integrated better characterize objective. As result, usual policy choosing with largest sample mean as estimate best no longer necessarily optimal. Rather, choose that maximizes posterior selection, which function...

10.1287/ijoc.2015.0673 article EN INFORMS journal on computing 2016-02-17

Myopic Allocation Policy With Asymptotically Optimal Sampling Rate

OPENALEX - Publications

Yijie Peng Michael C. Fu

In this note, we consider the statistical ranking and selection problem of finding best alternative when performances each must be estimated by sampling. We provide a myopic allocation policy that asymptotically achieves sampling ratios given optimal computing budget allocation, an approximate solution large deviations rate for decreasing probability false selection. analyze asymptotic ratio both known variances unknown under Bayesian framework. Numerical results substantiate theoretical results.

10.1109/tac.2016.2592378 article EN publisher-specific-oa IEEE Transactions on Automatic Control 2016-07-18

Production Planning with Generalized Production Relationships

OPENALEX - Publications

Xiaotian Liu Christos Alexopoulos Yijie Peng

10.2139/ssrn.5167613 preprint EN 2025-01-01

Efficient Learning for Clustering and Optimizing Context-Dependent Designs

OPENALEX - Publications

Haidong Li Henry Lam Yijie Peng

Contextual simulation optimization problems have attracted great attention in the healthcare, commercial, and financial fields because of need for personalized decision making. Besides randomness outputs, larger solution space makes learning more challenging. In current work, Li, Lam, Peng use a Gaussian mixture model (GMM) as basic technique to deal with this difficulty. To address computational challenge updating GMM-based Bayesian posterior, they present computationally efficient...

10.1287/opre.2022.2368 article EN Operations Research 2022-09-23

A Q-learning algorithm for Markov decision processes with continuous state spaces

OPENALEX - Publications

Jiaqiao Hu Xiangyu Yang Jian-Qiang Hu Yijie Peng

10.1016/j.sysconle.2024.105782 article EN publisher-specific-oa Systems & Control Letters 2024-03-26

Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer

OPENALEX - Publications

Tao Ren Ziyue Zhang Zehao Li Jingyang Jiang S. Joe Qin and 6 more

The probabilistic diffusion model (DM), generating content by inferencing through a recursive chain structure, has emerged as powerful framework for visual generation. After pre-training on enormous unlabeled data, the needs to be properly aligned meet requirements downstream applications. How efficiently align foundation DM is crucial task. Contemporary methods are either based Reinforcement Learning (RL) or truncated Backpropagation (BP). However, RL and BP suffer from low sample...

10.48550/arxiv.2502.00639 preprint EN arXiv (Cornell University) 2025-02-01

Asymptotically Optimal Sampling Policy for Selecting Top-m Alternatives

OPENALEX - Publications

Gongbo Zhang Yijie Peng Jianghua Zhang Enlu Zhou

We consider selecting the top-m alternatives from a finite number of via Monte Carlo simulation. Under Bayesian framework, we formulate sampling decision as stochastic dynamic programming problem and develop sequential policy that maximizes value function approximation one-step look ahead. To show asymptotic optimality proposed procedure, asymptotically optimal ratios optimize large deviations rate probability false selection for have been rigorously defined. The is not only proved to be...

10.1287/ijoc.2021.0333 article EN INFORMS journal on computing 2023-08-21

Efficient Simulation Resource Sharing and Allocation for Selecting the Best

OPENALEX - Publications

Yijie Peng Chun‐Hung Chen Michael C. Fu Jian-Qiang Hu

Common random numbers and the standard clock method are examples of effective variance reduction techniques that also share information simulation resources when generating realizations different simulated systems whose performances being compared. This sharing computing potentially widely computational requirements for models important considerations in allocating replications among candidate designs with objective maximizing probability selecting best design, we formulate optimal budget...

10.1109/tac.2012.2215533 article EN IEEE Transactions on Automatic Control 2012-08-30

Efficient Simulation Sampling Allocation Using Multifidelity Models

OPENALEX - Publications

Yijie Peng Jie Xu Loo Hay Lee Jian-Qiang Hu Chun‐Hung Chen

Simulation is often used to estimate the performance of alternative system designs for selecting best. For a complex system, high-fidelity simulation usually time-consuming and expensive. In this paper, we provide new framework that integrates information from multifidelity models increase efficiency A Gaussian mixture model introduced capture clustering in models. Posterior obtained by analysis incorporates both cluster-wise idiosyncratic each design. We propose budget allocation method...

10.1109/tac.2018.2886165 article EN publisher-specific-oa IEEE Transactions on Automatic Control 2018-12-10

Maximum Likelihood Estimation by Monte Carlo Simulation: Toward Data-Driven Stochastic Modeling

OPENALEX - Publications

Yijie Peng Michael C. Fu Bernd Heidergott Henry Lam

A Simulation-Based Approach for Calibrating Stochastic Models

10.1287/opre.2019.1978 article EN Operations Research 2020-10-26

A Stochastic Approximation Method for Simulation-Based Quantile Optimization

OPENALEX - Publications

Jiaqiao Hu Yijie Peng Gongbo Zhang Qi Zhang

We present a gradient-based algorithm for solving class of simulation optimization problems in which the objective function is quantile output random variable. In contrast with existing (quantile derivative) estimation techniques, aim to eliminate estimator bias by gradually increasing sample size, our incorporates novel recursive procedure that only requires single at each step simultaneously obtain and derivative estimators are asymptotically unbiased. show these estimators, when coupled...

10.1287/ijoc.2022.1214 article EN INFORMS journal on computing 2022-07-22

Gradient-Based Myopic Allocation Policy: An Efficient Sampling Procedure in a Low-Confidence Scenario

OPENALEX - Publications

Yijie Peng Chun‐Hung Chen Michael C. Fu Jian-Qiang Hu

In this note, we study a simulation optimization problem of selecting the alternative with best performance from finite set, or so-called ranking and selection problem, in special low-confidence scenario. The most popular sampling allocation procedures do not perform well scenario, because they all ignore certain induced correlations that significantly affect probability correct We propose gradient-based myopic policy takes into account, reflecting tradeoff between correlation two factors...

10.1109/tac.2017.2776606 article EN publisher-specific-oa IEEE Transactions on Automatic Control 2017-11-22

Solving Inventory Management Problems through Deep Reinforcement Learning

OPENALEX - Publications

Qinghao Wang Yijie Peng Yaodong Yang

10.1007/s11518-022-5544-6 article EN Journal of Systems Science and Systems Engineering 2022-12-01

Noise Optimization in Artificial Neural Networks

OPENALEX - Publications

Li Xiao Zeliang Zhang Kuihua Huang Jinyang Jiang Yijie Peng

Artificial neural network (ANN) has been widely used in automation. However, the vulnerability of ANN under certain attacks poses a security threat to critical automation systems. Previous research shown that adding noise ANNs can enhance robustness. Nonetheless, striking balance between robustness and task performance remains challenging, as excessive improves but hampers performance, while low offers minor improvement. In this work, we propose learn distribution optimal injected noise,...

10.1109/tase.2024.3384409 article EN IEEE Transactions on Automation Science and Engineering 2024-04-08

Dynamic Sampling Allocation Under Finite Simulation Budget for Feasibility Determination

OPENALEX - Publications

Zhongshun Shi Yijie Peng Leyuan Shi Chun‐Hung Chen Michael C. Fu

Monte Carlo simulation is a commonly used tool for evaluating the performance of complex stochastic systems. In practice, can be expensive, especially when comparing large number alternatives, thus motivating need to intelligently allocate replications. Given finite set alternatives whose means are estimated via simulation, we consider problem determining subset that have smaller than fixed threshold. A dynamic sampling procedure possesses not only asymptotic optimality, but also desirable...

10.1287/ijoc.2020.1057 article EN INFORMS journal on computing 2021-03-23

Applications of generalized likelihood ratio method to distribution sensitivities and steady-state simulation

OPENALEX - Publications

Lei Lei Yijie Peng Michael C. Fu Jian-Qiang Hu

10.1007/s10626-017-0247-8 article EN Discrete Event Dynamic Systems 2017-05-17

Non-monotonicity of probability of correct selection

OPENALEX - Publications

Yijie Peng Chun‐Hung Chen Michael C. Fu Jian-Qiang Hu

In Peng et al. (2015b), we show that the probability of correct selection (PCS), a commonly used metric, is not necessarily monotonically increasing with respect to number simulation replications. A simple counterexample where PCS may decrease additional sampling provided motivate problem. The reference identifies induced correlations as source non-monotonicity, and characterizes general scenario under which phenomenon occurs by condition coefficient variations difference in sample means are...

10.1109/wsc.2015.7408526 article EN 2018 Winter Simulation Conference (WSC) 2015-12-01

Efficient Sampling Allocation Procedures for Optimal Quantile Selection

OPENALEX - Publications

Yijie Peng Chun-Hung Chen Michael C. Fu Jian-Qiang Hu Ilya O. Ryzhov

We propose a dynamic sampling allocation and selection paradigm for finding the alternative with optimal quantile in Bayesian framework. Myopic policies (MAPs), analogous to existing methods classic ranking selecting mean, computationally efficient are derived quantile. Under certain conditions, we prove that proposed MAPs procedures consistent, which means best would be eventually correctly selected as sample size goes infinity. Numerical experiments demonstrate schemes can significantly...

10.1287/ijoc.2019.0946 article EN INFORMS journal on computing 2020-06-25

An Efficient Dynamic Sampling Policy for Monte Carlo Tree Search

OPENALEX - Publications

Gongbo Zhang Yijie Peng Yilong Xu

We consider the popular tree-based search strategy within framework of reinforcement learning, Monte Carlo Tree Search (MCTS), in context finite-horizon Markov decision process. propose a dynamic sampling tree policy that efficiently allocates limited computational budget to maximize probability correct selection best action at root node tree. Experimental results on Tic-Tac-Toe and Gomoku show proposed is more efficient than other competing methods.

10.1109/wsc57314.2022.10015374 article EN 2018 Winter Simulation Conference (WSC) 2022-12-11

Code and Data Repository for An Efficient Node Selection Policy for Monte Carlo Tree Search with Neural Networks

OPENALEX - Publications

Xiaotian Liu Yijie Peng Gongbo Zhang Ruihan Zhou

The software and data in this repository are a snapshot of the that were used research reported on paper An Efficient Node Selection Policy for Monte Carlo Tree Search with Neural Networks by Xiaotian Liu, Yijie Peng, Gongbo Zhang, Ruihan Zhou.

10.1287/ijoc.2023.0307.cd article EN INFORMS journal on computing 2024-09-18

Gradient-based simulated maximum likelihood estimation for stochastic volatility models using characteristic functions

OPENALEX - Publications

Yijie Peng Michael C. Fu Jian-Qiang Hu

Parameter estimation and statistical inference are challenging problems for stochastic volatility (SV) models, especially those driven by pure jump Lévy processes. Maximum likelihood (MLE) is usually preferred when a parametric model correctly specified, but traditional MLE implementation SV models computationally infeasible due to high dimensionality of the integral involved. To overcome this difficulty, we propose gradient-based simulated method under hidden Markov structure which covers...

10.1080/14697688.2016.1185142 article EN Quantitative Finance 2016-06-07

Efficient Learning for Selecting Top-$m$ Context-Dependent Designs

OPENALEX - Publications

Gongbo Zhang Sihua Chen Kuihua Huang Yijie Peng

We consider a simulation optimization problem for context-dependent decision-making, which aims to determine the top- <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$m$</tex-math> </inline-formula> designs all contexts. Under Bayesian framework, we formulate optimal dynamic sampling decision as stochastic programming and develop sequential policy efficiently learn performance of each design under context....

10.1109/tase.2024.3391020 article EN IEEE Transactions on Automation Science and Engineering 2024-04-29

Simulation Budget Allocation for Improving Scheduling and Routing of Automated Guided Vehicles in Warehouse Management

OPENALEX - Publications

Gongbo Zhang Haobin Li Xiao-Tian Liu Yijie Peng

10.1007/s40305-024-00553-0 article EN Journal of the Operations Research Society of China 2024-07-29

Coming Soon ...