NFDI4DS | UHH-SEMS - Publication Details

Learning to schedule multi-NUMA virtual machines via reinforcement learning

OPENALEX - Publications

Junjie Sheng Yiqiu Hu Wenli Zhou Lei Zhu Bo Jin and 2 more

10.1016/j.patcog.2021.108254 article EN Pattern Recognition 2021-08-13

Learning structured communication for multi-agent reinforcement learning

Junjie Sheng Xiangfeng Wang Bo Jin Junchi Yan Wenhao Li and 3 more

10.1007/s10458-022-09580-8 article EN Autonomous Agents and Multi-Agent Systems 2022-08-26

GraphThought: Graph Combinatorial Optimization with Thought Generation

Zixiao Huang Lifeng Guo Junjie Sheng Haosheng Chen Wenhao Li and 3 more

Large language models (LLMs) have demonstrated remarkable capabilities across various domains, especially in text processing and generative tasks. Recent advancements in the reasoning of state-of-the-art LLMs, such as OpenAI-o1, have significantly broadened their applicability, particularly in complex problem-solving and logical inference. However, most existing LLMs struggle with notable limitations in handling graph combinatorial optimization (GCO) problems. To bridge this gap, we formally define Optimal...

10.48550/arxiv.2502.11607 preprint EN arXiv (Cornell University) 2025-02-17

SolSearch: An LLM-Driven Framework for Efficient SAT-Solving Code Generation

Junjie Sheng Lin Ye J.Z. Wu Yanhong Huang Jianqi Shi and 2 more

The Satisfiability (SAT) problem is a core challenge with significant applications in software engineering, including automated testing, configuration management, and program verification. This paper presents SolSearch, a novel framework that harnesses large language models (LLMs) to discover and optimize SAT-solving strategies automatically. Leveraging a curriculum-based, trial-and-error process, SolSearch enables the LLM to iteratively modify and generate SAT solver code, thereby improving solving...

10.48550/arxiv.2502.14328 preprint EN arXiv (Cornell University) 2025-02-20

Learning Structured Communication for Multi-agent Reinforcement Learning

Junjie Sheng Xiangfeng Wang Bo Jin Junchi Yan Wenhao Li and 3 more

This work explores the large-scale multi-agent communication mechanism under a multi-agent reinforcement learning (MARL) setting. We summarize the general categories of topology for communication structures in the MARL literature, which are often manually specified. We then propose a novel framework termed Learning Structured Communication (LSC), which uses a more flexible and efficient communication topology. Our framework allows adaptive agent grouping to form different hierarchical formations over episodes, which is generated by an auxiliary task combined...

10.48550/arxiv.2002.04235 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning

Junjie Sheng Lu Wang Fangkai Yang Bo Qiao Hang Dong and 7 more

Oversubscription is a common practice for improving cloud resource utilization. It allows the cloud service provider to sell more resources than the physical limit, assuming not all users would fully utilize the resources simultaneously. However, how to design an oversubscription policy that improves utilization while satisfying some safety constraints remains an open problem. Existing methods and industrial practices are over-conservative, ignoring the coordination of diverse resource usage patterns and probabilistic constraints. To...

10.1145/3543507.3583298 article EN Proceedings of the ACM Web Conference 2023 2023-04-26

Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning

Lu Wang Mayukh Das Fangkai Yang Junjie Sheng Bo Qiao and 9 more

Oversubscription is a prevalent practice in cloud services where the system offers more virtual resources, such as virtual cores in virtual machines, to users or applications than its available physical capacity, in order to reduce revenue loss due to unused/redundant capacity. While oversubscription can potentially lead to a significant enhancement in efficient resource utilization, the caveat is that it comes with the risks of overloading and introducing jitter at the level of physical nodes if all the co-located virtual machines have high utilization. Thus...

10.48550/arxiv.2401.07033 preprint EN other-oa arXiv (Cornell University) 2024-01-01

VMAgent: Scheduling Simulator for Reinforcement Learning

Junjie Sheng Shengliang Cai Haochuan Cui Wenhao Li Yun Hua and 7 more

A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling. VMAgent is inspired by practical virtual machine (VM) scheduling tasks and provides an efficient simulation platform that can reflect the real situations of cloud computing. Three scenarios (fading, recovering, and expansion) are drawn from practical cloud computing and correspond to many reinforcement learning challenges (high-dimensional state and action spaces, high non-stationarity, and life-long demand)....

10.48550/arxiv.2112.04785 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Obtaining Dyadic Fairness by Optimal Transport

Moyi Yang Junjie Sheng Wenyan Liu Bo Jin Xiaoling Wang and 1 more

Fairness has been taken as a critical metric in machine learning models and is considered an important component of trustworthy machine learning. In this paper, we focus on obtaining fairness for popular link prediction tasks, which is measured by dyadic fairness. A novel pre-processing methodology is proposed to establish dyadic fairness through data repairing based on optimal transport theory. With the well-established theoretical connection between dyadic fairness for graph link prediction and a conditional distribution alignment problem, the repairing scheme can be...

10.1109/bigdata55660.2022.10020550 article EN 2022 IEEE International Conference on Big Data (Big Data) 2022-12-17

Dealing with Non-Stationarity in MARL via Trust-Region Decomposition

Wenhao Li Xiangfeng Wang Bo Jin Junjie Sheng Hongyuan Zha

Non-stationarity is one thorny issue in cooperative multi-agent reinforcement learning (MARL). One of the reasons is the policy changes of agents during the learning process. Some existing works have discussed various consequences caused by non-stationarity with several kinds of measurement indicators. This makes the objectives or goals of existing algorithms inevitably inconsistent and disparate. In this paper, we introduce a novel notion, the $\delta$-measurement, to explicitly measure the non-stationarity of a policy sequence, which can be further proved to be bounded...

10.48550/arxiv.2102.10616 preprint EN other-oa arXiv (Cornell University) 2021-01-01

VMAgent: A Practical Virtual Machine Scheduling Platform

Junjie Sheng Shengliang Cai Haochuan Cui Wenhao Li Yun Hua and 7 more

Virtual machine (VM) scheduling is one of the critical tasks in cloud computing. Many works have attempted to incorporate machine learning, especially reinforcement learning, to empower VM scheduling procedures. Although improved results are shown in several demo simulators, the performances in real-world scenarios are still underexploited. In this paper, we design a practical VM scheduling platform, i.e., VMAgent, to assist researchers in developing their methods on the VM scheduling problem. VMAgent consists of three components: simulator, scheduler, and visualizer. The...

10.24963/ijcai.2022/860 article EN Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022-07-01

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

Wenhao Li Xiangfeng Wang Bo Jin Junjie Sheng Yun Hua and 1 more

When solving a complex task, humans will spontaneously form teams to complete different parts of the whole task, respectively. Meanwhile, the cooperation between teammates will improve efficiency. However, for current cooperative MARL methods, the team is constructed through either heuristics or end-to-end blackbox optimization. In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named Rochico, based on reinforced organization control and hierarchical consensus...

10.48550/arxiv.2102.04775 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Oil Detection Fault Tree Analysis Based on Improved Expert’s Own Weight–Aggregate Fuzzy Number

Junjie Sheng Haijun Wei

Oil detection technology improves the reliability of machinery or equipment. The physical and chemical indicators of the fluid can reflect the cause of failure in various aspects, which can prevent major accidents to the greatest extent by setting up a fault tree. Owing to the lack of data, it is difficult to accurately obtain basic event probabilities, which makes it difficult to diagnose faults. The expert evaluation method and aggregated fuzzy numbers are used to obtain the exact probability, where the probability is evaluated as the subjective will of the expert. To improve...

10.3390/lubricants11020062 article EN cc-by Lubricants 2023-02-02

Negotiated Reasoning: On Provably Addressing Relative Over-Generalization

Junjie Sheng Wenhao Li Bo Jin Hongyuan Zha Jun Wang and 1 more

Over-generalization is a thorny issue in cognitive science, where people may become overly cautious due to past experiences. Agents in multi-agent reinforcement learning (MARL) have also been found to suffer from relative over-generalization (RO), as people do, and become stuck in sub-optimal cooperation. Recent methods have shown that assigning reasoning ability to agents can mitigate RO algorithmically and empirically, but there has been a lack of theoretical understanding of RO, let alone designing provably RO-free methods. This paper...

10.48550/arxiv.2306.05353 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Research on the Optimal Dispatching Model of Smart Energy System Based on the Introduction of Risk Factor Master-Slave Game Model

Chuanyong Ye Z.L. Liu Shiyan Zhao Tiancheng Xie Junjie Sheng

Minimizing the impact of clean energy volatility and maximizing the benefits to owners are hot issues to be solved in current smart energy systems. This paper introduces a risk factor based on master-slave game theory, establishes a leader-follower optimal scheduling model of the smart energy system in which end-users are the followers, and uses the conditional value-at-risk theory from economics to quantitatively analyze the risk cost brought by the uncertainty of clean energy output before the day. Using this risk factor, a two-tier optimal dispatching model for smart energy systems is established based on the introduction of the risk factor master-slave game model....

10.1109/eeps58791.2023.10257095 article EN 2023-07-28

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

Junjie Sheng Zixiao Huang Chuyun Shen Wenhao Li Yun Hua and 3 more

The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks? To investigate this, we first take environments collected in OpenAI Gym as our testbeds and ground them to textual environments that construct the TextGym simulator. This allows for straightforward and efficient comparisons between PPO agents and language agents, given the widespread adoption of OpenAI Gym. To ensure fair and effective benchmarking, we introduce 5 levels of scenario...

10.48550/arxiv.2312.03290 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

Precise Prediction Method of Residential Load Characteristics Based on ARIMA-BPNN Combination Model

Junjie Sheng Jie Yuan Mengjie Hu Bolun Yu Shuyue Feng

The complexity and randomness of power load data mean that the prediction accuracy of a single forecasting model cannot meet the requirements of the current power grid. In this paper, aiming at the inherent nonlinear characteristics of residential load data, an accurate prediction method based on the ARIMA-BPNN combined model is proposed. Firstly, by treating the data sequence formed by resident load over time as a random sequence, the ARIMA model is used to approximate the sequence; secondly, since the time series model has the defect of "neglecting nonlinear characteristics", a BP neural network is introduced for law mining to establish...

10.1109/eiecs59936.2023.10435541 article EN 2023-09-22

Obtaining Dyadic Fairness by Optimal Transport

Moyi Yang Junjie Sheng Xiangfeng Wang Wenyan Liu Bo Jin and 2 more

Fairness has been taken as a critical metric in machine learning models and is considered an important component of trustworthy machine learning. In this paper, we focus on obtaining fairness for popular link prediction tasks, which is measured by dyadic fairness. A novel pre-processing methodology is proposed to establish dyadic fairness through data repairing based on optimal transport theory. With the well-established theoretical connection between dyadic fairness for graph link prediction and a conditional distribution alignment problem, the repairing scheme can be...

10.48550/arxiv.2202.04520 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning

Junjie Sheng Lu Wang Fangkai Yang Bo Qiao Hang Dong and 7 more

Oversubscription is a common practice for improving cloud resource utilization. It allows the cloud service provider to sell more resources than the physical limit, assuming not all users would fully utilize the resources simultaneously. However, how to design an oversubscription policy that improves utilization while satisfying some safety constraints remains an open problem. Existing methods and industrial practices are over-conservative, ignoring the coordination of diverse resource usage patterns and probabilistic constraints. To...

10.48550/arxiv.2211.11759 preprint EN cc-by-nc-sa arXiv (Cornell University) 2022-01-01

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

Haochuan Cui Junjie Sheng Bo Jin Yiqiu Hu Su Li and 3 more

With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences. By analyzing the impact of request heterogeneity on some popular heuristic schedulers, it can be found that existing scheduling algorithms cannot handle request heterogeneity properly and efficiently. In this paper, a plug-and-play virtual machine scheduling intensifier, called Resource Assigner (ReAssigner), is proposed to enhance the scheduling efficiency of any given scheduler...

10.48550/arxiv.2211.16227 preprint EN other-oa arXiv (Cornell University) 2022-01-01

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

Haochuan Cui Junjie Sheng Bo Jin Yiqiu Hu Su Li and 3 more

With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences. By analyzing the impact of request heterogeneity on some popular heuristic schedulers, it can be found that existing scheduling algorithms cannot handle request heterogeneity properly and efficiently. In this paper, a plug-and-play virtual machine scheduling intensifier, called Resource Assigner (ReAssigner), is proposed to enhance the scheduling efficiency of any given scheduler...

10.1109/bigdata55660.2022.10021058 article EN 2022 IEEE International Conference on Big Data (Big Data) 2022-12-17