NFDI4DS | UHH-SEMS - Publication Details

Peng Zhang

ORCID: 0000-0001-7973-2746

About

Contact & Profiles

A5100364041

Research Areas

Advanced Graph Neural Networks
Data Stream Mining Techniques
Complex Network Analysis Techniques
Anomaly Detection Techniques and Applications
Topic Modeling
Graph Theory and Algorithms
Machine Learning and Data Classification
Recommender Systems and Techniques
Time Series Analysis and Forecasting
Network Security and Intrusion Detection
Spam and Phishing Detection
Data Management and Algorithms
Bioinformatics and Genomic Networks
Imbalanced Data Classification Techniques
Machine Learning and Algorithms
Human Mobility and Location-Based Analysis
Domain Adaptation and Few-Shot Learning
Multimodal Machine Learning Applications
Caching and Content Delivery
Face and Expression Recognition
Text and Document Classification Technologies
Privacy-Preserving Technologies in Data
Organic Light-Emitting Diodes Research
Bayesian Methods and Mixture Models
Machine Learning and ELM

Nanjing University of Posts and Telecommunications
2024-2025

Guangzhou University
2021-2024

Nanjing University of Science and Technology
2022-2024

Beijing Institute of Big Data Research
2024

Association for Computing Machinery
2023

Chinese Academy of Sciences
2009-2022

Institute of Information Engineering
2014-2022

Northeastern University
2022

Chongqing University
2022

Shandong Agricultural University
2022

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

OPENALEX - Publications

DeepSeek-AI Daya Guo Dejian Yang Haowei Zhang Junxiao Song and 95 more

We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrates remarkable reasoning capabilities. Through RL, DeepSeek-R1-Zero naturally emerges with numerous powerful and intriguing reasoning behaviors. However, it encounters challenges such as poor readability and language mixing. To address these issues and further enhance reasoning performance, we introduce DeepSeek-R1, which incorporates...

10.48550/arxiv.2501.12948 preprint EN arXiv (Cornell University) 2025-01-22

Graph Neural Architecture Search

Yang Gao Hong Yang Peng Zhang Chuan Zhou Yue Hu

Graph neural networks (GNNs) emerged recently as a powerful tool for analyzing non-Euclidean data such as social network data. Despite their success, the design of graph neural networks requires heavy manual work and domain knowledge. In this paper, we present a graph neural architecture search method (GraphNAS) that enables automatic design of the best graph neural architecture based on reinforcement learning. Specifically, GraphNAS uses a recurrent network to generate variable-length strings that describe the architectures of graph neural networks, and trains the recurrent network with policy gradient to maximize the expected...

10.24963/ijcai.2020/195 article EN 2020-07-01

Multiview Privileged Support Vector Machines

Jingjing Tang Yingjie Tian Peng Zhang Xiaohui Liu

Multiview learning (MVL), by exploiting the complementary information among multiple feature sets, can improve the performance of many existing learning tasks. Support vector machine (SVM)-based models have been frequently used for MVL. A typical SVM-based MVL model is SVM-2K, which extends SVM for MVL by using the distance minimization version of kernel canonical correlation analysis. However, SVM-2K cannot fully unleash the power of the different views. Recently, a framework of learning using privileged information (LUPI) has been proposed to model data with complementary information....

10.1109/tnnls.2017.2728139 article EN IEEE Transactions on Neural Networks and Learning Systems 2017-08-11

Binarized attributed network embedding

Hong Yang Shirui Pan Peng Zhang Ling Chen Defu Lian and 1 more

Attributed network embedding enables joint representation learning of node links and attributes. Existing attributed network embedding models are designed in continuous Euclidean spaces, which often introduce data redundancy and impose challenges to storage and computation costs. To this end, we present a Binarized Attributed Network Embedding model (BANE for short) to learn binary node representation. Specifically, we define a new Weisfeiler-Lehman proximity matrix to capture data dependence between node links and attributes by aggregating the information from...

10.1109/icdm.2018.8626170 article EN 2018 IEEE International Conference on Data Mining (ICDM) 2018-11-01

Active Learning from Data Streams

Xingquan Zhu Peng Zhang Xiaodong Lin Yong Shi

In this paper, we address a new research problem on active learning from data streams where data volumes grow continuously and labeling all data is considered expensive and impractical. The objective is to label a small portion of stream data from which a model is derived to predict newly arrived instances as accurately as possible. In order to tackle the challenges raised by data streams' dynamic nature, we propose a classifier ensembling based active learning framework which selectively labels instances from data streams to build an accurate classifier. A minimal variance principle is introduced to guide instance...

10.1109/icdm.2007.101 article EN 2007-10-01

GraphNAS: Graph Neural Architecture Search with Reinforcement Learning

Yang Gao Hong Yang Peng Zhang Chuan Zhou Yue Hu

Graph Neural Networks (GNNs) have been popularly used for analyzing non-Euclidean data such as social network data and biological data. Despite their success, the design of graph neural networks requires a lot of manual work and domain knowledge. In this paper, we propose a Graph Neural Architecture Search method (GraphNAS for short) that enables automatic search of the best graph neural architecture based on reinforcement learning. Specifically, GraphNAS first uses a recurrent network to generate variable-length strings that describe the architectures...

10.48550/arxiv.1904.09981 preprint EN other-oa arXiv (Cornell University) 2019-01-01

When Ensemble Learning Meets Deep Learning: a New Deep Support Vector Machine for Classification

Zhiquan Qi Bo Wang Yingjie Tian Peng Zhang

10.1016/j.knosys.2016.05.055 article EN Knowledge-Based Systems 2016-05-30

Online Learning from Trapezoidal Data Streams

Qin Zhang Peng Zhang Guodong Long Wei Ding Chengqi Zhang and 1 more

In this paper, we study a new problem of continuous learning from doubly-streaming data where both data volume and feature space increase over time. We refer to the data as trapezoidal data streams and to the corresponding problem as online learning from trapezoidal data streams. The problem is challenging because both data volume and data dimension increase over time, and existing online learning [1], [2], online feature selection [3], and streaming feature selection algorithms [4], [5] are inapplicable. We propose a new Online Learning with Streaming Features...

10.1109/tkde.2016.2563424 article EN IEEE Transactions on Knowledge and Data Engineering 2016-05-05

Subgraph Neighboring Relations Infomax for Inductive Link Prediction on Knowledge Graphs

Xiaohan Xu Peng Zhang Yongquan He Chengpeng Chao Chaoyang Yan

Inductive link prediction for knowledge graphs aims at predicting missing links between unseen entities, those not shown in the training stage. Most previous works learn entity-specific embeddings of entities, which cannot handle unseen entities. Several recent methods utilize the enclosing subgraph to obtain inductive ability. However, all these works only consider the enclosing part of the subgraph without complete neighboring relations, which leads to the issue that partial neighboring relations are neglected, and sparse subgraphs are hard to be handled. To address that, we...

10.24963/ijcai.2022/325 article EN Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022-07-01

HGNAS++: Efficient Architecture Search for Heterogeneous Graph Neural Networks

Yang Gao Peng Zhang Chuan Zhou Hong Yang Zhao Li and 2 more

Heterogeneous graphs are commonly used to describe networked data with multiple types of nodes and edges. Heterogeneous Graph Neural Networks (HGNNs) are powerful tools for analyzing heterogeneous graphs. However, designing neural architectures for HGNNs requires extensive domain knowledge and time-consuming manual work. Recently, neural architecture search algorithms have become popular in automatically designing neural architectures for homogeneous graph neural networks. In this paper, we present a Heterogeneous Graph Neural Architecture Search algorithm (HGNAS for short) which allows the...

10.1109/tkde.2023.3239842 article EN IEEE Transactions on Knowledge and Data Engineering 2023-02-07

Classifier and Cluster Ensembles for Mining Concept Drifting Data Streams

Peng Zhang Xingquan Zhu Jianlong Tan Li Guo

Ensemble learning is a commonly used tool for building prediction models from data streams, due to its intrinsic merits of handling large volumes of stream data. Despite its extraordinary successes in stream data mining, existing ensemble models, in stream data environments, mainly fall into the ensemble classifiers category, without realizing that building classifiers requires a labor intensive labeling process, and it is often the case that we may have a small number of labeled samples to train a few classifiers, but a large number of unlabeled samples are available to build clusters from data streams....

10.1109/icdm.2010.125 article EN 2010-12-01

E-Tree: An Efficient Indexing Structure for Ensemble Models on Data Streams

Peng Zhang Chuan Zhou Peng Wang Byron J. Gao Xingquan Zhu and 1 more

Ensemble learning is a common tool for data stream classification, mainly because of its inherent advantages of handling large volumes of stream data and concept drifting. Previous studies, to date, have been primarily focused on building accurate ensemble models from stream data. However, a linear scan of a large number of base classifiers in the ensemble during prediction incurs significant costs in response time, preventing ensemble learning from being practical for many real-world time-critical data stream applications, such as Web traffic stream monitoring, spam detection, and intrusion...

10.1109/tkde.2014.2298018 article EN IEEE Transactions on Knowledge and Data Engineering 2014-01-31

Comparative study between incremental and ensemble learning on data streams: Case study

Wenyu Zang Peng Zhang Chuan Zhou Li Guo

With unlimited growth of real-world data size and the increasing requirement of real-time processing, immediate processing of big stream data has become an urgent problem. In stream data, hidden patterns commonly evolve over time (i.e., concept drift), where many dynamic learning strategies have been proposed, such as incremental learning and ensemble learning. To the best of our knowledge, there is no work that systematically compares these two methods. In this paper we conduct a comparative study between these two learning methods. We first introduce the concept...

10.1186/2196-1115-1-5 article EN cc-by Journal Of Big Data 2014-06-24

MDNN: A Multimodal Deep Neural Network for Predicting Drug-Drug Interaction Events

Tengfei Lyu Jianliang Gao Ling Tian Zhao Li Peng Zhang and 1 more

The interaction of multiple drugs could lead to serious events, which causes injuries and huge medical costs. Accurate prediction of drug-drug interaction (DDI) events can help clinicians make effective decisions and establish appropriate therapy programs. Recently, many AI-based techniques have been proposed for predicting DDI associated events. However, most existing methods pay less attention to the potential correlations between DDI events and other multimodal data such as targets and enzymes. To address this problem, we...

10.24963/ijcai.2021/487 article EN 2021-08-01

Fan-Shaped Extending Conjugation Strategies for Achieving Narrowband Emissions of Boron–Nitrogen-Based Molecules

Ping Li Qingqing Yang Peng Zhang Chunyu Zeng Xianjie Wang and 2 more

Multiple-resonance thermally activated delayed fluorescence (MR-TADF) materials have attracted extensive attention due to their 100% exciton utilization efficiency and narrowband emissions. Numerous tube-shaped MR-TADF emitters with full-color emissions have been reported, and updated molecular design strategies need to be proposed to find more "recipes" to narrow the emission spectral range. Upon changing the shape of the fluorophore from a tubular to a fan-shaped structure, the investigated molecules exhibit narrowband emissions based on...

10.1021/acs.jpca.4c08377 article EN The Journal of Physical Chemistry A 2025-02-06

Mining Data Streams with Labeled and Unlabeled Training Examples

Peng Zhang Xingquan Zhu Li Guo

In this paper, we propose a framework to build prediction models from data streams which contain both labeled and unlabeled examples. We argue that due to the increasing data collection ability but limited resources for labeling, stream data collected at hand may only have a small number of labeled examples, whereas a large portion of data remain unlabeled but can be beneficial for learning. Unleashing the full potential of the unlabeled instances for stream data mining is, however, a significant challenge, considering that even fully labeled data streams may suffer from concept drifting, and inappropriate uses of the unlabeled samples...

10.1109/icdm.2009.76 article EN 2009-12-01

Enabling Fast Lazy Learning for Data Streams

Peng Zhang Byron J. Gao Xingquan Zhu Li Guo

Lazy learning, such as k-nearest neighbor learning, has been widely applied to many applications. Known for well capturing data locality, lazy learning can be advantageous for highly dynamic and complex learning environments such as data streams. Yet its high memory consumption and low prediction efficiency have made it less favorable for stream oriented applications. Specifically, traditional lazy learning stores all the training data and the inductive process is deferred until a query appears, whereas in stream applications, data records flow continuously in large volumes and the prediction of class labels...

10.1109/icdm.2011.63 article EN 2011-12-01

Robust Embedding with Multi-Level Structures for Link Prediction

Zihan Wang Zhaochun Ren Chunyu He Peng Zhang Yue Hu

Knowledge Graph (KG) embedding has become crucial for the task of link prediction. Recent work applies encoder-decoder models to tackle this problem, where an encoder is formulated as a graph neural network (GNN) and a decoder is represented by an embedding method. These approaches enforce embedding techniques with structure information. Unfortunately, existing GNN-based frameworks still confront 3 severe problems: low representational power, stacking in a flat way, and poor robustness to noise. In this work, we propose a novel...

10.24963/ijcai.2019/728 article EN 2019-07-28

GraphNAS++: Distributed Architecture Search for Graph Neural Networks

Yang Gao Peng Zhang Hong Yang Chuan Zhou Zhihong Tian and 3 more

Graph neural networks (GNNs) are popularly used to analyze non-Euclidean graph data. Despite their successes, the design of graph neural networks requires heavy manual work and rich domain knowledge. Recently, neural architecture search algorithms are widely used to automatically design neural architectures for CNNs and RNNs. Inspired by the success of neural architecture search algorithms, we present a graph neural architecture search algorithm GraphNAS that enables automatic design of the best graph neural architecture based on reinforcement learning. Specifically, GraphNAS uses a recurrent network as the controller to generate variable-length strings that describe the architectures of graph neural networks,...

10.1109/tkde.2022.3178153 article EN IEEE Transactions on Knowledge and Data Engineering 2022-01-01

Enabling fast prediction for ensemble models on data streams

Peng Zhang Jun Li Peng Wang Byron J. Gao Xingquan Zhu and 1 more

Ensemble learning has become a common tool for data stream classification, being able to handle large volumes of stream data and concept drifting. Previous studies focus on building accurate prediction models from stream data. However, a linear scan of a large number of base classifiers in the ensemble during prediction incurs significant costs in response time, preventing ensemble learning from being practical for many real world time-critical data stream applications, such as Web traffic stream monitoring, spam detection, and intrusion detection. In these applications, data streams usually arrive at a speed...

10.1145/2020408.2020442 article EN 2011-08-21

Dual Implicit Mining-Based Latent Friend Recommendation

Lin Cui Jia Wu Dechang Pi Peng Zhang Paul Kennedy

The latent friend recommendation in online social media is interesting, yet challenging, because the user-item ratings and the user-user relationships are both sparse. In this paper, we propose a new dual implicit mining-based latent friend recommendation model that simultaneously considers the implicit interest topics of users and the implicit link relationships between the users in the local topic cliques. Specifically, we first propose an algorithm called all reviews from a user and all tags from their corresponding items to learn the implicit interest topics of the users and their corresponding topic weights, then compute the user interest topic similarity using symmetric Jensen-Shannon divergence....

10.1109/tsmc.2017.2777889 article EN IEEE Transactions on Systems Man and Cybernetics Systems 2018-01-15

HIP Network: Historical Information Passing Network for Extrapolation Reasoning on Temporal Knowledge Graph

Yongquan He Peng Zhang Luchen Liu Qi Liang Wenyuan Zhang and 1 more

In recent years, temporal knowledge graph (TKG) reasoning has received significant attention. Most existing methods assume that all timestamps and corresponding graphs are available during training, which makes it difficult to predict future events. To address this issue, recent works learn to infer future events based on historical information. However, these methods do not comprehensively consider the latent patterns behind temporal changes, to pass historical information selectively, to update representations appropriately and to predict events accurately....

10.24963/ijcai.2021/264 preprint EN 2021-08-01

Live-Streaming Fraud Detection: A Heterogeneous Graph Neural Network Approach

Zhao Li Haishuai Wang Peng Zhang Pengrui Hui Jiaming Huang and 3 more

Live-streaming platforms have recently gained significant popularity by attracting an increasing number of young users and have become a very promising form of online shopping. Similar to the traditional online shopping platforms such as Taobao, live-streaming platforms also suffer from malicious online fraudulent behaviors where many transactions are not genuine. The existing anti-fraud models proposed to recognize fraudulent transactions on traditional online shopping platforms are inapplicable on live-streaming platforms. This is mainly because live-streaming platforms are characterized by a unique type of heterogeneous live-streaming networks where multiple types of nodes...

10.1145/3447548.3467065 article EN 2021-08-13

Graph Neural Network for Ethereum Fraud Detection

Runnan Tan Qingfeng Tan Peng Zhang Zhao Li

Currently, blockchain technology has been widely applied to various industries, and has attracted wide attention. However, because of its unique anonymity, digital currency has become a haven for all kinds of cyber crimes. It has been reported that Ethereum frauds provide huge profits, and pose a serious threat to the financial security of the Ethereum network. To create a desired financial environment, an effective method is urgently needed to automatically detect and identify Ethereum frauds in the governance of the Ethereum system. In view of this, this paper proposes a method for detecting Ethereum frauds by mining...

10.1109/ickg52313.2021.00020 article EN 2021-12-01

A novel semi-supervised classification approach for evolving data streams

Guobo Liao Peng Zhang Hongpeng Yin Xuanhong Deng Yanxia Li and 2 more

10.1016/j.eswa.2022.119273 article EN Expert Systems with Applications 2022-11-20
