NFDI4DS | UHH-SEMS - Publication Details

Chaojie Wang

ORCID: 0000-0002-7644-7621

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100774764

Research Areas

Topic Modeling
Advanced Text Analysis Techniques
Natural Language Processing Techniques
Advanced Graph Neural Networks
Computational and Text Analysis Methods
Bayesian Methods and Mixture Models
Text and Document Classification Technologies
Statistical Methods and Inference
Domain Adaptation and Few-Shot Learning
Advanced Neural Network Applications
Complex Network Analysis Techniques
Remote-Sensing Image Classification
Advanced Image and Video Retrieval Techniques
Graph Theory and Algorithms
Statistical Methods and Bayesian Inference
Matrix Theory and Algorithms
Mineral Processing and Grinding
Web Data Mining and Analysis
Semantic Web and Ontologies
Neural Networks and Applications
Adsorption and biosorption for pollutant removal
Image and Signal Denoising Methods
Machine Learning and Data Classification
Image Retrieval and Classification Techniques
Pain Management and Treatment

Liaoning University of Traditional Chinese Medicine
2025

Jiangsu University
2019-2025

Affiliated Hospital of Jiangsu University
2022-2025

Foshan University
2025

Nanyang Technological University
2024

Shandong Normal University
2024

Jining University
2024

Xidian University
2018-2022

East China Normal University
2021-2022

Ministry of Natural Resources
2022

Friendly Topic Assistant for Transformer Based Abstractive Summarization

OPENALEX - Publications

Zhengjue Wang Zhibin Duan Hao Zhang Chaojie Wang Long Tian and 2 more

ive document summarization is a comprehensive task including understanding and summary generation, in which area Transformer-based models have achieved the state-of-the-art performance. Compared with Transformers, topic are better at learning explicit semantics, hence could be integrated into Transformers to further boost their To this end, we rearrange explore semantics learned by model, then propose assistant (TA) three modules. TA compatible various user-friendly since i) plug-and-play...

10.18653/v1/2020.emnlp-main.35 article EN 2020-01-01

Variational Temporal Deep Generative Model for Radar HRRP Target Recognition

OPENALEX - Publications

Dandan Guo Bo Chen Wenchao Chen Chaojie Wang Hongwei Liu and 1 more

We develop a recurrent gamma belief network (rGBN) for radar automatic target recognition (RATR) based on high-resolution range profile (HRRP), which characterizes the temporal dependence across cells of HRRP. The proposed rGBN adopts hierarchy distributions to build its deep generative model. For scalable training and fast out-of-sample prediction, we propose hybrid stochastic-gradient Markov chain Monte Carlo (MCMC) variational inference model perform posterior inference. To utilize label...

10.1109/tsp.2020.3027470 article EN IEEE Transactions on Signal Processing 2020-01-01

A joint analysis of single cell transcriptomics and proteomics using transformer

OPENALEX - Publications

Yuanyuan Chen Xiaodan Fan Chaowen Shi Zheng Shi Chaojie Wang

CITE-seq provides a powerful method for simultaneously measuring RNA and protein expression at the single-cell level. The integrated analysis of in identical cells is crucial revealing cellular heterogeneity. However, high experimental costs associated with limit its widespread application. In this paper, we propose scTEL, deep learning framework based on Transformer encoder layers, to establish mapping from sequenced unobserved same cells. This computation-based approach significantly...

10.1038/s41540-024-00484-9 article EN cc-by-nc-nd npj Systems Biology and Applications 2025-01-02

Optimizing biochar for carbon sequestration: a synergistic approach using machine learning and natural language processing

OPENALEX - Publications

Jiayi Li Yixuan Chen Chaojie Wang Hanbo Chen Yurong Gao and 7 more

Abstract Biochar is a promising technology for carbon storage and greenhouse gas (GHG) reduction, but optimizing it challenging due to the complexity of natural systems. Machine learning (ML) language processing (NLP) offer solutions through enhanced data analysis pattern recognition, ushering in new era biochar research. Graphical

10.1007/s42773-024-00424-0 article EN cc-by Biochar 2025-01-20

Recent Advances in Biochar-Based Hydrogel Composites: Preparation, Aquatic Environmental Applications, and Adsorption Mechanisms

OPENALEX - Publications

Yuxin Zhao Chaojie Wang Qing Han Zheng Fang Yurong Gao and 5 more

In the face of escalating crisis water pollution, biochar-based hydrogel composites (BCGs) have emerged as a promising material for treatment, owing to their distinctive performance and environmental friendliness. These combine high specific surface area porous structure biochar with three-dimensional network hydrogel, demonstrating superior adsorption capacities ease recyclability within aquatic systems. This paper provides first overview BCGs synthesis methods, particular emphasis on...

10.3390/pr13030664 article EN Processes 2025-02-26

Coupled Global–Local object detection for large VHR aerial images

OPENALEX - Publications

Xi Chen Chaojie Wang Zhihong Li Min Liu Qingli Li and 4 more

10.1016/j.knosys.2022.110097 article EN Knowledge-Based Systems 2022-11-17

Time-series forecasting of mortality rates using transformer

OPENALEX - Publications

Jun Wang Lihong Wen Xiao Lu Chaojie Wang

Predicting mortality rates is a crucial issue in life insurance pricing and demographic statistics. Traditional approaches, such as the Lee-Carter model its variants, predict trends of using factor models, which explain variations from perspective ages, gender, regions, other factors. Recently, deep learning techniques have achieved great success various tasks shown strong potential for time-series forecasting. In this paper, we propose modified Transformer architecture predicting major...

10.1080/03461238.2023.2218859 article EN Scandinavian Actuarial Journal 2023-05-30

An explicit formula for the inverse of a pentadiagonal Toeplitz matrix

OPENALEX - Publications

Chaojie Wang Hongyi Li Di Zhao

10.1016/j.cam.2014.08.010 article EN publisher-specific-oa Journal of Computational and Applied Mathematics 2014-09-30

Interpretable machine learning for predicting heavy metal removal and optimizing biochar characteristics

OPENALEX - Publications

Chaojie Wang Yuxin Zhao Yurong Gao Hanbo Chen Xiaofei Li and 4 more

10.1016/j.jwpe.2024.106484 article EN Journal of Water Process Engineering 2024-11-11

Learning Fair Representations via Distance Correlation Minimization

OPENALEX - Publications

Dandan Guo Chaojie Wang Baoxiang Wang Hongyuan Zha

As machine learning algorithms are increasingly deployed for high-impact automated decision-making, the presence of bias (in datasets or tasks) gradually becomes one most critical challenges in applications. Such range from race face recognition to gender hiring systems, where and can be denoted as sensitive attributes. In recent years, much progress has been made ensuring fairness reducing standard settings. Among them, fair representations with respect attributes attracted increasing...

10.1109/tnnls.2022.3187165 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-08-15

High-level attributes modeling for indoor scenes classification

OPENALEX - Publications

Chaojie Wang Jun Yu Dapeng Tao

10.1016/j.neucom.2013.05.032 article EN Neurocomputing 2013-06-13

Multimodal Poisson Gamma Belief Network

OPENALEX - Publications

Chaojie Wang Bo Chen Mingyuan Zhou

To learn a deep generative model of multimodal data, we propose Poisson gamma belief network (mPGBN) that tightly couple the data different modalities at multiple hidden layers. The mPGBN unsupervisedly extracts nonnegative latent representation using an upward-downward Gibbs sampler. It imposes sparse connections between layers, making it simple to visualize process and relationships features modalities. Our experimental results on bi-modal consisting images tags show can easily impute...

10.1609/aaai.v32i1.11846 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-26

Reform and Innovation of College English Teaching Under the Background of Mobile Internet and Big Data

OPENALEX - Publications

Chaojie Wang Jie Pan

The rapid development of mobile Internet has brought new opportunities to college education and teaching. Taking university English teaching as the research object, this paper analyses characteristics in classroom, WeChat platform, learning app other forms teaching, changes influence on its application According actual situation, a comprehensive evaluation system based process results was established. show that model innovative examination can effectively improve efficiency students'...

10.4018/ijicte.343320 article EN International Journal of Information and Communication Technology Education 2024-05-07

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

OPENALEX - Publications

Chaojie Wang Yanchen Deng Zhiyi Lv Shuicheng Yan Bo An

Large Language Models (LLMs) have demonstrated impressive capability in many nature language tasks. However, the auto-regressive generation process makes LLMs prone to produce errors, hallucinations and inconsistent statements when performing multi-step reasoning. In this paper, we aim alleviate pathology by introducing Q*, a general, versatile agile framework for guiding decoding with deliberative planning. By learning plug-and-play Q-value model as heuristic function, our Q* can...

10.48550/arxiv.2406.14283 preprint EN arXiv (Cornell University) 2024-06-20

Interpretable Machine Learning for Predicting Heavy Metal Removal and Optimizing Biochar Characteristics

OPENALEX - Publications

Chaojie Wang Yuxin Zhao Yurong Gao Hanbo Chen Xiaofei Li and 4 more

10.2139/ssrn.4919543 preprint EN 2024-01-01

Improving Unsupervised Hierarchical Representation With Reinforcement Learning

OPENALEX - Publications

Ruyi An Y. Li Xu He Pengjie Gu Mengchen Zhao and 5 more

10.1109/cvpr52733.2024.02165 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Convolutional Poisson Gamma Belief Network

OPENALEX - Publications

Chaojie Wang Bo Chen Sucheng Xiao Mingyuan Zhou

For text analysis, one often resorts to a lossy representation that either completely ignores word order or embeds each as low-dimensional dense feature vector. In this paper, we propose convolutional Poisson factor analysis (CPFA) directly operates on lossless processes the words in document sequence of high-dimensional one-hot vectors. To boost its performance, further gamma belief network (CPGBN) couples CPFA with via novel probabilistic pooling layer. forms into phrases and captures very...

10.48550/arxiv.1905.05394 preprint EN other-oa arXiv (Cornell University) 2019-01-01

The Performance Analysis of Signal Recognition Using Attention Based CNN Method

OPENALEX - Publications

Zan Yin Bo Chen Weimin Zhen Chaojie Wang Ting Zhang

Modulation recognition has always been an important task in the development of cognitive radio. At present, there are two main application methods for signal data, namely, directly using sequence and some conversions such as constellation diagram. In this paper, converted contour stella images adopted data source research. The deep learning method proposed, which is called Image-based CNN with Attention Model (ICAM). ICAM based on Residual Neural Network (ResNet). To evaluate performance...

10.1109/access.2020.3038208 article EN cc-by IEEE Access 2020-01-01

Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network

OPENALEX - Publications

Zhibin Duan Dongsheng Wang Bo Chen Chaojie Wang Chen Wen-chao and 3 more

Hierarchical topic models such as the gamma belief network (GBN) have delivered promising results in mining multi-layer document representations and discovering interpretable taxonomies. However, they often assume prior that topics at each layer are independently drawn from Dirichlet distribution, ignoring dependencies between both same across different layers. To relax this assumption, we propose sawtooth factorial embedding guided GBN, a deep generative model of documents captures semantic...

10.48550/arxiv.2107.02757 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Knowledge-Aware Bayesian Deep Topic Model

OPENALEX - Publications

Dongsheng Wang Yishi Xu Miaoge Li Zhibin Duan Chaojie Wang and 2 more

We propose a Bayesian generative model for incorporating prior domain knowledge into hierarchical topic modeling. Although embedded models (ETMs) and its variants have gained promising performance in text analysis, they mainly focus on mining word co-occurrence patterns, ignoring potentially easy-to-obtain hierarchies that could help enhance coherence. While several knowledge-based recently been proposed, are either only applicable to shallow or sensitive the quality of provided knowledge....

10.48550/arxiv.2209.14228 preprint EN other-oa arXiv (Cornell University) 2022-01-01

High-dimensional correlation matrix estimation for general continuous data with Bagging technique

OPENALEX - Publications

Chaojie Wang Jin Du Xiaodan Fan

Abstract High-dimensional covariance matrix estimation plays a central role in multivariate statistical analysis. It is well-known that the sample singular when size smaller than dimension of variable, but estimate must be positive-definite. This motivates some modifications to preserve its efficient pairwise covariance. In this paper, we modify correlation using Bagging technique. The proposed estimator flexible for general continuous data. Under mild conditions, show theoretically can...

10.1007/s10994-022-06138-3 article EN cc-by Machine Learning 2022-03-18

Generative Text Convolutional Neural Network for Hierarchical Document Representation Learning

OPENALEX - Publications

Chaojie Wang Bo Chen Zhibin Duan Wenchao Chen Hao Zhang and 1 more

For document analysis, existing methods often resort to the representation that either discards word order information or projects each into a low-dimensional dense embedding vector. However, confined by data's sparsity and high-dimensionality, limited effort has been made explore semantic structures underlying formulates as sequence of one-hot vectors, especially in probabilistic modeling literature. To construct generative model for this type representation, we first develop convolutional...

10.1109/tpami.2022.3192319 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-01-01

Coming Soon ...