Chaojie Wang

ORCID: 0000-0002-7644-7621
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Advanced Text Analysis Techniques
  • Natural Language Processing Techniques
  • Advanced Graph Neural Networks
  • Computational and Text Analysis Methods
  • Bayesian Methods and Mixture Models
  • Text and Document Classification Technologies
  • Statistical Methods and Inference
  • Domain Adaptation and Few-Shot Learning
  • Advanced Neural Network Applications
  • Complex Network Analysis Techniques
  • Remote-Sensing Image Classification
  • Advanced Image and Video Retrieval Techniques
  • Graph Theory and Algorithms
  • Statistical Methods and Bayesian Inference
  • Matrix Theory and Algorithms
  • Mineral Processing and Grinding
  • Web Data Mining and Analysis
  • Semantic Web and Ontologies
  • Neural Networks and Applications
  • Adsorption and biosorption for pollutant removal
  • Image and Signal Denoising Methods
  • Machine Learning and Data Classification
  • Image Retrieval and Classification Techniques
  • Pain Management and Treatment

Liaoning University of Traditional Chinese Medicine
2025

Jiangsu University
2019-2025

Affiliated Hospital of Jiangsu University
2022-2025

Foshan University
2025

Nanyang Technological University
2024

Shandong Normal University
2024

Jining University
2024

Xidian University
2018-2022

East China Normal University
2021-2022

Ministry of Natural Resources
2022

ive document summarization is a comprehensive task including understanding and summary generation, in which area Transformer-based models have achieved the state-of-the-art performance. Compared with Transformers, topic are better at learning explicit semantics, hence could be integrated into Transformers to further boost their To this end, we rearrange explore semantics learned by model, then propose assistant (TA) three modules. TA compatible various user-friendly since i) plug-and-play...

10.18653/v1/2020.emnlp-main.35 article EN 2020-01-01

We develop a recurrent gamma belief network (rGBN) for radar automatic target recognition (RATR) based on high-resolution range profile (HRRP), which characterizes the temporal dependence across cells of HRRP. The proposed rGBN adopts hierarchy distributions to build its deep generative model. For scalable training and fast out-of-sample prediction, we propose hybrid stochastic-gradient Markov chain Monte Carlo (MCMC) variational inference model perform posterior inference. To utilize label...

10.1109/tsp.2020.3027470 article EN IEEE Transactions on Signal Processing 2020-01-01

CITE-seq provides a powerful method for simultaneously measuring RNA and protein expression at the single-cell level. The integrated analysis of in identical cells is crucial revealing cellular heterogeneity. However, high experimental costs associated with limit its widespread application. In this paper, we propose scTEL, deep learning framework based on Transformer encoder layers, to establish mapping from sequenced unobserved same cells. This computation-based approach significantly...

10.1038/s41540-024-00484-9 article EN cc-by-nc-nd npj Systems Biology and Applications 2025-01-02

Abstract Biochar is a promising technology for carbon storage and greenhouse gas (GHG) reduction, but optimizing it challenging due to the complexity of natural systems. Machine learning (ML) language processing (NLP) offer solutions through enhanced data analysis pattern recognition, ushering in new era biochar research. Graphical

10.1007/s42773-024-00424-0 article EN cc-by Biochar 2025-01-20

In the face of escalating crisis water pollution, biochar-based hydrogel composites (BCGs) have emerged as a promising material for treatment, owing to their distinctive performance and environmental friendliness. These combine high specific surface area porous structure biochar with three-dimensional network hydrogel, demonstrating superior adsorption capacities ease recyclability within aquatic systems. This paper provides first overview BCGs synthesis methods, particular emphasis on...

10.3390/pr13030664 article EN Processes 2025-02-26

Predicting mortality rates is a crucial issue in life insurance pricing and demographic statistics. Traditional approaches, such as the Lee-Carter model its variants, predict trends of using factor models, which explain variations from perspective ages, gender, regions, other factors. Recently, deep learning techniques have achieved great success various tasks shown strong potential for time-series forecasting. In this paper, we propose modified Transformer architecture predicting major...

10.1080/03461238.2023.2218859 article EN Scandinavian Actuarial Journal 2023-05-30

10.1016/j.cam.2014.08.010 article EN publisher-specific-oa Journal of Computational and Applied Mathematics 2014-09-30

As machine learning algorithms are increasingly deployed for high-impact automated decision-making, the presence of bias (in datasets or tasks) gradually becomes one most critical challenges in applications. Such range from race face recognition to gender hiring systems, where and can be denoted as sensitive attributes. In recent years, much progress has been made ensuring fairness reducing standard settings. Among them, fair representations with respect attributes attracted increasing...

10.1109/tnnls.2022.3187165 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-08-15

To learn a deep generative model of multimodal data, we propose Poisson gamma belief network (mPGBN) that tightly couple the data different modalities at multiple hidden layers. The mPGBN unsupervisedly extracts nonnegative latent representation using an upward-downward Gibbs sampler. It imposes sparse connections between layers, making it simple to visualize process and relationships features modalities. Our experimental results on bi-modal consisting images tags show can easily impute...

10.1609/aaai.v32i1.11846 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-26

The rapid development of mobile Internet has brought new opportunities to college education and teaching. Taking university English teaching as the research object, this paper analyses characteristics in classroom, WeChat platform, learning app other forms teaching, changes influence on its application According actual situation, a comprehensive evaluation system based process results was established. show that model innovative examination can effectively improve efficiency students'...

10.4018/ijicte.343320 article EN International Journal of Information and Communication Technology Education 2024-05-07

Large Language Models (LLMs) have demonstrated impressive capability in many nature language tasks. However, the auto-regressive generation process makes LLMs prone to produce errors, hallucinations and inconsistent statements when performing multi-step reasoning. In this paper, we aim alleviate pathology by introducing Q*, a general, versatile agile framework for guiding decoding with deliberative planning. By learning plug-and-play Q-value model as heuristic function, our Q* can...

10.48550/arxiv.2406.14283 preprint EN arXiv (Cornell University) 2024-06-20

10.1109/cvpr52733.2024.02165 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

For text analysis, one often resorts to a lossy representation that either completely ignores word order or embeds each as low-dimensional dense feature vector. In this paper, we propose convolutional Poisson factor analysis (CPFA) directly operates on lossless processes the words in document sequence of high-dimensional one-hot vectors. To boost its performance, further gamma belief network (CPGBN) couples CPFA with via novel probabilistic pooling layer. forms into phrases and captures very...

10.48550/arxiv.1905.05394 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Modulation recognition has always been an important task in the development of cognitive radio. At present, there are two main application methods for signal data, namely, directly using sequence and some conversions such as constellation diagram. In this paper, converted contour stella images adopted data source research. The deep learning method proposed, which is called Image-based CNN with Attention Model (ICAM). ICAM based on Residual Neural Network (ResNet). To evaluate performance...

10.1109/access.2020.3038208 article EN cc-by IEEE Access 2020-01-01

Hierarchical topic models such as the gamma belief network (GBN) have delivered promising results in mining multi-layer document representations and discovering interpretable taxonomies. However, they often assume prior that topics at each layer are independently drawn from Dirichlet distribution, ignoring dependencies between both same across different layers. To relax this assumption, we propose sawtooth factorial embedding guided GBN, a deep generative model of documents captures semantic...

10.48550/arxiv.2107.02757 preprint EN other-oa arXiv (Cornell University) 2021-01-01

We propose a Bayesian generative model for incorporating prior domain knowledge into hierarchical topic modeling. Although embedded models (ETMs) and its variants have gained promising performance in text analysis, they mainly focus on mining word co-occurrence patterns, ignoring potentially easy-to-obtain hierarchies that could help enhance coherence. While several knowledge-based recently been proposed, are either only applicable to shallow or sensitive the quality of provided knowledge....

10.48550/arxiv.2209.14228 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Abstract High-dimensional covariance matrix estimation plays a central role in multivariate statistical analysis. It is well-known that the sample singular when size smaller than dimension of variable, but estimate must be positive-definite. This motivates some modifications to preserve its efficient pairwise covariance. In this paper, we modify correlation using Bagging technique. The proposed estimator flexible for general continuous data. Under mild conditions, show theoretically can...

10.1007/s10994-022-06138-3 article EN cc-by Machine Learning 2022-03-18

For document analysis, existing methods often resort to the representation that either discards word order information or projects each into a low-dimensional dense embedding vector. However, confined by data's sparsity and high-dimensionality, limited effort has been made explore semantic structures underlying formulates as sequence of one-hot vectors, especially in probabilistic modeling literature. To construct generative model for this type representation, we first develop convolutional...

10.1109/tpami.2022.3192319 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-01-01
Coming Soon ...