- Generative Adversarial Networks and Image Synthesis
- Domain Adaptation and Few-Shot Learning
- Model Reduction and Neural Networks
- Music and Audio Processing
- Machine Learning in Healthcare
- Multimodal Machine Learning Applications
- Topic Modeling
- Machine Learning and Data Classification
- Computer Graphics and Visualization Techniques
- Image Retrieval and Classification Techniques
- Adversarial Robustness in Machine Learning
- Advanced Image and Video Retrieval Techniques
- Face Recognition and Analysis
- Gaussian Processes and Bayesian Inference
- Neural Networks and Applications
- Video Analysis and Summarization
- Bayesian Methods and Mixture Models
- Text and Document Classification Technologies
- Anomaly Detection Techniques and Applications
- Chaos-based Image/Signal Encryption
- Human Pose and Action Recognition
- Ferroelectric and Negative Capacitance Devices
- Advanced Image Processing Techniques
- COVID-19 Diagnosis Using AI
- Cell Image Analysis Techniques
Renmin University of China
2021-2025
Beijing Institute of Big Data Research
2022-2025
Tsinghua University
2015-2023
Robert Bosch (Taiwan)
2020
Generative Adversarial Nets (GANs) have shown promise in image generation and semi-supervised learning (SSL). However, existing GANs in SSL have two problems: (1) the generator and the discriminator (i.e., the classifier) may not be optimal at the same time; and (2) the generator cannot control the semantics of the generated samples. The problems essentially arise from the two-player formulation, where a single discriminator shares incompatible roles of identifying fake samples and predicting labels, and it only estimates the data without considering the labels. To address...
Diffusion probabilistic models (DPMs) are emerging powerful generative models. Despite their high-quality generation performance, DPMs still suffer from slow sampling, as they generally need hundreds or thousands of sequential function evaluations (steps) of large neural networks to draw a sample. Sampling from DPMs can alternatively be viewed as solving the corresponding diffusion ordinary differential equations (ODEs). In this work, we propose an exact formulation of the solution of diffusion ODEs. The formulation analytically computes...
Score distillation sampling (SDS) has shown great promise in text-to-3D generation by distilling pretrained large-scale text-to-image diffusion models, but suffers from over-saturation, over-smoothing, and low-diversity problems. In this work, we propose to model the 3D parameter as a random variable instead of a constant as in SDS, and present variational score distillation (VSD), a principled particle-based variational framework to explain and address the aforementioned issues in text-to-3D generation. We show that SDS is a special case of VSD and leads to poor samples...
Diffusion probabilistic models (DPMs) have achieved impressive success in high-resolution image synthesis, especially in recent large-scale text-to-image generation applications. An essential technique for improving the sample quality of DPMs is guided sampling, which usually needs a large guidance scale to obtain the best quality. The commonly used fast sampler for guided sampling is DDIM, a first-order diffusion ODE solver that generally needs 100 to 250 steps for high-quality samples. Although recent works propose dedicated...
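The guidance-scale mechanism referenced above is simple to state in code. This is a hedged sketch of classifier-free guidance with hypothetical toy predictors standing in for a trained DPM; the combination rule is standard, but these functions and shapes are assumptions.

```python
import numpy as np

# Classifier-free guidance blends an unconditional and a conditional noise
# prediction with a guidance scale s; s > 1 strengthens the conditioning.
# The two "models" below are hypothetical toy functions, not a real DPM.

def eps_uncond(x_t):
    return 0.1 * x_t

def eps_cond(x_t, y):
    return 0.1 * x_t + 0.05 * y

def guided_eps(x_t, y, scale):
    # eps_guided = eps_uncond + scale * (eps_cond - eps_uncond)
    eu = eps_uncond(x_t)
    ec = eps_cond(x_t, y)
    return eu + scale * (ec - eu)

x = np.ones(4)
y = np.full(4, 2.0)
out = guided_eps(x, y, scale=7.5)   # a large scale, typical for text-to-image
```

At scale 1 the rule reduces to the conditional prediction and at scale 0 to the unconditional one; large scales extrapolate beyond the conditional model, which is what makes high-guidance sampling numerically harder for fast solvers.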
Vision transformers (ViT) have shown promise in various vision tasks, while the U-Net based on a convolutional neural network (CNN) remains dominant in diffusion models. We design a simple and general ViT-based architecture (named U-ViT) for image generation with diffusion models. U-ViT is characterized by treating all inputs, including the time, condition, and noisy image patches, as tokens, and by employing long skip connections between shallow and deep layers. We evaluate U-ViT on unconditional and class-conditional image generation, as well as text-to-image generation tasks,...
Diffusion probabilistic models (DPMs) represent a class of powerful generative models. Despite their success, the inference of DPMs is expensive, since it generally needs to iterate over thousands of timesteps. A key problem in the inference is to estimate the variance in each timestep of the reverse process. In this work, we present a surprising result that both the optimal reverse variance and the corresponding optimal KL divergence of a DPM have analytic forms w.r.t. its score function. Building upon it, we propose Analytic-DPM, a training-free inference framework that estimates the analytic forms using...
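The analytic forms above reduce variance estimation to expectations of the squared score norm. As a toy illustration of estimating such a quantity by Monte Carlo, the sketch below uses a standard Gaussian, whose score is known in closed form and whose per-dimension expected squared score norm equals 1; in the paper this expectation would instead be estimated with a pretrained score network.

```python
import numpy as np

# Monte Carlo estimate of E_q[ ||score(x)||^2 / d ] for the toy choice
# q = N(0, I_d), where score(x) = -x and the expectation equals 1 exactly.
# This is a hypothetical stand-in for estimating the same expectation with
# a trained score model, as a training-free inference framework would.

def mean_sq_score_norm(d=16, n=20000, seed=0):
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n, d))
    score = -x                          # closed-form score of a standard Gaussian
    return float(np.mean(np.sum(score**2, axis=1)) / d)
```

Because the estimate only queries the score function on samples, no retraining of the model is needed, which is the sense in which such variance estimates are "training-free".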
Automatically writing stylized characters is an attractive yet challenging task, especially for Chinese characters with complex shapes and structures. Most current methods are restricted to generating characters already present in the training set, and are required to retrain the model when generating characters of new styles. In this paper, we develop a novel framework, the Style-Aware Variational Auto-Encoder (SA-VAE), which disentangles the content-relevant and style-relevant components of a Chinese character feature with an intercross pair-wise optimization method....
Continual learning usually assumes the incoming data are fully labeled, which might not be applicable in real applications. In this work, we consider semi-supervised continual learning (SSCL), which incrementally learns from partially labeled data. Observing that existing continual learning methods lack the ability to continually exploit unlabeled data, we propose deep Online Replay with Discriminator Consistency (ORDisCo) to interdependently learn a classifier with a conditional generative adversarial network (GAN), which continually passes the learned...
Large vision-language models (VLMs) such as GPT-4 have achieved unprecedented performance in response generation, especially with visual inputs, enabling more creative and adaptable interaction than large language models such as ChatGPT. Nonetheless, multimodal generation exacerbates safety concerns, since adversaries may successfully evade the entire system by subtly manipulating the most vulnerable modality (e.g., vision). To this end, we propose evaluating the robustness of open-source VLMs in the most realistic and high-risk...
We propose a unified game-theoretical framework to perform classification and conditional image generation given limited supervision. It is formulated as a three-player minimax game consisting of a generator, a classifier, and a discriminator, and is therefore referred to as the Triple Generative Adversarial Network (Triple-GAN). The generator and the classifier characterize the conditional distributions between images and labels to perform conditional generation and classification, respectively. The discriminator solely focuses on identifying fake image-label pairs. Theoretically, the three-player formulation...
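The three-player game can be sketched as a single minimax utility in which the discriminator D scores image-label pairs, while the classifier C and the generator G each produce fake pairs. The form below is a sketch up to the paper's exact regularizers and notation; in particular, treating alpha as a mixing weight between the two sources of fake pairs is an assumption of this sketch.

```latex
\min_{C,G}\;\max_{D}\quad
\mathbb{E}_{(x,y)\sim p(x,y)}\big[\log D(x,y)\big]
\;+\;\alpha\,\mathbb{E}_{(x,y)\sim p_C(x,y)}\big[\log\big(1-D(x,y)\big)\big]
\;+\;(1-\alpha)\,\mathbb{E}_{(x,y)\sim p_G(x,y)}\big[\log\big(1-D(x,y)\big)\big]
```

The key structural point is that D only ever judges pairs, so C and G are not forced into the incompatible dual role that a single two-player discriminator carries.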
Continual learning needs to overcome catastrophic forgetting of the past. Memory replay of representative old training samples has been shown to be an effective solution and achieves state-of-the-art (SOTA) performance. However, existing work is mainly built on a small memory buffer containing a few original data points, which cannot fully characterize the data distribution. In this work, we propose memory replay with data compression (MRDC) to reduce the storage cost of old training samples and thus increase the amount that can be stored in the memory buffer. Observing...
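The storage argument is easy to demonstrate: under a fixed byte budget, a buffer of compressed samples holds more items than a buffer of raw ones. This is a minimal hypothetical sketch using lossless zlib on uint8 "images"; a realistic setting (e.g., lossy codecs and a quality/quantity trade-off) differs.

```python
import zlib
import numpy as np

# Replay buffer that stores compressed samples under a fixed byte budget.
# Hypothetical sketch: lossless zlib compression of uint8 arrays.

class CompressedBuffer:
    def __init__(self, budget_bytes):
        self.budget = budget_bytes
        self.items = []          # (compressed bytes, shape, dtype) per sample
        self.used = 0

    def add(self, arr):
        blob = zlib.compress(arr.tobytes())
        if self.used + len(blob) > self.budget:
            return False         # budget exhausted
        self.items.append((blob, arr.shape, arr.dtype))
        self.used += len(blob)
        return True

    def get(self, i):
        blob, shape, dtype = self.items[i]
        return np.frombuffer(zlib.decompress(blob), dtype=dtype).reshape(shape)

buf = CompressedBuffer(budget_bytes=4096)
img = np.zeros((32, 32), dtype=np.uint8)   # a highly compressible toy sample
buf.add(img)
assert np.array_equal(buf.get(0), img)     # lossless round trip
```

For this toy all-zero image, each compressed entry is far smaller than the 1024 raw bytes, so the 4 KiB budget stores many more samples than the 4 it could hold uncompressed.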
Autoregressive models (ARMs) are widely regarded as the cornerstone of large language models (LLMs). We challenge this notion by introducing LLaDA, a diffusion model trained from scratch under the pre-training and supervised fine-tuning (SFT) paradigm. LLaDA models distributions through a forward data masking process and a reverse process, parameterized by a vanilla Transformer to predict masked tokens. By optimizing a likelihood bound, it provides a principled generative approach for probabilistic inference. Across extensive...
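The forward data masking process described above can be sketched in a few lines: each token is independently replaced by a mask token with some probability t, and a Transformer is trained to predict the masked positions. The mask id and token values below are hypothetical.

```python
import numpy as np

# Masked-diffusion forward process: each token is independently replaced by a
# [MASK] token with probability t (with t drawn uniformly during training).
# The mask id and the toy vocabulary are assumptions of this sketch.

MASK_ID = 0

def forward_mask(tokens, t, rng):
    tokens = np.asarray(tokens)
    mask = rng.random(tokens.shape) < t          # which positions to corrupt
    noised = np.where(mask, MASK_ID, tokens)     # replace them with [MASK]
    return noised, mask

rng = np.random.default_rng(0)
noised, mask = forward_mask([5, 7, 9, 11], t=0.5, rng=rng)
# Unmasked positions keep their original tokens; masked ones become MASK_ID.
```

The reverse process then amounts to repeatedly predicting and filling in masked tokens, and the training loss on masked positions yields the likelihood bound mentioned in the abstract.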
Deep generative models (DGMs) can effectively capture the underlying distributions of complex data by learning multilayered representations and performing inference. However, relatively little has been done to boost the discriminative ability of DGMs. This paper presents max-margin deep generative models (mmDGMs) and a class-conditional variant (mmDCGMs), which explore the strongly discriminative max-margin principle to improve the predictive performance of DGMs in both supervised and semi-supervised learning, while retaining the generative capability. In semi-supervised learning, we use the predictions...
Deep neural networks have shown promise in collaborative filtering (CF). However, existing approaches are either user-based or item-based, and cannot leverage all the underlying information explicitly. We propose CF-UIcA, a neural co-autoregressive model for CF tasks, which exploits the structural correlation in the domains of both users and items. The co-autoregression allows extra desired properties to be incorporated for different tasks. Furthermore, we develop an efficient stochastic learning algorithm to handle...
Memory units have been widely used to enrich the capabilities of deep networks in capturing long-term dependencies in reasoning and prediction tasks, but little investigation exists on deep generative models (DGMs), which are good at inferring high-level invariant representations from unlabeled data. This paper presents a deep generative model with a possibly large external memory and an attention mechanism to capture the local detail information that is often lost in the bottom-up abstraction process of representation learning. By...
For the first time, this paper develops a novel stochastic computing method by utilizing the inherent random noise of analog RRAM. With the designed switching characteristics, an RRAM device can realize the function of sampling from a tunable probabilistic distribution. A Bayesian neural network (BayNN), whose weights are represented by probability distributions, is experimentally demonstrated on a fabricated 160K RRAM array. The measured result achieves 97% accuracy for image classification on the MNIST dataset. Moreover,...
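To illustrate what "weights represented by probability distributions" means computationally, here is a toy software sketch of a single-layer Bayesian network: each forward pass draws a fresh weight sample and the prediction averages over many such passes. Shapes, Gaussian weight distributions, and the tanh layer are assumptions of this sketch; in the paper the per-pass sampling is done physically by the random noise of analog RRAM cells.

```python
import numpy as np

# Toy Bayesian neural network prediction: each weight is a distribution
# (here Gaussian with mean w_mu and std w_sigma, an assumption), and the
# predictive output is the mean over sampled weight realizations.

def bayesian_predict(x, w_mu, w_sigma, n_samples=256, seed=0):
    rng = np.random.default_rng(seed)
    outs = []
    for _ in range(n_samples):
        w = rng.normal(w_mu, w_sigma)   # one weight sample per forward pass
        outs.append(np.tanh(x @ w))     # a toy single nonlinear layer
    return np.mean(outs, axis=0)        # predictive mean over weight samples

x = np.ones((1, 4))
w_mu = np.zeros((4, 2))
w_sigma = np.full((4, 2), 0.1)
pred = bayesian_predict(x, w_mu, w_sigma)
```

The spread of the per-sample outputs also gives an uncertainty estimate for free, which is the usual motivation for Bayesian weights over point estimates.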
Deep generative models (DGMs) are effective at learning multilayered representations of complex data and performing inference on input data by exploring the generative ability. However, little work has been done on examining or empowering the discriminative ability of DGMs for making accurate predictions. This paper presents max-margin deep generative models (mmDGMs), which explore the strongly discriminative max-margin principle to improve the predictive power of DGMs, while retaining the generative capability. We develop an efficient doubly stochastic subgradient algorithm for the piecewise linear...
We present Mixture of Contrastive Experts (MiCE), a unified probabilistic clustering framework that simultaneously exploits the discriminative representations learned by contrastive learning and the semantic structures captured by a latent mixture model. Motivated by the mixture of experts, MiCE employs a gating function to partition an unlabeled dataset into subsets according to semantics, and multiple experts to discriminate distinct subsets of instances assigned to them in a contrastive manner. To solve the nontrivial inference and learning problems caused by the latent variables, we...