NFDI4DS | UHH-SEMS - Publication Details

Variational Autoencoder for Semi-Supervised Text Classification

OPENALEX - Publications

Weidi Xu Haoze Sun Chao Deng Ying Tan

Although the semi-supervised variational autoencoder (SemiVAE) works in the image classification task, it fails in the text classification task if using vanilla LSTM as its decoder. From a perspective of reinforcement learning, it is verified that the decoder's capability to distinguish between different categorical labels is essential. Therefore, the Semi-supervised Sequential Variational Autoencoder (SSVAE) is proposed, which increases the capability by feeding the label into its decoder RNN at each time-step. Two specific structures are investigated and...

10.1609/aaai.v31i1.10966 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2017-02-12

Baichuan 2: Open Large-scale Language Models

OPENALEX - Publications

A. Y. Yang Bin Xiao Bingning Wang Borong Zhang Ce Bian and 50 more

Large language models (LLMs) have demonstrated remarkable performance on a variety of natural language tasks based on just a few examples of natural language instructions, reducing the need for extensive feature engineering. However, most powerful LLMs are closed-source or limited in their capability for languages other than English. In this technical report, we present Baichuan 2, a series of large-scale multilingual language models containing 7 billion and 13 billion parameters, trained from scratch, on 2.6 trillion tokens. Baichuan 2 matches or outperforms other open-source...

10.48550/arxiv.2309.10305 preprint EN other-oa arXiv (Cornell University) 2023-01-01

A threat assessment method of group targets based on interval-valued intuitionistic fuzzy multi-attribute group decision-making

OPENALEX - Publications

Depeng Kong Tianqing Chang Quandong Wang Haoze Sun Wenjun Dai

10.1016/j.asoc.2018.03.015 article EN Applied Soft Computing 2018-03-18

An improved artificial bee colony algorithm based on elite group guidance and combined breadth-depth search strategy

OPENALEX - Publications

Depeng Kong Tianqing Chang Wenjun Dai Quandong Wang Haoze Sun

10.1016/j.ins.2018.02.025 article EN Information Sciences 2018-02-17

TDFusion: When Tensor Decomposition Meets Medical Image Fusion in the Nonsubsampled Shearlet Transform Domain

OPENALEX - Publications

Rui Zhang Zhongyang Wang Haoze Sun Lizhen Deng Hu Zhu

In this paper, a unified optimization model for medical image fusion based on tensor decomposition and the non-subsampled shearlet transform (NSST) is proposed. The model is based on the NSST method and the tensor decomposition method to fuse the high-frequency (HF) and low-frequency (LF) parts of two source images to obtain a mixed-frequency fused image. In general, we integrate low-frequency and high-frequency information from the perspective of tensor decomposition (TD) fusion. Due to the structural differences between the high-frequency and low-frequency representations, potential information loss may occur in the fused images. To address this issue, we introduce joint static and dynamic guidance...

10.3390/s23146616 article EN cc-by Sensors 2023-07-23

Optimal multivariate quota-share reinsurance: A nonparametric mean-CVaR framework

OPENALEX - Publications

Haoze Sun Chengguo Weng Yi Zhang

10.1016/j.insmatheco.2016.11.006 article EN Insurance: Mathematics and Economics 2016-11-30

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

OPENALEX - Publications

Miao Zheng Liang Hao Fan Yang Haoze Sun Tianpeng Li and 14 more

In recent years, the rise of Large Language Models (LLMs) has spurred a growing demand for plug-and-play AI systems. Among the various AI techniques, prompt engineering stands out as particularly significant. However, users often face challenges in writing prompts due to the steep learning curve and significant time investment, and existing automatic prompt engineering (APE) models can be difficult to use. To address this issue, we propose PAS, an LLM-based plug-and-play APE system. PAS utilizes LLMs trained on high-quality, automatically...

10.48550/arxiv.2407.06027 preprint EN arXiv (Cornell University) 2024-07-08

Medical image fusion via decoupled representation and component-wise regularization learning

OPENALEX - Publications

Rui Zhang Haoze Sun Lizhen Deng Hu Zhu Wei Qian

10.1016/j.bspc.2024.106859 article EN Biomedical Signal Processing and Control 2024-11-01

Sogou Machine Reading Comprehension Toolkit

OPENALEX - Publications

Jindou Wu Yunlun Yang Chao Deng Hongyi Tang Bingning Wang and 3 more

Machine reading comprehension has been intensively studied in recent years, and neural network-based models have shown dominant performances. In this paper, we present a Sogou Machine Reading Comprehension (SMRC) toolkit that can be used to provide the fast and efficient development of modern machine comprehension models, including both published models and original prototypes. To achieve this goal, the toolkit provides dataset readers, a flexible preprocessing pipeline, necessary neural network components, and built-in models, which make the whole process of data...

10.48550/arxiv.1903.11848 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Variational Autoencoders for Semi-supervised Text Classification

OPENALEX - Publications

Weidi Xu Haoze Sun Chao Deng Ying Tan

Although the semi-supervised variational autoencoder (SemiVAE) works in the image classification task, it fails in the text classification task if using vanilla LSTM as its decoder. From a perspective of reinforcement learning, it is verified that the decoder's capability to distinguish between different categorical labels is essential. Therefore, the Semi-supervised Sequential Variational Autoencoder (SSVAE) is proposed, which increases the capability by feeding the label into its decoder RNN at each time-step. Two specific structures are investigated and...

10.48550/arxiv.1603.02514 preprint EN public-domain arXiv (Cornell University) 2016-01-01

Multi-digit image synthesis using recurrent conditional variational autoencoder

OPENALEX - Publications

Haoze Sun Weidi Xu Chao Deng Ying Tan

In the field of deep neural networks, several generative methods have been proposed to address the challenges from generative and discriminative tasks, e.g., natural language processing, image captioning, and image generation. In this paper, a conditional recurrent variational autoencoder is proposed for multi-digit image synthesis. This model is capable of generating multi-digit images given number sequences and retaining the generalisation ability to recover different types of background. Our method is evaluated on the SVHN dataset and the experimental results show it succeeds...

10.1109/ijcnn.2016.7727223 article EN 2016 International Joint Conference on Neural Networks (IJCNN) 2016-07-01

Armored Target Detection in Battlefield Environment Based on Top-Down Aggregation Network and Hierarchical Scale Optimization

OPENALEX - Publications

Haoze Sun Tianqing Chang Lei Zhang Yang Guo-zhen Bin Han and 1 more

Armored equipment plays a crucial role in the ground battlefield. The fast and accurate detection of enemy armored targets is significant to take the initiative in the battlefield. Compared with general object detection and vehicle detection, armored target detection in the battlefield environment is more challenging due to the long distance of observation and the complicated environment. In this paper, an accurate and robust automatic detection method is proposed to detect armored targets in the battlefield environment. Firstly, inspired by Feature Pyramid Network (FPN), we propose a top-down aggregation (TDA) network which enhances shallow feature maps...

10.1142/s0218001419500071 article EN International Journal of Pattern Recognition and Artificial Intelligence 2018-09-11

TextDream: Conditional Text Generation by Searching in the Semantic Space

OPENALEX - Publications

Weidi Xu Haoze Sun Chao Deng Ying Tan

Conditional text generation is a fundamental task in natural language generation. Traditional conditional generative models build probability distributions over the text given the labels. However, categorical label information is usually very abstract, e.g., sentiment, and it is difficult to disentangle from the text content. Therefore, instead of generating text by modeling the distribution, we propose a novel method, TextDream, which generates text through searching in the semantic space. Specifically, this method, random seed initially new...

10.1109/cec.2018.8477776 article EN 2018 IEEE Congress on Evolutionary Computation (CEC) 2018-07-01

Empirical likelihood based confidence intervals for the tail index when

OPENALEX - Publications

Haoze Sun Yuexiang Jiang

10.1016/j.spl.2013.10.001 article EN Statistics & Probability Letters 2013-10-11

Excessive-emission vehicles real-time track matching algorithm based on road network topology and weights

OPENALEX - Publications

Haoze Sun Peng Jiang Qingshan She Cheng Yu Hongze Lin and 2 more

The algorithm for excessive-emission vehicles track matching based on network topology and weights (NTWMA) is proposed in this paper to resolve the trajectory matching problems of excessive-emission vehicles. The topological structure of the road network is initially constructed by breadth-first traversal. Then, by using road network constraints to construct a set of adjacent candidate sections for matching, the sum of the distance, direction, and relative weight of each candidate section is considered as the solving condition. Finally, the optimal section sequence is calculated by the Dijkstra algorithm,...

10.23919/chicc.2019.8865886 article EN 2019-07-01

CoSeR: Bridging Image and Language for Cognitive Super-Resolution

OPENALEX - Publications

Haoze Sun Wenbo Li Jianzhuang Liu Haoyu Chen Renjing Pei and 3 more

Existing super-resolution (SR) models primarily focus on restoring local texture details, often neglecting the global semantic information within the scene. This oversight can lead to the omission of crucial semantic details or the introduction of inaccurate textures during the recovery process. In our work, we introduce the Cognitive Super-Resolution (CoSeR) framework, empowering SR models with the capacity to comprehend low-resolution images. We achieve this by marrying image appearance and language understanding to generate a cognitive...

10.48550/arxiv.2311.16512 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

OPENALEX - Publications

Haoyu Chen Wenbo Li Jinjin Gu Jingjing Ren Haoze Sun and 4 more

For image super-resolution (SR), bridging the gap between the performance on synthetic datasets and real-world degradation scenarios remains a challenge. This work introduces a novel "Low-Res Leads the Way" (LWay) training framework, merging Supervised Pre-training with Self-supervised Learning to enhance the adaptability of SR models to real-world images. Our approach utilizes a low-resolution (LR) reconstruction network to extract degradation embeddings from LR images, merging them with super-resolved outputs for LR reconstruction. Leveraging unseen...

10.48550/arxiv.2403.02601 preprint EN arXiv (Cornell University) 2024-03-04

Baichuan-Omni Technical Report

OPENALEX - Publications

Yadong Li Haoze Sun Mingan Lin Tianpeng Li Guosheng Dong and 22 more

The salient multimodal capabilities and interactive experience of GPT-4o highlight its critical role in practical applications, yet it lacks a high-performing open-source counterpart. In this paper, we introduce Baichuan-Omni, the first open-source 7B Multimodal Large Language Model (MLLM) adept at concurrently processing and analyzing the modalities of image, video, audio, and text, while delivering an advanced multimodal interactive experience and strong performance. We propose an effective multimodal training schema starting with a 7B model and proceeding through two stages...

10.48550/arxiv.2410.08565 preprint EN arXiv (Cornell University) 2024-10-11

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning

OPENALEX - Publications

Mingyang Chen Haoze Sun Tianpeng Li Fan Yang Hao Liang and 5 more

Large Language Models (LLMs) have exhibited significant potential in performing diverse tasks, including the ability to call functions or use external tools to enhance their performance. While current research on function calling by LLMs primarily focuses on single-turn interactions, this paper addresses the overlooked necessity for LLMs to engage in multi-turn function calling--critical for handling compositional, real-world queries that require planning with functions but not only use functions. To facilitate this, we introduce an...

10.48550/arxiv.2410.12952 preprint EN arXiv (Cornell University) 2024-10-16

Beyond Pixels: Text Enhances Generalization in Real-World Image Restoration

OPENALEX - Publications

Haoze Sun Wenbo Li Jiayue Liu Kaiwen Zhou Yongqiang Chen and 5 more

Generalization has long been a central challenge in real-world image restoration. While recent diffusion-based restoration methods, which leverage generative priors from text-to-image models, have made progress in recovering more realistic details, they still encounter "generative capability deactivation" when applied to out-of-distribution real-world data. To address this, we propose using text as an auxiliary invariant representation to reactivate the generative capabilities of these models. We begin by identifying...

10.48550/arxiv.2412.00878 preprint EN arXiv (Cornell University) 2024-12-01

Non-parametric tests for the tail equivalence via empirical likelihood

OPENALEX - Publications

Yuexiang Jiang Haoze Sun Yi Zhang Huaigang Long

In this paper, the problem of whether the left tail and the right tail of a distribution share the same extreme value index (EVI) is addressed, and we propose two different test statistics. The first one is based on the result of the joint asymptotic normality of the two Hill estimators for the EVIs of both tails. Therefore, we can construct a quotient-type test statistic, which is χ²(1) distributed after some standardization. The second test statistic proposed in this paper is inspired by the two-sample empirical likelihood methodology, and we prove its non-parametric version of Wilks'...

10.1080/03610926.2016.1242736 article EN Communications in Statistics - Theory and Methods 2016-10-20

Fast Armored Target Detection Based on Lightweight Network

OPENALEX - Publications

Haoze Sun Tianqing Chang Lei Zhang Yang Guo-zhen Bin Han and 1 more

10.3724/sp.j.1089.2019.17467 article EN Journal of Computer-Aided Design & Computer Graphics 2019-01-01

Research on Trajectory Prediction Method of Mobile Pollution Source Based on Hybrid Genetic Particle Swarm and optimized Extreme Learning Machine

OPENALEX - Publications

Fan Zhang Haoze Sun Peng Jiang Qingshan She Huan Xu and 1 more

In order to accurately predict the trajectory of mobile pollution sources such as motor vehicles in real time, a trajectory prediction method based on hybrid genetic particle swarm optimization and optimized extreme learning machine (HGPSO-OELM) is proposed in this paper. The Optimized Extreme Learning Machine (OELM) avoids the disadvantage of the traditional extreme learning machine (ELM), which has poor generalization performance for small data sets. However, due to the random assignment of input weights and hidden layer node biases parameter groups, the accuracy...

10.1109/cac48633.2019.8996782 article EN 2019-11-01