NFDI4DS | UHH-SEMS - Publication Details

Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach

OPENALEX - Publications

Mai Xu Yuhang Song Jianyi Wang Minglang Qiao Liangyu Huo and 1 more

Panoramic video provides immersive and interactive experience by enabling humans to control the field of view (FoV) through head movement (HM). Thus, HM plays a key role in modeling human attention on panoramic video. This paper establishes database collecting subjects' sequences. From this database, we find that data are highly consistent across subjects. Furthermore, deep reinforcement learning (DRL) can be applied predict positions, via maximizing reward imitating scanpaths agent's...

10.1109/tpami.2018.2858783 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-07-25

HyperSOR: Context-aware Graph Hypernetwork for Salient Object Ranking

OPENALEX - Publications

Minglang Qiao Mai Xu Lai Jiang Peng Lei Shi-Jie Wen and 2 more

Salient object ranking (SOR) aims to segment salient objects in an image and simultaneously predict their saliency rankings, according the shifted human attention over different objects. The existing SOR approaches mainly focus on object-based attention, e.g., semantic appearance of object. However, we find that scene context plays a vital role SOR, which same varies lot at scenes. In this paper, thus make first attempt towards explicitly learning for SOR. Specifically, establish large-scale...

10.1109/tpami.2024.3368158 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-02-21

Viewport-Dependent Saliency Prediction in 360° Video

OPENALEX - Publications

Minglang Qiao Mai Xu Zulin Wang Ali Borji

Saliency prediction in traditional images and videos has drawn extensive research interests recent years. Few works have been proposed for saliency over 360° videos. They focus on directly predicting fixations the whole panorama. When viewing videos, a person can only observe content her viewport, which means that fraction of scene be seen at any given time. In this paper, we study human attention viewport propose novel visual model, dubbed saliency, to predict Two contributions are...

10.1109/tmm.2020.2987682 article EN IEEE Transactions on Multimedia 2020-04-20

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results

OPENALEX - Publications

Ren Yang Radu Timofte Meisong Zheng Qunliang Xing Minglang Qiao and 74 more

This paper reviews the NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video. In this challenge, we proposed LDV 2.0 dataset, which includes dataset (240 videos) 95 additional videos. challenge three tracks. Track 1 aims at enhancing videos compressed by HEVC a fixed QP. 2 3 target both super-resolution quality enhancement video. They require x2 x4 super-resolution, respectively. The tracks totally attract more than 600 registrations. test phase, 8 teams, teams...

10.1109/cvprw56347.2022.00129 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022-06-01

Solar-powered flow reactor

OPENALEX - Publications

Minglang Qiao

10.1038/s44286-025-00202-0 article EN Nature Chemical Engineering 2025-03-24

Joint Learning of Audio–Visual Saliency Prediction and Sound Source Localization on Multi-face Videos

OPENALEX - Publications

Minglang Qiao Yufan Liu Mai Xu Xin Deng Bing Li and 2 more

10.1007/s11263-023-01950-3 article EN International Journal of Computer Vision 2023-12-29

Saliency Prediction on Mobile Videos: A Fixation Mapping-Based Dataset and A Transformer Approach

OPENALEX - Publications

Shi-Jie Wen Li Yang Mai Xu Minglang Qiao Tao Xu and 1 more

With the booming development of smart devices, mobile videos have drawn broad interest when humans surf social media. Different from traditional long-form videos, are featured with uncertain human attention behavior so far owing to specific displaying mode, thus promoting research on saliency prediction for videos. Unfortunately, current eye-tracking experiments not applicable since stationary eye-tracker and eye fixation acquisition dedicated presented computers. To tackle this issue, we...

10.1109/tcsvt.2023.3342903 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-12-14

Hand gesture recognition method by radar based on convolutional neural network

OPENALEX - Publications

Wang Jun Tong Zheng Peng Lei Yuan Zhang Minglang Qiao

10.13700/j.bh.1001-5965.2017.0397 article EN Beijing Hangkong Hangtian Daxue xuebao 2018-06-20

Progressive Training of A Two-Stage Framework for Video Restoration

OPENALEX - Publications

Meisong Zheng Qunliang Xing Minglang Qiao Mai Xu Lai Jiang and 2 more

As a widely studied task, video restoration aims to enhance the quality of videos with multiple potential degradations, such as noises, blurs and compression artifacts. Among restorations, compressed enhancement super-resolution are two main tacks significant values in practical scenarios. Recently, recurrent neural networks transformers attract increasing research interests this field, due their impressive capability sequence-to-sequence modeling. However, training these models is not only...

10.1109/cvprw56347.2022.00115 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022-06-01

Residual based hierarchical feature compression for multi-task machine vision

OPENALEX - Publications

Chaoran Chen Mai Xu Shengxi Li Shipeng Li Minglang Qiao and 1 more

With the remarkable success of deep learning, image/video coding for machines (VCM) has been playing an important role in facilitating intelligent vision tasks. However, existing VCM methods suffer from either sub-optimality using image compression standards, or generalisation issues learning-based methods. To address these issues, this paper proposes a residual-based hierarchical feature (RHFC) method to achieve optimal and universal object detection segmentation. More specifically, we...

10.1109/icme55011.2023.00253 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2023-07-01

Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach

OPENALEX - Publications

Minglang Qiao Mai Xu Shi-Jie Wen Lai Jiang Shengxi Li and 3 more

Predicting video saliency is crucial for improving sports processing efficiency, thereby providing an enriched viewing experience a wide-ranging audience. However, there long-term absence of well-established eye-tracking database and learning-based approach, particularly tailored videos. In this paper, we establish large-scale dubbed audio-visual (AVS). AVS consists 1,000 high-quality videos with eye fixations from 60 participants. Through the data analysis on AVS, observe that human...

10.1109/icassp48485.2024.10446481 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

MT-VQA: A Multi-task Approach for Quality Assessment of Short-form Videos

OPENALEX - Publications

Shi-Jie Wen Minglang Qiao Lai Jiang Mai Xu Xin Deng and 1 more

10.1145/3689093.3689181 article EN 2024-10-20

Saliency Prediction of Traffic Surveillance Videos: A Benchmark and A Multi-Task Approach

OPENALEX - Publications

Weiwei Duan Minglang Qiao Lai Jiang Mai Xu Xin Deng and 2 more

10.1109/wcsp62071.2024.10826669 article EN 2021 13th International Conference on Wireless Communications and Signal Processing (WCSP) 2024-10-24

A Novel Patch Selection Approach for Quality Assessment on Short-Form Videos

OPENALEX - Publications

Shi-Jie Wen Lai Jiang Minglang Qiao Mai Xu Xin Deng and 2 more

10.1109/wcsp62071.2024.10826890 article EN 2021 13th International Conference on Wireless Communications and Signal Processing (WCSP) 2024-10-24

Prediction of wood pulp K with radial basis function neural network

OPENALEX - Publications

M.T. Musavi D.R. Coughlin Minglang Qiao

This paper presents a radial basis function (RBF) network for prediction of continuous wood pulp delignification factor. In making process, the quality is measured by K# which related to lignin content remaining in pulp. Availability an accurate during any time digester operation very critical control and saving million dollars reducing energy raw material consumption. To assure operation, currently human experts who analyze samples plant's laboratory then decide how process variables....

10.1109/iscas.1995.523743 article EN 2002-11-19

Radial basis function neural network as predictive process control model

OPENALEX - Publications

M.J. Sinclair M.T. Musavi Minglang Qiao

This is an experimental study to compare the performance of widespread backpropagation network (BP) a radial basis function (RBF) and generalized regression neural (GRNN) for potential use as on-line process models. Criteria comparison include generalization ability unseen data, robustness shifts, with sparse training computational demands.

10.1109/iscas.1995.523801 article EN 2002-11-19

Classification of mouse chromosomes using artificial neural networks

OPENALEX - Publications

M.T. Musavi Minglang Qiao Muriel T. Davisson Ellen C. Akeson

This paper presents the results of our experiments for classification mouse chromosomes using a radial basis function (RBF) and probabilistic neural network (PNN). The fast orthogonal search (FOS) was utilized training RBF network. There were 840 540 testing chromosomes. best error rate recorded at 16.4% result is better than available 18.3% which achieved with much more

10.1109/icnn.1996.549008 article EN Proceedings of International Conference on Neural Networks (ICNN'96) 2002-12-23

Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos

OPENALEX - Publications

Minglang Qiao Yufan Liu Mai Xu Xin Deng Bing Li and 2 more

Visual and audio events simultaneously occur both attract attention. However, most existing saliency prediction works ignore the influence of only consider vision modality. In this paper, we propose a multitask learning method for visual-audio sound source localization on multi-face video by leveraging visual, face information. Specifically, first introduce large-scale database in condition (MVVA), containing eye-tracking data annotations. Using database, find that influences human...

10.48550/arxiv.2111.08567 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Progressive Training of A Two-Stage Framework for Video Restoration

OPENALEX - Publications

Meisong Zheng Qunliang Xing Minglang Qiao Mai Xu Lai Jiang and 2 more

As a widely studied task, video restoration aims to enhance the quality of videos with multiple potential degradations, such as noises, blurs and compression artifacts. Among restorations, compressed enhancement super-resolution are two main tacks significant values in practical scenarios. Recently, recurrent neural networks transformers attract increasing research interests this field, due their impressive capability sequence-to-sequence modeling. However, training these models is not only...

10.48550/arxiv.2204.09924 preprint EN cc-by-nc-nd arXiv (Cornell University) 2022-01-01

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results

OPENALEX - Publications

Ren Yang Radu Timofte Meisong Zheng Qunliang Xing Minglang Qiao and 74 more

This paper reviews the NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video. In this challenge, we proposed LDV 2.0 dataset, which includes dataset (240 videos) 95 additional videos. challenge three tracks. Track 1 aims at enhancing videos compressed by HEVC a fixed QP. 2 3 target both super-resolution quality enhancement video. They require x2 x4 super-resolution, respectively. The tracks totally attract more than 600 registrations. test phase, 8 teams, teams...

10.48550/arxiv.2204.09314 preprint EN cc-by-nc-nd arXiv (Cornell University) 2022-01-01