NFDI4DS | UHH-SEMS - Publication Details

Yilin Wang

ORCID: 0000-0003-4031-8753

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100449864

Research Areas

Image and Video Quality Assessment
Advanced Image Processing Techniques
Bauxite Residue and Utilization
Image Enhancement Techniques
Extraction and Separation Processes
Advanced Image and Video Retrieval Techniques
Visual Attention and Saliency Detection
Advanced Neural Network Applications
Advanced Vision and Imaging
Industrial Vision Systems and Defect Detection
Generative Adversarial Networks and Image Synthesis
Video Analysis and Summarization
Recycling and utilization of industrial and municipal waste in materials production
Welding Techniques and Residual Stresses
Domain Adaptation and Few-Shot Learning
Aluminum Alloys Composites Properties
Multimodal Machine Learning Applications
Video Surveillance and Tracking Methods
Image Retrieval and Classification Techniques
Anomaly Detection Techniques and Applications
Additive Manufacturing Materials and Processes
Advanced Image Fusion Techniques
Image and Signal Denoising Methods
Advanced Welding Techniques Analysis
Computational Drug Discovery Methods

Google (United States)
2016-2025

Zhengzhou University
2025

Shandong Academy of Sciences
2024-2025

Qilu University of Technology
2023-2025

Chongqing University of Arts and Sciences
2025

Hong Kong Polytechnic University
2022-2024

University of Science and Technology Beijing
2021-2024

Adobe Systems (United States)
2020-2024

Tsinghua University
2017-2024

Jimei University
2022-2024

MUSIQ: Multi-scale Image Quality Transformer

OPENALEX - Publications

Junjie Ke Qifei Wang Yilin Wang Peyman Milanfar Feng Yang

Image quality assessment (IQA) is an important research topic for understanding and improving visual experience. The current state-of-the-art IQA methods are based on convolutional neural networks (CNNs). performance of CNN-based models often compromised by the fixed shape constraint in batch training. To accommodate this, input images usually resized cropped to a shape, causing image degradation. address we design multi-scale Transformer (MUSIQ) process native resolution with varying sizes...

10.1109/iccv48922.2021.00510 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Image Quality Assessment Using Contrastive Learning

OPENALEX - Publications

Pavan C. Madhusudana Neil Birkbeck Yilin Wang Balu Adsumilli Alan C. Bovik

We consider the problem of obtaining image quality representations in a self-supervised manner. use prediction distortion type and degree as an auxiliary task to learn features from unlabeled dataset containing mixture synthetic realistic distortions. then train deep Convolutional Neural Network (CNN) using contrastive pairwise objective solve problem. refer proposed training framework resulting IQA model CONTRastive Image QUality Evaluator (CONTRIQUE). During evaluation, CNN weights are...

10.1109/tip.2022.3181496 article EN publisher-specific-oa IEEE Transactions on Image Processing 2022-01-01

RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content

OPENALEX - Publications

Zhengzhong Tu Xiangxu Yu Yilin Wang Neil Birkbeck Balu Adsumilli and 1 more

Blind or no-reference video quality assessment of user-generated content (UGC) has become a trending, challenging, heretofore unsolved problem. Accurate and efficient predictors suitable for this are thus in great demand to achieve more intelligent analysis processing UGC videos. Previous studies have shown that natural scene statistics deep learning features both sufficient capture spatial distortions, which contribute significant aspect issues. However, these models either incapable...

10.1109/ojsp.2021.3090333 article EN cc-by IEEE Open Journal of Signal Processing 2021-01-01

Multimodal Contrastive Training for Visual Representation Learning

OPENALEX - Publications

Xin Yuan Zhe Lin Jason Kuen Jianming Zhang Yilin Wang and 3 more

We develop an approach to learning visual representations that embraces multimodal data, driven by a combination of intra- and inter-modal similarity preservation objectives. Unlike existing pre-training methods, which solve proxy prediction task in single domain, our method exploits intrinsic data properties within each modality semantic information from cross-modal correlation simultaneously, hence improving the quality learned representations. By including training unified framework with...

10.1109/cvpr46437.2021.00692 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Lite Vision Transformer with Enhanced Self-Attention

OPENALEX - Publications

Chenglin Yang Yilin Wang Jianming Zhang He Zhang Zijun Wei and 2 more

Despite the impressive representation capacity of vision transformer models, current light-weight models still suffer from inconsistent and incorrect dense predictions at local regions. We suspect that power their self-attention mechanism is limited in shallower thinner networks. propose Lite Vision Transformer (LVT), a novel network with two enhanced mechanisms to improve model performances for mobile deployment. For low-level features, we introduce Convolutional Self-Attention (CSA)....

10.1109/cvpr52688.2022.01169 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Novel ways for hydrogen production based on methane steam and dry reforming integrated with carbon capture

OPENALEX - Publications

Bosheng Su Yilin Wang Zhilong Xu Wei Han Hongguang Jin and 1 more

The combination of methane steam reforming technology and CCS (Carbon Capture Storage) has great potential to reduce carbon emissions in the process hydrogen production. Different from traditional idea capturing CO2 Dioxide) exhaust gas with high work consumption, this study simultaneously focuses on separation fuel recycling. A new production system is developed by coupled capture. Separated captured high-purity dioxide could be recycled for dry reforming; basis, a...

10.1016/j.enconman.2022.116199 article EN cc-by-nc-nd Energy Conversion and Management 2022-09-18

Influence of welding sequences on residual stress and deformation of U-rib components fabricated by laser-arc hybrid welding

OPENALEX - Publications

Shaoning Geng Yuantai Li Ping Jiang Yilin Wang Jun Jin and 1 more

10.1016/j.jmrt.2025.01.032 article EN cc-by-nc-nd Journal of Materials Research and Technology 2025-01-07

Real-time human gesture grading based on OpenPose

OPENALEX - Publications

Sen Qiao Yilin Wang Jian Li

In this paper, we presented a real-time 2D human gesture grading system from monocular images based on OpenPose, library for multi-person keypoint detection. After capturing positions of person's joints and skeleton wireframe the body, computed equation motion trajectory every joint. Similarity metric was defined as distance between trajectories standard videos. A modifiable scoring formula used simulating scenario. Experimental results showed that worked efficiently with high performance,...

10.1109/cisp-bmei.2017.8301910 article EN 2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) 2017-10-01

Mask Guided Matting via Progressive Refinement Network

OPENALEX - Publications

Qihang Yu Jianming Zhang He Zhang Yilin Wang Zhe Lin and 3 more

We propose Mask Guided (MG) Matting, a robust matting framework that takes general coarse mask as guidance. MG Matting leverages network (PRN) design which encourages the model to provide self-guidance progressively refine uncertain regions through decoding process. A series of guidance perturbation operations are also introduced in training further enhance its robustness external show PRN can generalize unseen types masks such trimap and low-quality alpha matte, making it suitable for...

10.1109/cvpr46437.2021.00121 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content

OPENALEX - Publications

Zhengzhong Tu Yilin Wang Neil Birkbeck Balu Adsumilli Alan C. Bovik

Recent years have witnessed an explosion of user-generated content (UGC) videos shared and streamed over the Internet, thanks to evolution affordable reliable consumer capture devices, tremendous popularity social media platforms. Accordingly, there is a great need for accurate video quality assessment (VQA) models UGC/consumer monitor, control, optimize this vast content. Blind prediction in-the-wild quite challenging, since degradations UGC are unpredictable, complicated, often commingled....

10.1109/tip.2021.3072221 article EN IEEE Transactions on Image Processing 2021-01-01

Rich features for perceptual quality assessment of UGC videos

OPENALEX - Publications

Yilin Wang Junjie Ke Hossein Talebi Joong Gon Yim Neil Birkbeck and 3 more

Video quality assessment for User Generated Content (UGC) is an important topic in both industry and academia. Most existing methods only focus on one aspect of the perceptual assessment, such as technical or compression artifacts. In this paper, we create a large scale dataset to comprehensively investigate characteristics generic UGC video quality. Besides subjective ratings content labels dataset, also propose DNN-based framework thoroughly analyze importance content, quality, level Our...

10.1109/cvpr46437.2021.01323 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Mitigation of porosity in adjustable-ring-mode laser welding of medium-thick aluminum alloy

OPENALEX - Publications

Jianmin Li Shaoning Geng Yilin Wang Chunming Wang Ping Jiang

10.1016/j.ijheatmasstransfer.2024.125514 article EN International Journal of Heat and Mass Transfer 2024-04-12

QSPR Studies on Vapor Pressure, Aqueous Solubility, and the Prediction of Water−Air Partition Coefficients

OPENALEX - Publications

Alan R. Katritzky Yilin Wang Sulev Sild Tarmo Tamm Mati Karelson

The vapor pressures and the aqueous solubilities of 411 compounds with a large structural diversity were investigated using quantitative structure−property relationship (QSPR) approach. A five-descriptor equation squared correlation coefficient (R2) 0.949 for pressure six-descriptor R2 0.879 solubility obtained. All descriptors derived solely from chemical structure compounds. QSPR equations allow reliable prediction water−air partition coefficients.

10.1021/ci980022t article EN Journal of Chemical Information and Computer Sciences 1998-06-30

Sentiment Analysis for Social Media Images

OPENALEX - Publications

Yilin Wang Baoxin Li

In this proposal, we study the problem of understanding human sentiments from large scale collection Internet images based on both image features and contextual social network information (such as friend comments user description). Despite great strides in analyzing sentiment text information, analysis behind content has largely been ignored. Thus, extend significant advances text-based prediction tasks to higher level challenge predicting underlying images. We show that neither visual nor...

10.1109/icdmw.2015.142 article EN 2015-11-01

Hierarchical Attention Network for Action Recognition in Videos

OPENALEX - Publications

Yilin Wang Suhang Wang Jiliang Tang Neil O’Hare Yi Chang and 1 more

Understanding human actions in wild videos is an important task with a broad range of applications. In this paper we propose novel approach named Hierarchical Attention Network (HAN), which enables to incorporate static spatial information, short-term motion information and long-term video temporal structures for complex action understanding. Compared recent convolutional neural network based approaches, HAN has following advantages (1) can efficiently capture longer range; (2) able reveal...

10.48550/arxiv.1607.06416 preprint EN cc-by arXiv (Cornell University) 2016-01-01

YouTube UGC Dataset for Video Compression Research

OPENALEX - Publications

Yilin Wang Sasi Inguva Balu Adsumilli

Non-professional video, commonly known as User Generated Content (UGC) has become very popular in today's video sharing applications. However, traditional metrics used compression and quality assessment, like BD-Rate PSNR, are designed for pristine originals. Thus, their accuracy drops significantly when being applied on non-pristine originals (the majority of UGC). Understanding difficulties assessment the scenario UGC is important, but there few public datasets available research. This...

10.1109/mmsp.2019.8901772 preprint EN 2019-09-01

Diversifying Tire-Defect Image Generation Based on Generative Adversarial Network

OPENALEX - Publications

Yulong Zhang Yilin Wang Zhiqiang Jiang Fagen Liao Zheng Li and 3 more

With the development of data-driven models, deep learning has been increasingly applied in field defect detection. However, performance models is greatly restricted by costly labeling and sample scarcity. One best approaches to solve data imbalance problem increasing quantity diversity samples. Meanwhile, current based on generative adversarial network (GAN) cannot readily control category shape generated samples, which results inefficient augmentation. Thus, simultaneously achieve...

10.1109/tim.2022.3160542 article EN IEEE Transactions on Instrumentation and Measurement 2022-01-01

A clean two-stage Bayer process for achieving near-zero waste discharge from high-iron gibbsitic bauxite

OPENALEX - Publications

Guotao Zhou Yilin Wang Yu-guan Zhang Tiangui Qi Qiusheng Zhou and 3 more

10.1016/j.jclepro.2023.136991 article EN Journal of Cleaner Production 2023-04-03

Fractional fourier transform and its application

OPENALEX - Publications

Yilin Wang

The Fourier Transform (FT) is a linear transformation for the primitive function. It takes some set of functions to be an orthogonal basis. Its physical meaning transfer function onto each base functions. Because it can convert between time and frequency domains, FT widely employed in many fields. Fractional (FrFT) improvement progress based on FT. This paper will define FrFT. Then distinction FrFT discussed. Finally, specific examples its application processing digital image are provided....

10.54254/2753-8818/42/20240103 article EN cc-by Theoretical and Natural Science 2024-06-24

BBAND INDEX: A NO-REFERENCE BANDING ARTIFACT PREDICTOR

OPENALEX - Publications

Zhengzhong Tu J. Lin Yilin Wang Balu Adsumilli Alan C. Bovik

Banding artifact, or false contouring, is a common video compression impairment that tends to appear on large flat regions in encoded videos. These staircase-shaped color bands can be very noticeable high-definition Here we study this and propose new distortion-specific no-reference quality model for predicting banding artifacts, called the Blind BANding Detector (BBAND index). BBAND inspired by human visual models. The proposed detector generate pixel-wise visibility map output severity...

10.1109/icassp40776.2020.9053634 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020-04-09

Subjective and Objective Quality Assessment of High Frame Rate Videos

OPENALEX - Publications

Pavan C. Madhusudana Xiangxu Yu Neil Birkbeck Yilin Wang Balu Adsumilli and 1 more

High frame rate (HFR) videos are becoming increasingly common with the tremendous popularity of live, high-action streaming content such as sports. Although HFR contents generally very high quality, bandwidth requirements make them challenging to deliver efficiently, while simultaneously maintaining their quality. To optimize trade-offs between and video in terms adaptation, it is imperative understand intricate relationship perceptual Towards advancing progression this direction we designed...

10.1109/access.2021.3100462 article EN cc-by IEEE Access 2021-01-01

Predicting the Quality of Compressed Videos With Pre-Existing Distortions

OPENALEX - Publications

Xiangxu Yu Neil Birkbeck Yilin Wang Christos G. Bampis Balu Adsumilli and 1 more

Because of the increasing ease video capture, many millions consumers create and upload large volumes User-Generated-Content (UGC) videos to social streaming media sites over Internet. UGC are commonly captured by naive users having limited skills imperfect techniques, tend be afflicted mixtures highly diverse in-capture distortions. These then often uploaded for sharing onto cloud servers, where they further compressed storage transmission. Our paper tackles practical problem predicting...

10.1109/tip.2021.3107213 article EN IEEE Transactions on Image Processing 2021-01-01

Effect of pulsed laser pretreatment induced pit-structure on the formation of intermetallic compounds in titanium-aluminum dissimilar welded joints

OPENALEX - Publications

Jintian Zhao Ping Jiang Shaoning Geng Yilin Wang Boan Xu

10.1016/j.optlastec.2023.109589 article EN Optics & Laser Technology 2023-06-09

Coming Soon ...