Wenyu Liu

ORCID: 0000-0002-4582-7488
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Advanced Neural Network Applications
  • Image Retrieval and Classification Techniques
  • Video Surveillance and Tracking Methods
  • Domain Adaptation and Few-Shot Learning
  • Advanced Vision and Imaging
  • Human Pose and Action Recognition
  • Energy Efficient Wireless Sensor Networks
  • Medical Image Segmentation Techniques
  • Handwritten Text Recognition Techniques
  • Multimodal Machine Learning Applications
  • Visual Attention and Saliency Detection
  • Robotics and Sensor-Based Localization
  • Indoor and Outdoor Localization Technologies
  • Heat Transfer and Optimization
  • Video Coding and Compression Technologies
  • Face and Expression Recognition
  • Mobile Ad Hoc Networks
  • Simulation and Modeling Applications
  • Hand Gesture Recognition Systems
  • Advanced Data Compression Techniques
  • Advanced Measurement and Detection Methods
  • Image Processing and 3D Reconstruction
  • Computer Graphics and Visualization Techniques
  • 3D Shape Modeling and Analysis

Huazhong University of Science and Technology
2016-2025

Chinese Academy of Medical Sciences & Peking Union Medical College
2009-2025

Nanjing University
2018-2025

Qingdao Agricultural University
2025

Nankai University
2025

Collaborative Innovation Center of Chemical Science and Engineering Tianjin
2025

West China Hospital of Sichuan University
2017-2025

Peking Union Medical College Hospital
2009-2025

Beijing University of Civil Engineering and Architecture
2025

Tsinghua–Berkeley Shenzhen Institute
2025

High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image a low-resolution representation through subnetwork that is formed by connecting high-to-low resolution convolutions <i>in series</i> (e.g., ResNet, VGGNet), then recover high-resolution from encoded representation. Instead, our proposed network, named High-Resolution...

10.1109/tpami.2020.2983686 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2020-04-01

Full-image dependencies provide useful contextual information to benefit visual understanding problems. In this work, we propose a Criss-Cross Network (CCNet) for obtaining such in more effective and efficient way. Concretely, each pixel, novel criss-cross attention module CCNet harvests the of all pixels on its path. By taking further recurrent operation, pixel can finally capture full-image from pixels. Overall, is with following merits: 1) GPU memory friendly. Compared non-local block,...

10.1109/iccv.2019.00069 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

This paper presents an end-to-end trainable fast scene text detector, named TextBoxes, which detects with both high accuracy and efficiency in a single network forward pass, involving no post-process except for standard non-maximum suppression. TextBoxes outperforms competing methods terms of localization is much faster, taking only 0.09s per image implementation. Furthermore, combined recognizer, significantly state-of-the-art approaches on word spotting recognition tasks.

10.1609/aaai.v31i1.11196 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2017-02-12

With the increasing popularity of practical vision systems and smart phones, text detection in natural scenes becomes a critical yet challenging task. Most existing methods have focused on detecting horizontal or near-horizontal texts. In this paper, we propose system which detects texts arbitrary orientations images. Our algorithm is equipped with two-level classification scheme two sets features specially designed for capturing both intrinsic characteristics To better evaluate our compare...

10.1109/cvpr.2012.6247787 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2012-06-01

Abstract Accurate and rapid diagnosis of COVID-19 suspected cases plays a crucial role in timely quarantine medical treatment. Developing deep learning-based model for automatic detection on chest CT is helpful to counter the outbreak SARS-CoV-2. A weakly-supervised software system was developed using 3D volumes detect COVID-19. For each patient, lung region segmented pre-trained UNet; then fed into neural network predict probability infectious. 499 collected from Dec. 13, 2019, Jan. 23,...

10.1101/2020.03.12.20027185 preprint EN cc-by-nc-nd medRxiv (Cold Spring Harbor Laboratory) 2020-03-17

The pervasiveness of location-acquisition technologies (GPS, GSM networks, etc.) enable people to conveniently log the location histories they visited with spatio-temporal data. increasing availability large amounts data pertaining an individual's trajectories has given rise a variety geographic information systems, and also brings us opportunities challenges automatically discover valuable knowledge from these trajectories. In this paper, we move towards direction aim geographically mine...

10.1145/1463434.1463477 article EN 2008-11-05

Accurate and rapid diagnosis of COVID-19 suspected cases plays a crucial role in timely quarantine medical treatment. Developing deep learning-based model for automatic on chest CT is helpful to counter the outbreak SARS-CoV-2. A weakly-supervised learning framework was developed using 3D volumes classification lesion localization. For each patient, lung region segmented pre-trained UNet; then fed into neural network predict probability infectious; lesions are localized by combining...

10.1109/tmi.2020.2995965 article EN IEEE Transactions on Medical Imaging 2020-05-20

High-resolution representation learning plays an essential role in many vision problems, e.g., pose estimation and semantic segmentation. The high-resolution network (HRNet)~\cite{SunXLW19}, recently developed for human estimation, maintains representations through the whole process by connecting high-to-low resolution convolutions \emph{parallel} produces strong repeatedly conducting fusions across parallel convolutions. In this paper, we conduct a further study on introducing simple yet...

10.48550/arxiv.1904.04514 preprint EN other-oa arXiv (Cornell University) 2019-01-01

This paper studies the problem of learning image semantic segmentation networks only using image-level labels as supervision, which is important since it can significantly reduce human annotation efforts. Recent state-of-the-art methods on this first infer sparse and discriminative regions for each object class a deep classification network, then train network supervision. Inspired by traditional seeded region growing, we propose to starting from progressively increase pixel-level...

10.1109/cvpr.2018.00733 article EN 2018-06-01

In this paper, we propose a novel approach for text detection in natural images. Both local and global cues are taken into account localizing lines coarse-to-fine procedure. First, Fully Convolutional Network (FCN) model is trained to predict the salient map of regions holistic manner. Then, line hypotheses estimated by combining character components. Finally, another FCN classifier used centroid each character, order remove false hypotheses. The framework general handling multiple...

10.1109/cvpr.2016.451 preprint EN 2016-06-01

The study of Cyber-Physical System (CPS) has been an active area research. Internet Data Center (IDC) is important emerging System. As the demand on services drastically increases in recent years, power used by IDCs skyrocketing. While most existing research focuses reducing consumptions IDCs, management problem for minimizing total electricity cost overlooked. This faced service providers, especially current multi-electricity market, where price may exhibit time and location diversities....

10.1109/infcom.2010.5461933 article EN 2010-03-01

Of late, weakly supervised object detection is with great importance in recognition. Based on deep learning, detectors have achieved many promising results. However, compared fully detection, it more challenging to train network based a manner. Here we formulate as Multiple Instance Learning (MIL) problem, where instance classifiers (object detectors) are put into the hidden nodes. We propose novel online classifier refinement algorithm integrate MIL and procedure single network, end-to-end...

10.1109/cvpr.2017.326 article EN 2017-07-01

This paper presents an end-to-end trainable fast scene text detector, named TextBoxes, which detects with both high accuracy and efficiency in a single network forward pass, involving no post-process except for standard non-maximum suppression. TextBoxes outperforms competing methods terms of localization is much faster, taking only 0.09s per image implementation. Furthermore, combined recognizer, significantly state-of-the-art approaches on word spotting recognition tasks.

10.48550/arxiv.1611.06779 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Contextual information is vital in visual understanding problems, such as semantic segmentation and object detection. We propose a criss-cross network (CCNet) for obtaining full-image contextual very effective efficient way. Concretely, each pixel, novel attention module harvests the of all pixels on its path. By taking further recurrent operation, pixel can finally capture dependencies. Besides, category consistent loss proposed to enforce produce more discriminative features. Overall,...

10.1109/tpami.2020.3007032 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2020-07-03

In this paper, we introduce a new skeleton pruning method based on contour partitioning. Any partition can be used, but the partitions obtained by discrete curve evolution (DCE) yield excellent results. The theoretical properties and experiments presented demonstrate that skeletons are in accord with human visual perception stable, even presence of significant noise shape variations, have same topology as original skeletons. particular, proven proposed approach never produces spurious...

10.1109/tpami.2007.59 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2007-01-23

Weakly Supervised Object Detection (WSOD), using only image-level annotations to train object detectors, is of growing importance in recognition. In this paper, we propose a novel deep network for WSOD. Unlike previous networks that transfer the detection problem an image classification Multiple Instance Learning (MIL), our strategy generates proposal clusters learn refined instance classifiers by iterative process. The proposals same cluster are spatially adjacent and associated with...

10.1109/tpami.2018.2876304 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-10-16

High level semantics embodied in scene texts are both rich and clear thus can serve as important cues for a wide range of vision applications, instance, image understanding, indexing, video search, geolocation, automatic navigation. In this paper, we present unified framework text detection recognition natural images. The contributions paper threefold: 1) accomplished concurrently using exactly the same features classification scheme; 2) contrast to methods literature, which mainly focus on...

10.1109/tip.2014.2353813 article EN IEEE Transactions on Image Processing 2014-09-08

Driven by the wide range of applications, scene text detection and recognition have become active research topics in computer vision. Though extensively studied, localizing reading uncontrolled environments remain extremely challenging, due to various interference factors. In this paper, we propose a novel multi-scale representation for recognition. This consists set detectable primitives, termed as strokelets, which capture essential substructures characters at different granularities....

10.1109/cvpr.2014.515 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2014-06-01

Shape similarity and shape retrieval are very important topics in computer vision. The recent progress this domain has been mostly driven by designing smart descriptors for providing better measure between pairs of shapes. In paper, we provide a new perspective to problem considering the existing shapes as group, study their measures query graph structure. Our method is general can be built on top any measure. For given measure, learned through transduction. iteratively so that neighbors...

10.1109/tpami.2009.85 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2009-04-18

Though deep learning-based object detection methods have achieved promising results on the conventional datasets, it is still challenging to locate objects from low-quality images captured in adverse weather conditions. The existing either difficulties balancing tasks of image enhancement and detection, or often ignore latent information beneficial for detection. To alleviate this problem, we propose a novel Image-Adaptive YOLO (IA-YOLO) framework, where each can be adaptively enhanced...

10.1609/aaai.v36i2.20072 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28
Coming Soon ...