NFDI4DS | UHH-SEMS - Publication Details

Thomas S. Huang

ORCID: 0000-0001-8474-5859

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5101457342

Research Areas

Advanced Image and Video Retrieval Techniques
Advanced Vision and Imaging
Image Retrieval and Classification Techniques
Face and Expression Recognition
Face recognition and analysis
Video Analysis and Summarization
Video Surveillance and Tracking Methods
Image and Signal Denoising Methods
Advanced Image Processing Techniques
Human Pose and Action Recognition
Speech and Audio Processing
Music and Audio Processing
Image Processing Techniques and Applications
Advanced Neural Network Applications
Robotics and Sensor-Based Localization
Sparse and Compressive Sensing Techniques
Optical measurement and interference techniques
Domain Adaptation and Few-Shot Learning
Medical Image Segmentation Techniques
Hand Gesture Recognition Systems
Emotion and Mood Recognition
Anomaly Detection Techniques and Applications
Neural Networks and Applications
Advanced Data Compression Techniques
Remote-Sensing Image Classification

University of Illinois Urbana-Champaign
2013-2023

Central South University
2023

Jet Propulsion Laboratory
2004-2022

International University of the Caribbean
2018-2021

Nature Inspires Creativity Engineers Lab
2010-2020

York University
2019-2020

University of Michigan–Ann Arbor
2020

Seoul National University
2019

Kapiolani Medical Center for Women and Children
2019

Nanjing University of Science and Technology
2016

Image Super-Resolution Via Sparse Representation

OPENALEX - Publications

Shuicheng Yan John Wright Thomas S. Huang Yi Ma

This paper presents a new approach to single-image super-resolution, based on sparse signal representation. Research image statistics suggests that patches can be well-represented as linear combination of elements from an appropriately chosen over-complete dictionary. Inspired by this observation, we seek representation for each patch the low-resolution input, and then use coefficients generate high-resolution output. Theoretical results compressed sensing suggest under mild conditions,...

10.1109/tip.2010.2050625 article EN IEEE Transactions on Image Processing 2010-05-26

Least-Squares Fitting of Two 3-D Point Sets

OPENALEX - Publications

K.S. Arun Thomas S. Huang Steven D. Blostein

Two point sets {pi} and {p'i}; i = 1, 2,..., N are related by p'i Rpi + T Ni, where R is a rotation matrix, translation vector, Ni noise vector. Given {p'i}, we present an algorithm for finding the least-squares solution of T, which based on singular value decomposition (SVD) 3 × matrix. This new compared to two earlier algorithms with respect computer time requirements.

10.1109/tpami.1987.4767965 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 1987-09-01

Locality-constrained Linear Coding for image classification

OPENALEX - Publications

Jinjun Wang Shuicheng Yan Kai Yu Fengjun Lv Thomas S. Huang and 1 more

The traditional SPM approach based on bag-of-features (BoF) requires nonlinear classifiers to achieve good image classification performance. This paper presents a simple but effective coding scheme called Locality-constrained Linear Coding (LLC) in place of the VQ SPM. LLC utilizes locality constraints project each descriptor into its local-coordinate system, and projected coordinates are integrated by max pooling generate final representation. With linear classifier, proposed performs...

10.1109/cvpr.2010.5540018 article EN 2010-06-01

Linear spatial pyramid matching using sparse coding for image classification

OPENALEX - Publications

Shuicheng Yan Kai Yu Yihong Gong Thomas S. Huang

Recently SVMs using spatial pyramid matching (SPM) kernel have been highly successful in image classification. Despite its popularity, these nonlinear a complexity O(n <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> ∼ n xmlns:xlink="http://www.w3.org/1999/xlink">3</sup> ) training and O(n) testing, where is the size, implying that it nontrivial to scaleup algorithms handlemore than thousands of images. In this paper we develop an...

10.1109/cvpr.2009.5206757 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2009-06-01

Graph Regularized Nonnegative Matrix Factorization for Data Representation

OPENALEX - Publications

Deng Cai Xiaofei He Jiawei Han Thomas S. Huang

Matrix factorization techniques have been frequently applied in information retrieval, computer vision, and pattern recognition. Among them, Nonnegative Factorization (NMF) has received considerable attention due to its psychological physiological interpretation of naturally occurring data whose representation may be parts based the human brain. On other hand, from geometric perspective, is usually sampled a low-dimensional manifold embedded high-dimensional ambient space. One then hopes...

10.1109/tpami.2010.231 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2010-12-23

Generative Image Inpainting with Contextual Attention

OPENALEX - Publications

Jiahui Yu Zhe Lin Shuicheng Yan Xiaohui Shen Xin Lu and 1 more

Recent deep learning based approaches have shown promising results for the challenging task of inpainting large missing regions in an image. These methods can generate visually plausible image structures and textures, but often create distorted or blurry textures inconsistent with surrounding areas. This is mainly due to ineffectiveness convolutional neural networks explicitly borrowing copying information from distant spatial locations. On other hand, traditional texture patch synthesis are...

10.1109/cvpr.2018.00577 article EN 2018-06-01

Sparse Representation for Computer Vision and Pattern Recognition

OPENALEX - Publications

John Wright Yi Ma Julien Mairal Guillermo Sapiro Thomas S. Huang and 1 more

Techniques from sparse signal representation are beginning to see significant impact in computer vision, often on nontraditional applications where the goal is not just obtain a compact high-fidelity of observed signal, but also extract semantic information. The choice dictionary plays key role bridging this gap: unconventional dictionaries consisting of, or learned from, training samples themselves provide obtaining state-of-the-art results and attaching meaning representations....

10.1109/jproc.2010.2044470 article EN Proceedings of the IEEE 2010-05-10

Relevance feedback: a power tool for interactive content-based image retrieval

OPENALEX - Publications

Yong Rui Thomas S. Huang M. Ortega Sharad Mehrotra

Content-based image retrieval (CBIR) has become one of the most active research areas in past few years. Many visual feature representations have been explored and many systems built. While these efforts establish basis CBIR, usefulness proposed approaches is limited. Specifically, relatively ignored two distinct characteristics CBIR systems: (1) gap between high-level concepts low-level features, (2) subjectivity human perception content. This paper proposes a relevance feedback based...

10.1109/76.718510 article EN IEEE Transactions on Circuits and Systems for Video Technology 1998-01-01

Free-Form Image Inpainting With Gated Convolution

OPENALEX - Publications

Jiahui Yu Zhe Lin Shuicheng Yan Xiaohui Shen Xin Lu and 1 more

We present a generative image inpainting system to complete images with free-form mask and guidance. The is based on gated convolutions learned from millions of without additional labelling efforts. proposed convolution solves the issue vanilla that treats all input pixels as valid ones, generalizes partial by providing learnable dynamic feature selection mechanism for each channel at spatial location across layers. Moreover, masks may appear anywhere in any shape, global local GANs designed...

10.1109/iccv.2019.00457 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Image Retrieval: Current Techniques, Promising Directions, and Open Issues

OPENALEX - Publications

Yong Rui Thomas S. Huang Shih‐Fu Chang

10.1006/jvci.1999.0413 article EN Journal of Visual Communication and Image Representation 1999-03-01

Image super-resolution as sparse representation of raw image patches

OPENALEX - Publications

Shuicheng Yan John L. Wright Thomas S. Huang Yi Ma

This paper addresses the problem of generating a super-resolution (SR) image from single low-resolution input image. We approach this perspective compressed sensing. The is viewed as downsampled version high-resolution image, whose patches are assumed to have sparse representation with respect an over-complete dictionary prototype signal-atoms. principle sensing ensures that under mild conditions, can be correctly recovered signal. will demonstrate effectiveness sparsity prior for...

10.1109/cvpr.2008.4587647 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2008-06-01

A fast two-dimensional median filtering algorithm

OPENALEX - Publications

Thomas S. Huang Gang Yang Gongguo Tang

We present a fast algorithm for two-dimensional median filtering. It is based on storing and updating the gray level histogram of picture elements in window. The much faster than conventional sorting methods. For window size m × n, computer time required 0(n).

10.1109/tassp.1979.1163188 article EN IEEE Transactions on Acoustics Speech and Signal Processing 1979-02-01

NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results

OPENALEX - Publications

Radu Timofte Eirikur Agustsson Luc Van Gool Shuicheng Yan Lei Zhang and 72 more

This paper reviews the first challenge on single image super-resolution (restoration of rich details in an low resolution image) with focus proposed solutions and results. A new DIVerse 2K dataset (DIV2K) was employed. The had 6 competitions divided into 2 tracks 3 magnification factors each. Track 1 employed standard bicubic downscaling setup, while unknown operators (blur kernel decimation) but learnable through high res train images. Each competition ∽100 registered participants 20 teams...

10.1109/cvprw.2017.149 article EN 2017-07-01

Facial expression recognition from video sequences: temporal and static modeling

OPENALEX - Publications

Ira L. Cohen Nicu Sebe Ashutosh Garg Lawrence S. Chen Thomas S. Huang

10.1016/s1077-3142(03)00081-x article EN Computer Vision and Image Understanding 2003-07-01

Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces

OPENALEX - Publications

R. Tsai Thomas S. Huang

Two main results are established in this paper. First, we show that seven point correspondences sufficient to uniquely determine from two perspective views the three-dimensional motion parameters (within a scale factor for translations) of rigid object with curved surfaces. The points should not be traversed by planes one plane containing origin, nor cone origin. Second, set ``essential parameters'' introduced which up translations, and can estimated solving linear equations derived eight...

10.1109/tpami.1984.4767471 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 1984-01-01

Coupled Dictionary Training for Image Super-Resolution

OPENALEX - Publications

Shuicheng Yan Zhaowen Wang Zhe Lin Scott Cohen Thomas S. Huang

In this paper, we propose a novel coupled dictionary training method for single image super-resolution based on patchwise sparse recovery, where the learned couple dictionaries relate low- and high-resolution patch spaces via representation. The learning process enforces that representation of low-resolution in terms can well reconstruct its underlying with highresolution space. We model problem as bilevel optimization problem, includes an 1-norm minimization constraints. Implicit...

10.1109/tip.2012.2192127 article EN IEEE Transactions on Image Processing 2012-04-11

Content-based image retrieval with relevance feedback in MARS

OPENALEX - Publications

Yong Rui Thomas S. Huang Sharad Mehrotra

Technology advances in the areas of image processing (IP) and information retrieval (IR) have evolved separately for a long time. However, successful content-based systems require integration two. There is an urgent need to develop mechanisms link model text model, such that well established techniques can be utilized. Approaches converting feature vectors (IF domain) weighted-term (IR are proposed this paper. Furthermore, relevance feedback technique from IR domain used demonstrate...

10.1109/icip.1997.638621 article EN 2002-11-23

HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation

OPENALEX - Publications

Bowen Cheng Bin Xiao Jingdong Wang Humphrey Shi Thomas S. Huang and 1 more

Bottom-up human pose estimation methods have difficulties in predicting the correct for small persons due to challenges scale variation. In this paper, we present HigherHRNet: a novel bottom-up method learning scale-aware representations using high-resolution feature pyramids. Equipped with multi-resolution supervision training and aggregation inference, proposed approach is able solve variation challenge multi-person localize keypoints more precisely, especially person. The pyramid...

10.1109/cvpr42600.2020.00543 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Fast fourier transform and convolution algorithms

OPENALEX - Publications

Henri J. Nussbaumer King‐Sun Fu Thomas S. Huang Manfred R. Schroeder

10.1016/0378-4754(81)90075-6 article EN Mathematics and Computers in Simulation 1981-07-01

Deep Networks for Image Super-Resolution with Sparse Prior

OPENALEX - Publications

Zhaowen Wang Ding Liu Shuicheng Yan Wei Han Thomas S. Huang

Deep learning techniques have been successfully applied in many areas of computer vision, including low-level image restoration problems. For super-resolution, several models based on deep neural networks recently proposed and attained superior performance that overshadows all previous handcrafted models. The question then arises whether large-capacity data-driven become the dominant solution to ill-posed super-resolution problem. In this paper, we argue domain expertise represented by...

10.1109/iccv.2015.50 preprint EN 2015-12-01

Coming Soon ...