- Advanced Image and Video Retrieval Techniques
- Web and Library Services
- Library Science and Administration
- Multimodal Machine Learning Applications
- Online and Blended Learning
- Library Science and Information Literacy
- Advanced Vision and Imaging
- Dialysis and Renal Disease Management
- Mobile Learning in Education
- Library Science and Information Systems
- Advanced Neural Network Applications
- Linguistics, Language Diversity, and Identity
- Image Enhancement Techniques
- Visual Attention and Saliency Detection
- Renal Diseases and Glomerulopathies
- Advanced Image Processing Techniques
- Lexicography and Language Studies
- Library Collection Development and Digital Resources
- Image Retrieval and Classification Techniques
- Handwritten Text Recognition Techniques
- Domain Adaptation and Few-Shot Learning
- Video Analysis and Summarization
- Open Education and E-Learning
- Health Sciences Research and Education
- Generative Adversarial Networks and Image Synthesis
Georgia State University
2023-2025
Adobe Systems (United States)
2015-2024
Medical College of Wisconsin
2022-2024
University of Alabama at Birmingham
2024
Cleveland Clinic Florida
2023-2024
Washington DC VA Medical Center
2022-2023
Veterans Health Administration
2023
George Washington University
2013-2022
Children's Hospital of Wisconsin
2022
Indiana University Health
2022
In this paper, we propose a novel coupled dictionary training method for single image super-resolution based on patchwise sparse recovery, where the learned couple dictionaries relate low- and high-resolution patch spaces via representation. The learning process enforces that representation of low-resolution in terms can well reconstruct its underlying with highresolution space. We model problem as bilevel optimization problem, includes an 1-norm minimization constraints. Implicit...
Depth estimation and semantic segmentation are two fundamental problems in image understanding. While the tasks strongly correlated mutually beneficial, they usually solved separately or sequentially. Motivated by complementary properties of tasks, we propose a unified framework for joint depth prediction. Given an image, first use trained Convolutional Neural Network (CNN) to jointly predict global layout composed pixel-wise values labels. By allowing interactions between information,...
Image matting is a fundamental computer vision problem and has many applications. Previous algorithms have poor performance when an image similar foreground background colors or complicated textures. The main reasons are prior methods 1) only use low-level features 2) lack high-level context. In this paper, we propose novel deep learning based algorithm that can tackle both these problems. Our model two parts. first part convolutional encoder-decoder network takes the corresponding trimap as...
Interactive object selection is a very important research problem and has many applications. Previous algorithms require substantial user interactions to estimate the foreground background distributions. In this paper, we present novel deep-learning-based algorithm which much better understanding of objectness can reduce just few clicks. Our transforms user-provided positive negative clicks into two Euclidean distance maps are then concatenated with RGB channels images compose (image,...
Psychosocial issues are an understudied yet important concern in the overall health of hemodialysis (HD) patients. Stress is a concomitant chronic illness and its treatment, may have meaningful influences on psychological medical outcomes. This article reviews psychopathology, social support, family issues, dialysis unit culture, socioeconomic status patients treated with center HD. Depressive affect decreased perception support been linked mortality several studies ESRD Decreased marital...
We develop a deep learning algorithm for contour detection with fully convolutional encoder-decoder network. Different from previous low-level edge detection, our focuses on detecting higher-level object contours. Our network is trained end-to-end PASCAL VOC refined ground truth inaccurate polygon annotations, yielding much higher precision in than methods. find that the learned model generalizes well to unseen classes same supercategories MS COCO and can match state-of-the-art BSDS500...
We propose a fast regression model for practical single image super-resolution based on in-place examples, by leveraging two fundamental approaches- learning from an external database and self-examples. Our self-similarity refines the recently proposed local proving that patch in upper scale have good matches around its origin location lower image. Based first-order approximation of nonlinear mapping function low-to high-resolution patches is learned. Extensive experiments benchmark...
In this paper, we present a new image matting algorithm that achieves state-of-the-art performance on benchmark dataset of images. This is achieved by solving two major problems encountered current sampling based algorithms. The first the range in which foreground and background are sampled often limited to such an extent true colors not present. Here, describe method more comprehensive representative set samples collected so as miss out samples. accomplished expanding for pixels farther...
Bar charts are an effective way to convey numeric information, but today's algorithms cannot parse them. Existing methods fail when faced with even minor variations in appearance. Here, we present DVQA, a dataset that tests many aspects of bar chart understanding question answering framework. Unlike visual (VQA), DVQA requires processing words and answers unique particular chart. State-of-the-art VQA perform poorly on propose two strong baselines considerably better. Our work will enable...
One property that remains lacking in image captions generated by contemporary methods is discriminability: being able to tell two images apart given the caption for one of them. We propose a way improve this aspect generation. By incorporating into captioning training objective loss component directly related ability (by machine) disambiguate image/caption matches, we obtain systems produce much more discriminative caption, according human evaluation. Remarkably, our approach leads...
Interactive segmentation is useful for selecting objects of interest in images and continues to be a topic much study. Methods that grow regions from foreground/background seeds, such as the recent geodesic approach, avoid boundary-length bias graph-cut methods but have their own towards minimizing paths resulting increased sensitivity seed placement. The lack edge modeling or similar approaches limits ability precisely localize object boundaries, something at which generally excel. This...
This paper presents a scalable scene parsing algorithm based on image retrieval and superpixel matching. We focus rare object classes, which play an important role in achieving richer semantic understanding of visual scenes, compared to common background classes. Towards this end, we make two novel contributions: class expansion context description. First, considering the long-tailed nature label distribution, expand set by exemplars thus achieve more balanced classification results. Second,...
We introduce two data augmentation and normalization techniques, which, used with a CNN-LSTM, significantly reduce Word Error Rate (WER) Character (CER) beyond best-reported results on handwriting recognition tasks. (1) apply novel profile technique to both word line images. (2) augment existing text images using random perturbations regular grid. our training test Our approach achieves low WER CER over hundreds of authors, multiple languages variety collections written centuries apart....
Video sequences contain many cues that may be used to segment objects in them, such as color, gradient, color adjacency, shape, temporal coherence, camera and object motion, easily-trackable points. This paper introduces LIVEcut, a novel method for interactively selecting video by extracting leveraging much of this information possible. Using graph-cut optimization framework, LIVEcut propagates the selection forward frame frame, allowing user correct any mistakes along way if needed....
Few studies have assessed sleep disturbances or perception of pain in patients with early-stage chronic kidney disease. It was hypothesized that and disturbance would increase disease stage, correlate psychosocial variables, there be a higher prevalence compared general medical patients.A total 92 predialysis 61 outpatients were evaluated using the Beck Depression Inventory, Illness Effects Questionnaire, Multidimensional Scale Perceived Social Support, Satisfaction Life Scale, Karnofsky...
Estimating the amount of blur in a given image is important for computer vision applications. More specifically, spatially varying defocus point-spread-functions (PSFs) over an reveal geometric information scene, and their estimate can also be used to recover all-in-focus image. A PSF specified by single parameter indicating its scale. Most existing algorithms only select optimal from finite set candidate PSFs each pixel. Some those methods require coded aperture filter inserted camera. In...
We present an optical flow algorithm for large displacement motions. Most existing methods use the standard coarse-to-fine framework to deal with motions which has intrinsic limitations. Instead, we formulate motion estimation problem as a segmentation problem. approximate nearest neighbor fields compute initial field and robust set of similarity transformations candidates segmentation. To account deviations from transformations, add local deformations in process. also observe that small...
Illumination estimation is the process of determining chromaticity illumination in an imaged scene order to remove undesirable color casts through white-balancing. While computational constancy a well-studied topic computer vision, it remains challenging due ill-posed nature problem. One class techniques relies on low-level statistical information image distribution and works under various assumptions (e.g. Grey-World, White-Patch, etc). These methods have advantage that they are simple...
Image scene understanding requires learning the relationships between objects in scene. A with many may have only a few individual interacting (e.g., party image people, handful of people might be speaking each other). To detect all relationships, it would inefficient to first and then classify pairs, not is number pairs quadratic, but classification limited object categories, which scalable for real-world images. In this paper we address these challenges by using related regions images...