NFDI4DS | UHH-SEMS - Publication Details

Shanmuganathan Raman

ORCID: 0000-0003-2718-7891

About

Contact & Profiles

OpenAlex ID: A5070142969

Research Areas

Advanced Vision and Imaging
Image Enhancement Techniques
Advanced Image and Video Retrieval Techniques
Advanced Image Processing Techniques
Visual Attention and Saliency Detection
Video Surveillance and Tracking Methods
Image Processing Techniques and Applications
Human Pose and Action Recognition
Advanced Neural Network Applications
Computer Graphics and Visualization Techniques
3D Shape Modeling and Analysis
Natural Language Processing Techniques
Robotics and Sensor-Based Localization
Video Analysis and Summarization
Generative Adversarial Networks and Image Synthesis
Advanced Image Fusion Techniques
Image and Signal Denoising Methods
Image Retrieval and Classification Techniques
Hand Gesture Recognition Systems
Medical Image Segmentation Techniques
Multimodal Machine Learning Applications
Topic Modeling
Image and Video Quality Assessment
Image Processing and 3D Reconstruction
Anomaly Detection Techniques and Applications

Indian Institute of Technology Gandhinagar
2016-2025

GITAM University
2015

Dr. Hari Singh Gour University
2012

Indian Institute of Technology Bombay
1976-2011

Indian Institute of Technology Madras
1995-2009

Lawrence Berkeley National Laboratory
2005-2007

Indian Institute of Technology Kanpur
2005

Motorola (United States)
1995-2005

Indian Institute of Technology Guwahati
2004

University of Illinois Urbana-Champaign
1991-2002

Bilateral Filter Based Compositing for Variable Exposure Photography

OPENALEX - Publications

Shanmuganathan Raman, Subhasis Chaudhuri

Compositing a scene from multiple images is of considerable interest to graphics professionals. Typical compositing techniques involve estimation or explicit preparation of a matte by an artist. In this article, we address the problem of automatic compositing of images obtained through variable exposure photography. We consider High Dynamic Range Imaging (HDRI) and review some existing approaches for directly generating a Low Dynamic Range (LDR) image from multi-exposure images. We propose a computationally efficient method using edge-preserving...

10.2312/egs.20091034 article EN Eurographics 2009-01-01

Deep Generative Filter for Motion Deblurring

OPENALEX - Publications

Sainandan Ramakrishnan, Shubham Pachori, Aalok Gangopadhyay, Shanmuganathan Raman

Removing blur caused by camera shake in images has always been a challenging problem in the computer vision literature due to its ill-posed nature. Motion blur caused by the relative motion between the camera and the object in 3D space induces a spatially varying blurring effect over the entire image. In this paper, we propose a novel deep filter based on a Generative Adversarial Network (GAN) architecture integrated with a global skip connection and dense architecture in order to tackle this problem. Our model, while bypassing the process of kernel estimation,...

10.1109/iccvw.2017.353 article EN 2017-10-01

Yoga-82: A New Dataset for Fine-grained Classification of Human Poses

OPENALEX - Publications

Manisha Verma, Sudhakar Kumawat, Yuta Nakashima, Shanmuganathan Raman

Human pose estimation is a well-known problem in computer vision to locate joint positions. Existing datasets for learning of poses are observed to be not challenging enough in terms of pose diversity, object occlusion and view points. This makes the pose annotation process relatively simple and restricts the application of the models that have been trained on them. To handle more variety of human poses, we propose the concept of fine-grained hierarchical pose classification, in which we formulate pose estimation as a classification task, and propose a dataset, Yoga-82,...

10.1109/cvprw50498.2020.00527 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020-06-01

LP-3DCNN: Unveiling Local Phase in 3D Convolutional Neural Networks

OPENALEX - Publications

Sudhakar Kumawat, Shanmuganathan Raman

Traditional 3D Convolutional Neural Networks (CNNs) are computationally expensive, memory intensive, prone to overfit, and most importantly, there is a need to improve their feature learning capabilities. To address these issues, we propose the Rectified Local Phase Volume (ReLPV) block, an efficient alternative to the standard 3D convolutional layer. The ReLPV block extracts the phase in a 3D local neighborhood (e.g., 3 × 3 × 3) of each position of the input map to obtain the feature maps. The phase is extracted by computing the 3D Short Term Fourier...

10.1109/cvpr.2019.00504 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Facial Expression Recognition Using Visual Saliency and Deep Learning

OPENALEX - Publications

Viraj Mavani, Shanmuganathan Raman, Krishna Prasad Miyapuram

We have developed a convolutional neural network for the purpose of recognizing facial expressions in human beings. We fine-tuned an existing model trained on the visual recognition dataset used in ILSVRC2012 to two widely used facial expression datasets - CFEE and RaFD, which when tested independently yielded test accuracies of 74.79% and 95.71%, respectively. Generalization of results was evident by training on one dataset and testing on the other. Further, the image product of the cropped faces and their saliency maps were computed using Deep Multi-Layer...

10.1109/iccvw.2017.327 article EN 2017-10-01

Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human Action Recognition

OPENALEX - Publications

Sudhakar Kumawat, Manisha Verma, Yuta Nakashima, Shanmuganathan Raman

Conventional 3D convolutional neural networks (CNNs) are computationally expensive, memory intensive, prone to overfitting, and most importantly, there is a need to improve their feature learning capabilities. To address these issues, we propose spatio-temporal short term Fourier transform (STFT) blocks, a new class of convolutional blocks that can serve as an alternative to the 3D convolutional layer and its variants in 3D CNNs. An STFT block consists of non-trainable convolution layers that capture spatially and/or temporally local...

10.1109/tpami.2021.3076522 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2021-01-01

EEG2IMAGE: Image Reconstruction from EEG Brain Signals

OPENALEX - Publications

Prajwal Singh, Pankaj Pandey, Krishna Prasad Miyapuram, Shanmuganathan Raman

Reconstructing images using brain signals of imagined visuals may provide an augmented vision to the disabled, leading to the advancement of Brain-Computer Interface (BCI) technology. The recent progress in deep learning has boosted the study area of synthesizing images from brain signals using Generative Adversarial Networks (GAN). In this work, we have proposed a framework for synthesizing images from the brain activity recorded by an electroencephalogram (EEG) using small-size EEG datasets. This brain activity is recorded from the subject's head scalp using EEG when they are asked to visualize certain classes of Objects and...

10.1109/icassp49357.2023.10096587 article EN ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Improving legal information retrieval using an ontological framework

OPENALEX - Publications

M. Saravanan, Balaraman Ravindran, Shanmuganathan Raman

10.1007/s10506-009-9075-y article EN Artificial Intelligence and Law 2009-05-13

FHDR: HDR Image Reconstruction from a Single LDR Image using Feedback Network

OPENALEX - Publications

Zeeshan Khan, Mukul Khanna, Shanmuganathan Raman

High dynamic range (HDR) image generation from a single exposure low dynamic range (LDR) image has been made possible due to the recent advances in Deep Learning. Various feed-forward Convolutional Neural Networks (CNNs) have been proposed for learning LDR to HDR representations. To better utilize the power of CNNs, we exploit the idea of feedback, where the initial low level features are guided by the high level features using a hidden state of a Recurrent Neural Network. Unlike a single forward pass in a conventional feed-forward network, the reconstruction from LDR to HDR in a feedback network is learned over multiple...

10.1109/globalsip45357.2019.8969167 article EN 2019-11-01

Reconstruction of high contrast images for dynamic scenes

OPENALEX - Publications

Shanmuganathan Raman, Subhasis Chaudhuri

10.1007/s00371-011-0653-0 article EN The Visual Computer 2011-11-05

LBVCNN: Local Binary Volume Convolutional Neural Network for Facial Expression Recognition From Image Sequences

OPENALEX - Publications

Sudhakar Kumawat, Manisha Verma, Shanmuganathan Raman

Recognizing facial expressions is one of the central problems in computer vision. Temporal image sequences have useful spatio-temporal features for recognizing expressions. In this paper, we propose a new 3D Convolution Neural Network (CNN) that can be trained end-to-end for facial expression recognition on temporal image sequences without using facial landmarks. More specifically, a novel 3D convolutional layer that we call the Local Binary Volume (LBV) layer is proposed. The LBV layer, when used with our newly proposed LBVCNN network, achieve...

10.1109/cvprw.2019.00030 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019-06-01

Learning Robust Deep Visual Representations from EEG Brain Recordings

OPENALEX - Publications

Prajwal Singh, Dwip Dalal, Gautam Vashishtha, Krishna Prasad Miyapuram, Shanmuganathan Raman

Decoding the human brain has been a hallmark of neuroscientists and Artificial Intelligence researchers alike. Reconstruction of visual images from Electroencephalography (EEG) signals has garnered a lot of interest due to its applications in brain-computer interfacing. This study proposes a two-stage method where the first step is to obtain EEG-derived features for robust learning of deep representations and subsequently utilize the learned representation for image generation and classification. We demonstrate the generalizability...

10.1109/wacv57701.2024.00738 article EN 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024-01-03

Attentive Spatio-Temporal Representation Learning for Diving Classification

OPENALEX - Publications

Gagan Kanojia, Sudhakar Kumawat, Shanmuganathan Raman

Competitive diving is a well recognized aquatic sport in which a person dives from a platform or a springboard into the water. Based on the acrobatics performed during the dive, diving is classified into a finite set of action classes which are standardized by FINA. In this work, we propose an attention guided LSTM-based neural network architecture for the task of diving classification. The network takes the frames of a diving video as input and determines its class. We evaluate the performance of the proposed model on a recently introduced competitive diving dataset, Diving48. It...

10.1109/cvprw.2019.00302 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019-06-01

GraphFill: Deep Image Inpainting using Graphs

OPENALEX - Publications

Shashikant Verma, Aman Sharma, Roopa Sheshadri, Shanmuganathan Raman

We present a novel coarser-to-finer approach for deep graphical image inpainting that utilizes GraphFill, a graph neural network-based deep learning framework, and a lightweight generative baseline network. We construct a pyramidal graph for the input-masked image by reducing it into superpixels, each representing a node in the graph. The proposed pyramidal graph facilitates the transfer of global context from coarser to finer pyramid levels, enabling GraphFill to estimate plausible information for unknown node values in the graph. The estimated information is used to fill the masked region,...

10.1109/wacv57701.2024.00492 article EN 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024-01-03

Automatic trimap generation for image matting

OPENALEX - Publications

Vikas Gupta, Shanmuganathan Raman

Image matting is an important problem in computational photography. Although it has been studied for more than two decades, there is still the challenge of developing an automatic matting algorithm which does not require any human intervention. Most of the state-of-the-art matting algorithms require human intervention in the form of a trimap or scribbles to generate the alpha matte from the input image. In this paper, we present a simple and efficient approach to automatically generate the trimap from the input image and make the whole matting process free from human-in-the-loop. We use a learning based method...

10.1109/iconsip.2016.7857477 article EN 2016-10-01

Detecting Approximate Reflection Symmetry in a Point Set Using Optimization on Manifold

OPENALEX - Publications

Rajendra Nagar, Shanmuganathan Raman

We propose an algorithm to detect approximate reflection symmetry present in a set of volumetrically distributed points belonging to ℝ^d containing a distorted reflection symmetry pattern. We pose the problem of detecting approximate reflection symmetry as the problem of establishing correspondences between the points which are reflections of each other, and we determine the reflection symmetry transformation. We formulate an optimization framework in which establishing the correspondences amounts to solving a linear assignment problem, and determining the transformation on...

10.1109/tsp.2019.2893835 article EN IEEE Transactions on Signal Processing 2019-01-17

L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild

OPENALEX - Publications

Soumyaratna Debnath, Harish Katti, Shashikant Verma, Shanmuganathan Raman

While 2D pose estimation has advanced our ability to interpret body movements in animals and primates, it is limited by the lack of depth information, constraining its application range. 3D pose estimation provides a more comprehensive solution by incorporating spatial depth, yet creating extensive 3D pose datasets for animals is challenging due to their dynamic and unpredictable behaviours in natural settings. To address this, we propose a hybrid approach that utilizes rigged avatars and a pipeline to generate synthetic datasets to acquire the necessary...

10.48550/arxiv.2501.01174 preprint EN arXiv (Cornell University) 2025-01-02

IMPORTANT: Advanced Pollen Classification of Indian Medicinal Plants through SEM and Computer Vision

OPENALEX - Publications

Jaidev Sanjay Khalane, Nilesh D. Gawande, Shanmuganathan Raman, Subramanian Sankaranarayanan

Pollen grains of plant species have unique morphological characteristics. The variability in shape, size, and microscopic pollen surface features can be efficiently used to determine the plant species to which they belong. This approach can be instrumental in regions with rich biodiversity of plant species, specifically medicinal plants. The creation of a pollen dataset for these species using SEM images and computer vision application can be beneficial for their identification. We developed a robust dataset utilizing scanning electron microscopy (SEM) to generate...

10.1101/2025.01.08.631879 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2025-01-13

L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild

OPENALEX - Publications

Soumyaratna Debnath, Harish Katti, Shashikant Verma, Shanmuganathan Raman

10.1109/icassp49660.2025.10890158 article EN ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

BloomCoreset: Fast Coreset Sampling using Bloom Filters for Fine-Grained Self-Supervised Learning

OPENALEX - Publications

Prajwal Singh, Gautam Vashishtha, Indra Deep Mastan, Shanmuganathan Raman

10.1109/icassp49660.2025.10888815 article EN ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

LIPIDS: Learning-based Illumination Planning In Discretized (Light) Space for Photometric Stereo

OPENALEX - Publications

Ashish Tiwari, Mihirkumar Sutariya, Shanmuganathan Raman

10.1109/wacv61041.2025.00073 article EN 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects

OPENALEX - Publications

Soumyaratna Debnath, Ashish Tiwari, Kaustubh Sadekar, Shanmuganathan Raman

Recent advancements in learning-based methods have opened new avenues for exploring and interpreting art forms, such as shadow art, origami, and sketch art, through computational models. One notable visual art form is 3D Anamorphic Art, in which an ensemble of arbitrarily shaped 3D objects creates a realistic and meaningful expression when observed from a particular viewpoint and loses its coherence over the other viewpoints. In this work, we build on insights from 3D Anamorphic Art to perform 3D object arrangement. We introduce RASP,...

10.48550/arxiv.2504.02465 preprint EN arXiv (Cornell University) 2025-04-03

Geometric approach to segmentation and protein localization in cell culture assays

OPENALEX - Publications

Shanmuganathan Raman, Christopher A. Maxwell, Mary Helen Barcellos‐Hoff, Bahram Parvin

Cell‐based fluorescence imaging assays are heterogeneous and require the collection of a large number of images for detailed quantitative analysis. Complexities arise as a result of variation in spatial nonuniformity, shape, overlapping compartments and scale (size). A new technique and methodology has been developed and tested for delineating subcellular morphology and partitioning overlapping compartments at multiple scales. This system is packaged as an integrated software platform for quantifying images that are obtained through fluorescence microscopy. Proposed...

10.1111/j.1365-2818.2007.01712.x article EN Journal of Microscopy 2007-01-01

Robust PCA-based solution to image composition using augmented Lagrange multiplier (ALM)

OPENALEX - Publications

Adit Bhardwaj, Shanmuganathan Raman

10.1007/s00371-015-1075-1 article EN The Visual Computer 2015-03-16
