NFDI4DS | UHH-SEMS - Publication Details

Kumar Ayush

ORCID: 0000-0002-9680-2061

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100653219

Research Areas

Generative Adversarial Networks and Image Synthesis
Domain Adaptation and Few-Shot Learning
Advanced Image and Video Retrieval Techniques
Remote-Sensing Image Classification
COVID-19 epidemiological studies
Image Enhancement Techniques
Visual Attention and Saliency Detection
Visual perception and processing mechanisms
Image and Video Quality Assessment
Face Recognition and Perception
Natural Language Processing Techniques
Anomaly Detection Techniques and Applications
Recommender Systems and Techniques
Advanced Image Processing Techniques
Water resources management and optimization
Biomedical Text Mining and Ontologies
Perovskite Materials and Applications
Topic Modeling
Advanced Vision and Imaging
Solid-state spectroscopy and crystallography
Machine Learning in Materials Science
Advanced Neural Network Applications
Chalcogenide Semiconductor Thin Films
Land Use and Ecosystem Services
Video Surveillance and Tracking Methods

Bennett University
2024-2025

Government Medical College
2024

Indian Institute of Technology Madras
2023

Meril Life Sciences (India)
2023

Institute of Management Technology
2023

Lovely Professional University
2023

Stanford University
2019-2022

Indian Institute of Technology Bombay
2017-2021

Adobe Systems (United States)
2018-2019

Indian Space Research Organisation
2017

Fly Cell Atlas: A single-nucleus transcriptomic atlas of the adult fruit fly

OPENALEX - Publications

Hongjie Li Jasper Janssens Maxime De Waegeneer Sai Saroja Kolluru Kristofer Davie and 95 more

For more than 100 years, the fruit fly

10.1126/science.abk2432 article EN Science 2022-03-03

DeepFix: A Fully Convolutional Neural Network for Predicting Human Eye Fixations

OPENALEX - Publications

Srinivas S S Kruthiventi Kumar Ayush R. Venkatesh Babu

Understanding and predicting the human visual attention mechanism is an active area of research in fields neuroscience computer vision. In this paper, we propose DeepFix, a fully convolutional neural network, which models bottom-up via saliency prediction. Unlike classical works, characterize map using various hand-crafted features, our model automatically learns features hierarchical fashion predicts end-to-end manner. DeepFix designed to capture semantics at multiple scales while taking...

10.1109/tip.2017.2710620 article EN IEEE Transactions on Image Processing 2017-06-01

Geography-Aware Self-Supervised Learning

OPENALEX - Publications

Kumar Ayush Burak Uzkent Chenlin Meng Kumar Tanmay Marshall Burke and 2 more

Contrastive learning methods have significantly narrowed the gap between supervised and unsupervised on computer vision tasks. In this paper, we explore their application to geo-located datasets, e.g. remote sensing, where unlabeled data is often abundant but labeled scarce. We first show that due different characteristics, a non-trivial persists contrastive standard benchmarks. To close gap, propose novel training exploit spatio-temporal structure of sensing data. leverage spatially aligned...

10.1109/iccv48922.2021.01002 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Generative Adversarial Learning for Reducing Manual Annotation in Semantic Segmentation on Large Scale Miscroscopy Images: Automated Vessel Segmentation in Retinal Fundus Image as Test Case

OPENALEX - Publications

Avisek Lahiri Kumar Ayush Prabir Kumar Biswas Pabitra Mitra

Convolutional Neural Network(CNN) based semantic segmentation require extensive pixel level manual annotation which is daunting for large microscopic images. The paper aimed towards mitigating this labeling effort by leveraging the recent concept of generative adversarial network(GAN) wherein a generator maps latent noise space to realistic images while discriminator differentiates between samples drawn from database and generator. We extend multi task learning discriminator-classifier...

10.1109/cvprw.2017.110 article EN 2017-07-01

Generating Interpretable Poverty Maps using Object Detection in Satellite Images

OPENALEX - Publications

Kumar Ayush Burak Uzkent Marshall Burke David B. Lobell Stefano Ermon

Accurate local-level poverty measurement is an essential task for governments and humanitarian organizations to track the progress towards improving livelihoods distribute scarce resources. Recent computer vision advances in using satellite imagery predict have shown increasing accuracy, but they do not generate features that are interpretable policymakers, inhibiting adoption by practitioners. Here we demonstrate computational framework accurately at a local level applying object detectors...

10.24963/ijcai.2020/608 article EN 2020-07-01

SieveNet: A Unified Framework for Robust Image-Based Virtual Try-On

OPENALEX - Publications

Surgan Jandial Ayush Chopra Kumar Ayush Mayur Hemani Abhijeet Kumar and 1 more

Image-based virtual try-on for fashion has gained considerable attention recently. The task requires trying on a clothing item target model image. An efficient framework this is composed of two stages: (1) warping (transforming) the cloth to align with pose and shape model, (2) texture transfer module seamlessly integrate warped onto Existing methods suffer from artifacts distortions in their output. In work, we present Sieve Net, robust image-based try-on. Firstly, introduce multi-stage...

10.1109/wacv45572.2020.9093458 article EN 2020-03-01

DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations

OPENALEX - Publications

Srinivas S S Kruthiventi Kumar Ayush R. Venkatesh Babu

Understanding and predicting the human visual attentional mechanism is an active area of research in fields neuroscience computer vision. In this work, we propose DeepFix, a first-of-its-kind fully convolutional neural network for accurate saliency prediction. Unlike classical works which characterize map using various hand-crafted features, our model automatically learns features hierarchical fashion predicts end-to-end manner. DeepFix designed to capture semantics at multiple scales while...

10.48550/arxiv.1510.02927 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Efficient Poverty Mapping from High Resolution Remote Sensing Images

OPENALEX - Publications

Kumar Ayush Burak Uzkent Kumar Tanmay Marshall Burke David B. Lobell and 1 more

The combination of high-resolution satellite imagery and machine learning have proven useful in many sustainability-related tasks, including poverty prediction, infrastructure measurement, forest monitoring. However, the accuracy afforded by comes at a cost, as such is extremely expensive to purchase scale. This creates substantial hurdle efficient scaling widespread adoption high-resolution-based approaches. To reduce acquisition costs while maintaining accuracy, we propose reinforcement...

10.1609/aaai.v35i1.16072 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

nanoNET: machine learning platform for predicting nanoparticles distribution in a polymer matrix

OPENALEX - Publications

Kumar Ayush Abhishek Seth Tarak K. Patra

Polymer nanocomposites (PNCs) offer a broad range of thermophysical properties that are linked to their compositions. However, it is challenging establish universal composition-property relationship in PNCs due wide-ranging composition and chemical space. Here, we address this problem develop new method model the composition-microstructure relation PNC through an intelligent machine-learning pipeline named nanoNET. The nanoNET nanoparticles (NPs) distribution predictor, built upon computer...

10.1039/d3sm00567d article EN Soft Matter 2023-01-01

Microbial Fabrication of Biogenic Nanoparticles and their Applications in Water Remediation

OPENALEX - Publications

Gongutri Borah Yvon ukwishaka Anupma Thakur Kumar Ayush Kashyap Komal and 3 more

10.1016/j.teengi.2025.100006 article EN Total environment engineering. 2025-01-01

Beyond Poverty: Identifying the Overlooked Factors Influencing Education

OPENALEX - Publications

K. K. Jain Kumar Ayush

10.1109/cictn64563.2025.10932371 article EN 2025-02-06

Contrasting temperature dependence of the band gap in CH3NH3PbX3 (X=I,</…

OPENALEX - Publications

Rishabh Saxena Jiban Kangsabanik Kumar Ayush Aga Shahee Shivam Singh and 6 more

Although hybrid halide perovskites $(\text{MAPb}{X}_{3},$ $\mathrm{MA}={\mathrm{CH}}_{3}{\mathrm{NH}}_{3} \mathrm{and} X=\mathrm{I}, \mathrm{Br}, \mathrm{Cl})$ have been ubiquitously explored from the photovoltaic perspective, there are still a few unanswered questions which require more fundamental understanding. One such unsettled issue is puzzling behavior of band gap. Unlike conventional semiconductors, $\text{MAPb}{X}_{3}$ $(X=\mathrm{I}, \mathrm{Br})$ found to show blueshift (increase)...

10.1103/physrevb.102.081201 article EN Physical review. B./Physical review. B 2020-08-03

Powering Virtual Try-On via Auxiliary Human Segmentation Learning

OPENALEX - Publications

Kumar Ayush Surgan Jandial Ayush Chopra Balaji Krishnamurthy

Image-based virtual try-on for fashion has gained considerable attention recently. This task requires to fit an in-shop cloth image on a target model image. An efficient framework this is composed of two stages: (1) warping the align with body shape and pose model, (2) composition module seamlessly integrate warped onto Existing methods suffer from artifacts distortions in their output. In work, we propose use auxiliary learning power existing state-of-the-art network. We leverage prediction...

10.1109/iccvw.2019.00397 article EN 2019-10-01

Efficient Conditional Pre-training for Transfer Learning

OPENALEX - Publications

Shuvam Chakraborty Burak Uzkent Kumar Ayush Kumar Tanmay Evan Sheehan and 1 more

Almost all the state-of-the-art neural networks for computer vision tasks are trained by (1) pre-training on a large-scale dataset and (2) finetuning target dataset. This strategy helps reduce dependence improves convergence rate generalization task. Although datasets is very useful new methods or models, its foremost disadvantage high training cost. To address this, we propose efficient filtering to select relevant subsets from Additionally, discover that lowering image resolutions in step...

10.1109/cvprw56347.2022.00469 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022-06-01

Powering Robust Fashion Retrieval With Information Rich Feature Embeddings

OPENALEX - Publications

Ayush Chopra Abhishek Sinha Hiresh Gupta Mausoom Sarkar Kumar Ayush and 1 more

Visual content based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a challenging problem owing to wide range of deformations clothing items along with visual distortions their images. In this paper, we propose Grid Search Network (GSN) learning feature embeddings fashion retrieval. The proposed approach posits the training procedure as search problem, focused on locating matches reference query image grid containing both positive and...

10.1109/cvprw.2019.00045 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019-06-01

Photophysical Model for Non-Exponential Relaxation Dynamics in Hybrid Perovskite Semiconductors

OPENALEX - Publications

Rishabh Saxena Kumar Ayush Nakul Jain Naresh Kumar Kumawat K. L. Narasimhan and 1 more

The photoluminescence (PL) decay of hybrid halide perovskite single crystals (MAPbX3, MA = CH3NH3+, Pb Pb2+, X Br–, and I–) is measured over 4 orders magnitude in intensity the time scales 100s nanoseconds to a few microseconds. This long PL non-exponential, suggesting presence distribution carrier relaxation times. Spectro-temporal studies show that emission peak red-shifts with increasing time. physics this problem closely related donor–acceptor pair recombination crystalline...

10.1021/acs.jpcc.7b11503 article EN The Journal of Physical Chemistry C 2017-12-05

Towards a Unified Framework for Visual Compatibility Prediction

OPENALEX - Publications

Anirudh Singhal Ayush Chopra Kumar Ayush Utkarsh R. Patel Balaji Krishnamurthy

Visual compatibility prediction refers to the task of determining if a set items go well together. Existing techniques for prioritize sensitivity type or context in item representations and evaluate using fill-in-the-blank (FITB) task. We scale FITB stresstest existing methods which highlights need framework that is sensitive multiple modalities relationships. In this work, we introduce unified learning jointly conditioned on type, context, style. The composed TC-GAE, graph-based network...

10.1109/wacv45572.2020.9093555 article EN 2020-03-01

Decoding OTT: Uncovering Global Insights for Film Industry Advancement and Cultural Exchange

OPENALEX - Publications

Vidisha Arvind Abhay Kumar Kumar Ayush Yogesh Pal

This study explores the effects of over-the-top content by examining data from popular streaming services such as Netflix, Hotstar Disney Plus, and Amazon Prime in order to learn more about consumer preferences, industry trends, cross-cultural film exchange. To improve user experience, makes use techniques including textual reviews analysis machine learning methods (K-Means Clustering, Linear Regression, Support Vector Machine Regression). Issues with OTT platforms, like churn issue biased...

10.1109/idciot59759.2024.10468048 article EN 2024-01-04

Transforming Wearable Data into Health Insights using Large Language Model Agents

OPENALEX - Publications

Mike A. Merrill Akshay Paruchuri Naghmeh Rezaei G. Kovács Javier Perez and 15 more

Despite the proliferation of wearable health trackers and importance sleep exercise to health, deriving actionable personalized insights from data remains a challenge because doing so requires non-trivial open-ended analysis these data. The recent rise large language model (LLM) agents, which can use tools reason about interact with world, presents promising opportunity enable such at scale. Yet, application LLM agents in analyzing personal is still largely untapped. In this paper, we...

10.48550/arxiv.2406.06464 preprint EN arXiv (Cornell University) 2024-06-10

Scaling Wearable Foundation Models

OPENALEX - Publications

Girish Narayanswamy Xin Liu Kumar Ayush Yuzhe Yang Xuhai Xu and 13 more

Wearable sensors have become ubiquitous thanks to a variety of health tracking features. The resulting continuous and longitudinal measurements from everyday life generate large volumes data; however, making sense these observations for scientific actionable insights is non-trivial. Inspired by the empirical success generative modeling, where neural networks learn powerful representations vast amounts text, image, video, or audio data, we investigate scaling properties sensor foundation...

10.48550/arxiv.2410.13638 preprint EN arXiv (Cornell University) 2024-10-17

Coming Soon ...