Kumar Ayush

ORCID: 0000-0002-9680-2061
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Generative Adversarial Networks and Image Synthesis
  • Domain Adaptation and Few-Shot Learning
  • Advanced Image and Video Retrieval Techniques
  • Remote-Sensing Image Classification
  • COVID-19 epidemiological studies
  • Image Enhancement Techniques
  • Visual Attention and Saliency Detection
  • Visual perception and processing mechanisms
  • Image and Video Quality Assessment
  • Face Recognition and Perception
  • Natural Language Processing Techniques
  • Anomaly Detection Techniques and Applications
  • Recommender Systems and Techniques
  • Advanced Image Processing Techniques
  • Water resources management and optimization
  • Biomedical Text Mining and Ontologies
  • Perovskite Materials and Applications
  • Topic Modeling
  • Advanced Vision and Imaging
  • Solid-state spectroscopy and crystallography
  • Machine Learning in Materials Science
  • Advanced Neural Network Applications
  • Chalcogenide Semiconductor Thin Films
  • Land Use and Ecosystem Services
  • Video Surveillance and Tracking Methods

Bennett University
2024-2025

Government Medical College
2024

Indian Institute of Technology Madras
2023

Meril Life Sciences (India)
2023

Institute of Management Technology
2023

Lovely Professional University
2023

Stanford University
2019-2022

Indian Institute of Technology Bombay
2017-2021

Adobe Systems (United States)
2018-2019

Indian Space Research Organisation
2017

Hongjie Li Jasper Janssens Maxime De Waegeneer Sai Saroja Kolluru Kristofer Davie and 95 more Vincent Gardeux Wouter Saelens Fabrice David Maria Brbić Katina I. Spanier Jure Leskovec Colleen N. McLaughlin Qijing Xie Robert C. Jones Katja Brueckner Jiwon Shim Sudhir Gopal Tattikota Frank Schnorrer Katja Rust Todd Nystul Zita Carvalho-Santos Carlos Ribeiro Soumitra Pal Sharvani Mahadevaraju Teresa M. Przytycka Aaron M. Allen Stephen F. Goodwin Cameron W. Berry Margaret T. Fuller Helen White‐Cooper Erika Matunis Stephen DiNardo Anthony Galenza Lucy Erin O’Brien Julian A. T. Dow Heinrich Jasper Brian Oliver Norbert Perrimon Bart Deplancke Stephen R. Quake Liqun Luo Stein Aerts Devika Agarwal Yasir H. Ahmed-Braimah Michelle N Arbeitman Majd Ariss Jordan Augsburger Kumar Ayush Catherine C. Baker Torsten U. Banisch Katja Birker Rolf Bodmer Benjamin Bolival Susanna E. Brantley Julie A. Brill Nora C. Brown Norene A. Buehner Xiaoyu Cai Rita Cardoso-Figueiredo Fernando Casares Amy K. Chang Thomas R. Clandinin Sheela Crasta Claude Desplan Angela M. Detweiler Darshan B. Dhakan Erika Donà Stefanie Engert Swann Floc’hlay Nancy George Amanda J. González-Segarra Andrew K. Groves Samantha C. Gumbin Yanmeng Guo D. Harris Yael Heifetz Stephen L. Holtz Felix Horns Bruno Hudry Ruei‐Jiun Hung Yuh Nung Jan Jacob S Jaszczak Gregory S.X.E. Jefferis Jim Karkanias Timothy L. Karr Nadja Sandra Katheder James Kezos Anna Kim Seung K. Kim Lutz Kockel Νικόλαος Κωνσταντινίδης Thomas B. Kornberg Henry M. Krause Andrew Thomas Labott Meghan Laturney Ruth Lehmann Sarah G. Leinwand Jun Li Joshua Shing Shun Li Kai Li

For more than 100 years, the fruit fly

10.1126/science.abk2432 article EN Science 2022-03-03

Understanding and predicting the human visual attention mechanism is an active area of research in fields neuroscience computer vision. In this paper, we propose DeepFix, a fully convolutional neural network, which models bottom-up via saliency prediction. Unlike classical works, characterize map using various hand-crafted features, our model automatically learns features hierarchical fashion predicts end-to-end manner. DeepFix designed to capture semantics at multiple scales while taking...

10.1109/tip.2017.2710620 article EN IEEE Transactions on Image Processing 2017-06-01

Contrastive learning methods have significantly narrowed the gap between supervised and unsupervised on computer vision tasks. In this paper, we explore their application to geo-located datasets, e.g. remote sensing, where unlabeled data is often abundant but labeled scarce. We first show that due different characteristics, a non-trivial persists contrastive standard benchmarks. To close gap, propose novel training exploit spatio-temporal structure of sensing data. leverage spatially aligned...

10.1109/iccv48922.2021.01002 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Convolutional Neural Network(CNN) based semantic segmentation require extensive pixel level manual annotation which is daunting for large microscopic images. The paper aimed towards mitigating this labeling effort by leveraging the recent concept of generative adversarial network(GAN) wherein a generator maps latent noise space to realistic images while discriminator differentiates between samples drawn from database and generator. We extend multi task learning discriminator-classifier...

10.1109/cvprw.2017.110 article EN 2017-07-01

Accurate local-level poverty measurement is an essential task for governments and humanitarian organizations to track the progress towards improving livelihoods distribute scarce resources. Recent computer vision advances in using satellite imagery predict have shown increasing accuracy, but they do not generate features that are interpretable policymakers, inhibiting adoption by practitioners. Here we demonstrate computational framework accurately at a local level applying object detectors...

10.24963/ijcai.2020/608 article EN 2020-07-01

Image-based virtual try-on for fashion has gained considerable attention recently. The task requires trying on a clothing item target model image. An efficient framework this is composed of two stages: (1) warping (transforming) the cloth to align with pose and shape model, (2) texture transfer module seamlessly integrate warped onto Existing methods suffer from artifacts distortions in their output. In work, we present Sieve Net, robust image-based try-on. Firstly, introduce multi-stage...

10.1109/wacv45572.2020.9093458 article EN 2020-03-01

Understanding and predicting the human visual attentional mechanism is an active area of research in fields neuroscience computer vision. In this work, we propose DeepFix, a first-of-its-kind fully convolutional neural network for accurate saliency prediction. Unlike classical works which characterize map using various hand-crafted features, our model automatically learns features hierarchical fashion predicts end-to-end manner. DeepFix designed to capture semantics at multiple scales while...

10.48550/arxiv.1510.02927 preprint EN other-oa arXiv (Cornell University) 2015-01-01

The combination of high-resolution satellite imagery and machine learning have proven useful in many sustainability-related tasks, including poverty prediction, infrastructure measurement, forest monitoring. However, the accuracy afforded by comes at a cost, as such is extremely expensive to purchase scale. This creates substantial hurdle efficient scaling widespread adoption high-resolution-based approaches. To reduce acquisition costs while maintaining accuracy, we propose reinforcement...

10.1609/aaai.v35i1.16072 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Polymer nanocomposites (PNCs) offer a broad range of thermophysical properties that are linked to their compositions. However, it is challenging establish universal composition-property relationship in PNCs due wide-ranging composition and chemical space. Here, we address this problem develop new method model the composition-microstructure relation PNC through an intelligent machine-learning pipeline named nanoNET. The nanoNET nanoparticles (NPs) distribution predictor, built upon computer...

10.1039/d3sm00567d article EN Soft Matter 2023-01-01

Although hybrid halide perovskites $(\text{MAPb}{X}_{3},$ $\mathrm{MA}={\mathrm{CH}}_{3}{\mathrm{NH}}_{3} \mathrm{and} X=\mathrm{I}, \mathrm{Br}, \mathrm{Cl})$ have been ubiquitously explored from the photovoltaic perspective, there are still a few unanswered questions which require more fundamental understanding. One such unsettled issue is puzzling behavior of band gap. Unlike conventional semiconductors, $\text{MAPb}{X}_{3}$ $(X=\mathrm{I}, \mathrm{Br})$ found to show blueshift (increase)...

10.1103/physrevb.102.081201 article EN Physical review. B./Physical review. B 2020-08-03

Image-based virtual try-on for fashion has gained considerable attention recently. This task requires to fit an in-shop cloth image on a target model image. An efficient framework this is composed of two stages: (1) warping the align with body shape and pose model, (2) composition module seamlessly integrate warped onto Existing methods suffer from artifacts distortions in their output. In work, we propose use auxiliary learning power existing state-of-the-art network. We leverage prediction...

10.1109/iccvw.2019.00397 article EN 2019-10-01

Almost all the state-of-the-art neural networks for computer vision tasks are trained by (1) pre-training on a large-scale dataset and (2) finetuning target dataset. This strategy helps reduce dependence improves convergence rate generalization task. Although datasets is very useful new methods or models, its foremost disadvantage high training cost. To address this, we propose efficient filtering to select relevant subsets from Additionally, discover that lowering image resolutions in step...

10.1109/cvprw56347.2022.00469 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022-06-01

Visual content based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a challenging problem owing to wide range of deformations clothing items along with visual distortions their images. In this paper, we propose Grid Search Network (GSN) learning feature embeddings fashion retrieval. The proposed approach posits the training procedure as search problem, focused on locating matches reference query image grid containing both positive and...

10.1109/cvprw.2019.00045 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019-06-01

The photoluminescence (PL) decay of hybrid halide perovskite single crystals (MAPbX3, MA = CH3NH3+, Pb Pb2+, X Br–, and I–) is measured over 4 orders magnitude in intensity the time scales 100s nanoseconds to a few microseconds. This long PL non-exponential, suggesting presence distribution carrier relaxation times. Spectro-temporal studies show that emission peak red-shifts with increasing time. physics this problem closely related donor–acceptor pair recombination crystalline...

10.1021/acs.jpcc.7b11503 article EN The Journal of Physical Chemistry C 2017-12-05

Visual compatibility prediction refers to the task of determining if a set items go well together. Existing techniques for prioritize sensitivity type or context in item representations and evaluate using fill-in-the-blank (FITB) task. We scale FITB stresstest existing methods which highlights need framework that is sensitive multiple modalities relationships. In this work, we introduce unified learning jointly conditioned on type, context, style. The composed TC-GAE, graph-based network...

10.1109/wacv45572.2020.9093555 article EN 2020-03-01

This study explores the effects of over-the-top content by examining data from popular streaming services such as Netflix, Hotstar Disney Plus, and Amazon Prime in order to learn more about consumer preferences, industry trends, cross-cultural film exchange. To improve user experience, makes use techniques including textual reviews analysis machine learning methods (K-Means Clustering, Linear Regression, Support Vector Machine Regression). Issues with OTT platforms, like churn issue biased...

10.1109/idciot59759.2024.10468048 article EN 2024-01-04

Despite the proliferation of wearable health trackers and importance sleep exercise to health, deriving actionable personalized insights from data remains a challenge because doing so requires non-trivial open-ended analysis these data. The recent rise large language model (LLM) agents, which can use tools reason about interact with world, presents promising opportunity enable such at scale. Yet, application LLM agents in analyzing personal is still largely untapped. In this paper, we...

10.48550/arxiv.2406.06464 preprint EN arXiv (Cornell University) 2024-06-10

Wearable sensors have become ubiquitous thanks to a variety of health tracking features. The resulting continuous and longitudinal measurements from everyday life generate large volumes data; however, making sense these observations for scientific actionable insights is non-trivial. Inspired by the empirical success generative modeling, where neural networks learn powerful representations vast amounts text, image, video, or audio data, we investigate scaling properties sensor foundation...

10.48550/arxiv.2410.13638 preprint EN arXiv (Cornell University) 2024-10-17
Coming Soon ...