NFDI4DS | UHH-SEMS - Publication Details

Ankan Bansal

ORCID: 0000-0001-5578-4277

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5023195442

Research Areas

Face recognition and analysis
Face and Expression Recognition
Biometric Identification and Security
Video Surveillance and Tracking Methods
Domain Adaptation and Few-Shot Learning
Multimodal Machine Learning Applications
Advanced Neural Network Applications
Advanced Image and Video Retrieval Techniques
Human Pose and Action Recognition
Generative Adversarial Networks and Image Synthesis
Anomaly Detection Techniques and Applications
Face Recognition and Perception
Topic Modeling
Advanced Vision and Imaging
Natural Language Processing Techniques
Advanced Text Analysis Techniques
Hand Gesture Recognition Systems
Gait Recognition and Analysis
Adversarial Robustness in Machine Learning
Image Retrieval and Classification Techniques
Image Enhancement Techniques
Visual Attention and Saliency Detection
Robotics and Sensor-Based Localization
Image Processing and 3D Reconstruction

University of Maryland, College Park
2016-2022

Amazon (United States)
2021

Park University
2020

Arizona State University
2018

Indian Institute of Technology Kanpur
2013

UMDFaces: An annotated face dataset for training deep networks

OPENALEX - Publications

Ankan Bansal Anirudh Nanduri Carlos D. Castillo Rajeev Ranjan Rama Chellappa

Recent progress in face detection (including keypoint detection), and recognition is mainly being driven by (i) deeper convolutional neural network architectures, (ii) larger datasets. However, most of the large datasets are maintained private companies not publicly available. The academic computer vision community needs more varied to make further progress. In this paper, we introduce a new dataset, called UMDFaces, which has 367,888 annotated faces 8,277 subjects. We also evaluation...

10.1109/btas.2017.8272731 article EN 2017-10-01

Deep Learning for Understanding Faces: Machines May Be Just as Good, or Better, than Humans

OPENALEX - Publications

Rajeev Ranjan Swami Sankaranarayanan Ankan Bansal Navaneeth Bodla Jun-Cheng Chen and 3 more

Recent developments in deep convolutional neural networks (DCNNs) have shown impressive performance improvements on various object detection/recognition problems. This has been made possible due to the availability of large annotated data and a better understanding nonlinear mapping between images class labels, as well affordability powerful graphics processing units (GPUs). These learning also improved capabilities machines faces automatically executing tasks face detection, pose...

10.1109/msp.2017.2764116 article EN IEEE Signal Processing Magazine 2018-01-01

A Fast and Accurate System for Face Detection, Identification, and Verification

OPENALEX - Publications

Rajeev Ranjan Ankan Bansal Jingxiao Zheng Hongyu Xu Joshua Gleason and 5 more

The availability of large annotated datasets and affordable computation power have led to impressive improvements in the performance convolutional neural networks (CNNs) on various face analysis tasks. In this paper, we describe a deep learning pipeline for unconstrained identification verification which achieves state-of-the-art several benchmark datasets. We provide design details modules involved automatic recognition: detection, landmark localization alignment,...

10.1109/tbiom.2019.2908436 article EN IEEE Transactions on Biometrics Behavior and Identity Science 2019-04-01

Detecting Human-Object Interactions via Functional Generalization

OPENALEX - Publications

Ankan Bansal Sai Saketh Rambhatla Abhinav Shrivastava Rama Chellappa

We present an approach for detecting human-object interactions (HOIs) in images, based on the idea that humans interact with functionally similar objects a manner. The proposed model is simple and efficiently uses data, visual features of human, relative spatial orientation human object, knowledge take part humans. provide extensive experimental validation our demonstrate state-of-the-art results HOI detection. On HICO-Det dataset method achieves gain over 2.5% absolute points mean average...

10.1609/aaai.v34i07.6616 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

UPSET and ANGRI : Breaking High Performance Image Classifiers

OPENALEX - Publications

Sayantan Sarkar Ankan Bansal Upal Mahbub Rama Chellappa

In this paper, targeted fooling of high performance image classifiers is achieved by developing two novel attack methods. The first method generates universal perturbations for target classes and the second specific perturbations. Extensive experiments are conducted on MNIST CIFAR10 datasets to provide insights about proposed algorithms show their effectiveness.

10.48550/arxiv.1707.01159 preprint EN other-oa arXiv (Cornell University) 2017-01-01

The Do’s and Don’ts for CNN-Based Face Verification

OPENALEX - Publications

Ankan Bansal Carlos D. Castillo Rajeev Ranjan Rama Chellappa

While the research community appears to have developed a consensus on methods of acquiring annotated data, design and training CNNs, many questions still remain be answered. In this paper, we explore following that are critical face recognition research: (i) Can train images expect systems work videos? (ii) Are deeper datasets better than wider datasets? (iii) Does adding label noise lead improvement in performance deep networks? (iv) Is alignment needed for recognition? We address these by...

10.1109/iccvw.2017.299 article EN 2017-10-01

Pose and Joint-Aware Action Recognition

OPENALEX - Publications

Anshul Shah Shlok Mishra Ankan Bansal Jun-Cheng Chen Rama Chellappa and 1 more

Recent progress on action recognition has mainly focused RGB and optical flow features. In this paper, we approach the problem of joint-based recognition. Unlike other modalities, constellation joints their motion generate models with succinct human information for activity We present a new model recognition, which first extracts features from each joint separately through shared encoder before performing collective reasoning. Our selector module re-weights to select most discriminative...

10.1109/wacv51458.2022.00022 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022-01-01

Deep Features for Recognizing Disguised Faces in the Wild

OPENALEX - Publications

Ankan Bansal Rajeev Ranjan Carlos D. Castillo Rama Chellappa

Unconstrained face verification is a challenging problem owing to variations in pose, illumination, resolution of image, age, etc. This becomes even more complex when the subjects are actively trying deceive systems by wearing disguise. The under consideration here identify subject disguises and reject impostors look like interest. In this paper we present DCNN-based approach for recognizing people picking out impostors. We train two different networks on large dataset comprising still...

10.1109/cvprw.2018.00009 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018-06-01

Crystal Loss and Quality Pooling for Unconstrained Face Verification and Recognition

OPENALEX - Publications

Rajeev Ranjan Ankan Bansal Hongyu Xu Swami Sankaranarayanan Jun-Cheng Chen and 2 more

In recent years, the performance of face verification and recognition systems based on deep convolutional neural networks (DCNNs) has significantly improved. A typical pipeline for includes training a network subject classification with softmax loss, using penultimate layer output as feature descriptor, generating cosine similarity score given pair images or videos. The loss function does not optimize features to have higher positive pairs lower negative pairs, which leads gap. this paper,...

10.48550/arxiv.1804.01159 preprint EN other-oa arXiv (Cornell University) 2018-01-01

DocTr: Document Transformer for Structured Information Extraction in Documents

OPENALEX - Publications

Haofu Liao Aruni RoyChowdhury Weijian Li Ankan Bansal Yuting Zhang and 4 more

We present a new formulation for structured information extraction (SIE) from visually rich documents. address the limitations of existing IOB tagging and graph-based formulations, which are either overly reliant on correct ordering input text or struggle with decoding complex graph. Instead, motivated by anchor-based object detectors in computer vision, we represent an entity as anchor word bounding box, linking association between words. This is more robust to ordering, maintains compact...

10.1109/iccv51070.2023.01794 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

How are attributes expressed in face DCNNs?

OPENALEX - Publications

Prithviraj Dhar Ankan Bansal Carlos D. Castillo Joshua Gleason P. Jonathon Phillips and 1 more

As deep networks become increasingly accurate at recognizing faces, it is vital to understand how these process faces. While are solely trained recognize identities, they also contain face related information such as sex, age, and pose of the even when not learn attributes. We introduce expressivity a measure much feature vector informs us about an attribute, where can be from internal or final layers network. Expressivity computed by second neural network whose inputs features The output...

10.1109/fg47880.2020.00009 article EN 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021) 2020-11-01

People Counting in High Density Crowds from Still Images

OPENALEX - Publications

Ankan Bansal K. S. Venkatesh

We present a method of estimating the number people in high density crowds from still images. The estimates counts by fusing information multiple sources. Most existing work on crowd counting deals with very small (tens individuals) and use temporal videos. Our uses only images to estimate (hundreds thousands individuals). At this scale, we cannot rely one set features for count estimation. We, therefore, sources, viz. interest points (SIFT), Fourier analysis, wavelet decomposition, GLCM low...

10.48550/arxiv.1507.08445 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Object-Aware Cropping for Self-Supervised Learning

OPENALEX - Publications

Shlok Mishra Anshul Shah Ankan Bansal Abhyuday Jagannatha Abhishek Sharma and 2 more

A core component of the recent success self-supervised learning is cropping data augmentation, which selects sub-regions an image to be used as positive views in loss. The underlying assumption that randomly cropped and resized regions a given share information about objects interest, learned representation will capture. This mostly satisfied datasets such ImageNet where there large, centered object, highly likely present random crops full image. However, other OpenImages or COCO, are more...

10.48550/arxiv.2112.00319 preprint EN public-domain arXiv (Cornell University) 2021-01-01

DocTr: Document Transformer for Structured Information Extraction in Documents

OPENALEX - Publications

Haofu Liao Aruni RoyChowdhury Weijian Li Ankan Bansal Yuting Zhang and 4 more

We present a new formulation for structured information extraction (SIE) from visually rich documents. It aims to address the limitations of existing IOB tagging or graph-based formulations, which are either overly reliant on correct ordering input text struggle with decoding complex graph. Instead, motivated by anchor-based object detectors in vision, we represent an entity as anchor word and bounding box, linking association between words. This is more robust ordering, maintains compact...

10.48550/arxiv.2307.07929 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Proximity-Aware Hierarchical Clustering of unconstrained faces

OPENALEX - Publications

Wei-An Lin Jun-Cheng Chen Rajeev Ranjan Ankan Bansal Swami Sankaranarayanan and 2 more

10.1016/j.imavis.2018.06.007 article EN Image and Vision Computing 2018-07-04

Predicting Dynamical Evolution of Human Activities from a Single Image

OPENALEX - Publications

Suhas Lohit Ankan Bansal Nitesh Shroff Jaishanker K. Pillai Pavan Turaga and 1 more

A human pose often conveys not only the configuration of body parts, but also implicit predictive information about ensuing motion. This dynamic can benefit vision applications which lack explicit motion cues. The visual system easily perceive in still images. However, computational algorithms to infer and utilize it computer are limited. In this paper, we propose a probabilistic framework associated with pose. inference problem is posed as nonparametric density estimation on non-Euclidean...

10.1109/cvprw.2018.00079 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018-06-01

Coming Soon ...