NFDI4DS | UHH-SEMS - Publication Details

Ajad Chhatkuli

ORCID: 0000-0003-2051-2209

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5009256545

Research Areas

Robotics and Sensor-Based Localization
Advanced Vision and Imaging
Advanced Image and Video Retrieval Techniques
Optical measurement and interference techniques
3D Shape Modeling and Analysis
Multimodal Machine Learning Applications
Domain Adaptation and Few-Shot Learning
Computer Graphics and Visualization Techniques
Advanced Neural Network Applications
Sparse and Compressive Sensing Techniques
Image and Object Detection Techniques
Advanced Image Processing Techniques
Retinal and Macular Surgery
Image Retrieval and Classification Techniques
Image Processing Techniques and Applications
Indoor and Outdoor Localization Technologies
Generative Adversarial Networks and Image Synthesis
Augmented Reality Applications
Industrial Vision Systems and Defect Detection
Advanced Memory and Neural Computing
Retinal Imaging and Analysis
Topic Modeling
Video Analysis and Summarization
Visual Attention and Saliency Detection
Human Motion and Animation

ETH Zurich
2017-2024

Institute for Social and Environmental Research-Nepal
2024

Institut Pascal
2016-2017

Université Clermont Auvergne
2014-2016

Universidad Blas Pascal
2016

Centre National de la Recherche Scientifique
2014

HES-SO University of Applied Sciences and Arts Western Switzerland
2013

Université de Bourgogne
2013

Vision Transformers with Hierarchical Attention

OPENALEX - Publications

Yun Liu Yu-Huan Wu Guolei Sun Le Zhang Ajad Chhatkuli and 1 more

Abstract This paper tackles the high computational/space complexity associated with multi-head self-attention (MHSA) in vanilla vision transformers. To this end, we propose hierarchical MHSA (H-MHSA), a novel approach that computes sell-attention fashion. Specifically, first divide input image into patches as commonly done, and each patch is viewed token. Then, proposed H-MHSA learns token relationships within local patches, serving relationship modeling. small are merged larger ones, models...

10.1007/s11633-024-1393-8 article EN cc-by Deleted Journal 2024-04-19

Non-Rigid Shape-from-Motion for Isometric Surfaces using Infinitesimal Planarity

OPENALEX - Publications

Ajad Chhatkuli Daniel Pizarro Adrien Bartoli

This paper proposes a general framework to solve Non-Rigid Shape-from-Motion (NRSfM) with the perspective camera under isometric deformations. Contrary usual low-rank linear shape basis, isometry allows us recover complex deformations from sparse set of images. Existing methods suffer ambiguities and may be very expensive solve. We bring four main contributions. First, we formulate NRSfM as system first-order Partial Differential Equations (PDE) involving shape’s depth normal field an...

10.5244/c.28.11 article EN 2014-01-01

Automatic Tool Landmark Detection for Stereo Vision in Robot-Assisted Retinal Surgery

OPENALEX - Publications

Thomas Probst Kevis-Kokitsi Maninis Ajad Chhatkuli Mouloud Ourak Emmanuel Vander Poorten and 1 more

Computer vision and robotics are being increasingly applied in medical interventions. Especially interventions where extreme precision is required, they could make a difference. One such application robot-assisted retinal microsurgery. In recent works, conducted under stereo-microscope, with robot-controlled surgical tool. The complementarity of computer has, however, not yet been fully exploited. order to improve the robot control, we interested three-dimensional (3-D) reconstruction...

10.1109/lra.2017.2778020 article EN IEEE Robotics and Automation Letters 2017-11-27

A Stable Analytical Framework for Isometric Shape-from-Template by Surface Integration

OPENALEX - Publications

Ajad Chhatkuli Daniel Pizarro Adrien Bartoli Toby Collins

Shape-from-Template (SfT) reconstructs the shape of a deforming surface from single image, 3D template and deformation prior. For isometric deformations, this is well-posed problem. However, previous methods which require no initialization break down when perspective effects are small, happens object small or viewed larger distances. That is, they do not handle all projection geometries. We propose stable SfT that accurately reconstruct for follow existing approach using first-order...

10.1109/tpami.2016.2562622 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2016-05-04

Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation

OPENALEX - Publications

Rui Gong Yuhua Chen Danda Pani Paudel Yawei Li Ajad Chhatkuli and 3 more

Open compound domain adaptation (OCDA) is a setting, where target modeled as of multiple unknown homogeneous domains, which brings the advantage improved generalization to unseen domains. In this work, we propose principled meta-learning based approach OCDA for semantic segmentation, MOCDA, by modeling unlabeled continuously. Our consists four key steps. First, cluster into sub-target domains image styles, extracted in an unsupervised manner. Then, different are split independent branches,...

10.1109/cvpr46437.2021.00824 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Mapping, Localization and Path Planning for Image-Based Navigation Using Visual Features and Map

OPENALEX - Publications

Janine Thoma Danda Pani Paudel Ajad Chhatkuli Thomas Probst Luc Van Gool

Building on progress in feature representations for image retrieval, image-based localization has seen a surge of research interest. Image-based the advantage being inexpensive and efficient, often avoiding use 3D metric maps altogether. That said, need to maintain large amount reference images as an effective support scene, nonetheless calls them be organized map structure some kind. The problem arises part navigation process. We are, therefore, interested summarizing set landmarks, which...

10.1109/cvpr.2019.00756 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Stable Template-Based Isometric 3D Reconstruction in All Imaging Conditions by Linear Least-Squares

OPENALEX - Publications

Ajad Chhatkuli Daniel Pizarro Adrien Bartoli

It has been recently shown that reconstructing an isometric surface from a single 2D input image matched to 3D template was well-posed problem. This however does not tell us how reconstruction algorithms will behave in practical conditions, where the amount of perspective is generally small and projection thus behaves like weak-perspective or orthography. We here bring answers what theoretically recoverable such imaging explain why existing convex numerical solutions analytical may be...

10.1109/cvpr.2014.96 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2014-06-01

Inextensible Non-Rigid Structure-from-Motion by Second-Order Cone Programming

OPENALEX - Publications

Ajad Chhatkuli Daniel Pizarro Toby Collins Adrien Bartoli

We present a global and convex formulation for the template-less 3D reconstruction of deforming object with perspective camera. show first time how to construct Second-Order Cone Programming (SOCP) problem Non-Rigid Structure-from-Motion (NRSfM) using Maximum-Depth Heuristic (MDH). In this regard, we deviate strongly from general trend affine cameras factorization-based methods solve NRSfM, which do not perform well complex nonlinear deformations. MDH, points' depths are maximized so that...

10.1109/tpami.2017.2762669 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2017-10-13

Inextensible Non-Rigid Shape-from-Motion by Second-Order Cone Programming

OPENALEX - Publications

Ajad Chhatkuli Daniel Pizarro Toby Collins Adrien Bartoli

We present a global and convex formulation for template-less 3D reconstruction of deforming object with the perspective camera. show first time how to construct Second-Order Cone Programming (SOCP) problem Non-Rigid Shape-from-Motion (NRSfM) using Maximum-Depth Heuristic (MDH). In this regard, we deviate strongly from general trend affine cameras factorization-based methods solve NRSfM. MDH, points' depths are maximized so that distance between neighbouring points in camera space upper...

10.1109/cvpr.2016.190 article EN 2016-06-01

Unsupervised Learning of Consensus Maximization for 3D Vision Problems

OPENALEX - Publications

Thomas Probst Danda Pani Paudel Ajad Chhatkuli Luc Van Gool

Consensus maximization is a key strategy in 3D vision for robust geometric model estimation from measurements with outliers. Generic methods consensus maximization, such as Random Sampling and (RANSAC), have played tremendous role the success of vision, spite ubiquity However, replicating same generic behaviour deeply learned architecture, using supervised approaches, has proven to be difficult. In that context, unsupervised huge potential adapt any unseen data distribution, therefore are...

10.1109/cvpr.2019.00102 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Separating compound figures in journal articles to allow for subfigure classification

OPENALEX - Publications

Ajad Chhatkuli Antonio Foncubierta–Rodríguez Dimitrios Markonis Fabrice Mériaudeau Henning Müller

Journal images represent an important part of the knowledge stored in medical literature. Figure classification has received much attention as information image types can be used a variety contexts to focus search and filter out unwanted or "noise", for example non–clinical images. A major problem figure is fact that many figures biomedical literature are compound do often contain more than single type. Some journals separate into several parts but not, thus requiring currently manual...

10.1117/12.2007897 article EN Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE 2013-03-29

ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization

OPENALEX - Publications

Menelaos Kanakis Simon Maurer Matteo Spallanzani Ajad Chhatkuli Luc Van Gool

Efficient detection and description of geometric regions in images is a prerequisite visual systems for localization mapping. Such still rely on traditional handcrafted methods efficient generation lightweight descriptors, common limitation the more powerful neural network models that come with high compute specific hardware requirements. In this paper, we focus adaptations required by networks to enable their use computationally limited platforms such as robots, mobile, augmented reality...

10.1109/cvprw59228.2023.00651 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

Deformable Neural Radiance Fields using RGB and Event Cameras

OPENALEX - Publications

Qi Ma Danda Pani Paudel Ajad Chhatkuli Luc Van Gool

Modeling Neural Radiance Fields for fast-moving deformable objects from visual data alone is a challenging problem. A major issue arises due to the high deformation and low acquisition rates. To address this problem, we propose use event cameras that offer very fast of change in an asynchronous manner. In work, develop novel method model neural radiance fields using RGB cameras. The proposed uses stream events calibrated sparse frames. our setup, camera pose at individual –required integrate...

10.1109/iccv51070.2023.00332 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Efficient Conditional GAN Transfer with Knowledge Propagation across Classes

OPENALEX - Publications

Mohamad Shahbazi Zhiwu Huang Danda Pani Paudel Ajad Chhatkuli Luc Van Gool

Generative adversarial networks (GANs) have shown impressive results in both unconditional and conditional image generation. In recent literature, it is that pre-trained GANs, on a different dataset, can be transferred to improve the generation from small target data. The same, however, has not been well-studied case of GANs (cGANs), which provides new opportunities for knowledge transfer compared setup. particular, classes may borrow related old classes, or share among themselves training....

10.1109/cvpr46437.2021.01199 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Vision Transformers with Hierarchical Attention

OPENALEX - Publications

Yun Liu Yuhuan Wu Guolei Sun Le Zhang Ajad Chhatkuli and 1 more

This paper tackles the high computational/space complexity associated with Multi-Head Self-Attention (MHSA) in vanilla vision transformers. To this end, we propose Hierarchical MHSA (H-MHSA), a novel approach that computes self-attention hierarchical fashion. Specifically, first divide input image into patches as commonly done, and each patch is viewed token. Then, proposed H-MHSA learns token relationships within local patches, serving relationship modeling. small are merged larger ones,...

10.48550/arxiv.2106.03180 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Live image parsing in uterine laparoscopy

OPENALEX - Publications

Ajad Chhatkuli Adrien Bartoli Abed Malti Toby Collins

Augmented Reality (AR) can improve the information delivery to surgeons. In laparosurgery, primary goal of AR is provide multimodal overlaid in live laparoscopic videos. For gynecologic laparoscopy, 3D reconstruction uterus and its deformable registration preoperative data form major problems AR. Shape-from-Shading (SfS) inter-frame require an accurate identification region, occlusions due surgical tools, specularities, other tissues. We propose a cascaded patient-specific real-time...

10.1109/isbi.2014.6868106 article EN 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI) 2014-04-01

Convex Relaxations for Consensus and Non-Minimal Problems in 3D Vision

OPENALEX - Publications

Thomas Probst Danda Pani Paudel Ajad Chhatkuli Luc Van Gool

In this paper, we formulate a generic non-minimal solver using the existing tools of Polynomials Optimization Problems (POP) from computational algebraic geometry. The proposed method exploits well known Shor's or Lasserre's relaxations, whose theoretical aspects are also discussed. Notably, further exploit POP formulation for consensus maximization problems in 3D vision. Our framework is simple and straightforward to implement, which supported by three diverse applications vision, namely...

10.1109/iccv.2019.01033 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Continuous Pose for Monocular Cameras in Neural Implicit Representation

OPENALEX - Publications

Qi Ma Danda Pani Paudel Ajad Chhatkuli Luc Van Gool

10.1109/cvpr52733.2024.00506 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Unsupervised Template Warp Consistency for Implicit Surface Correspondences

OPENALEX - Publications

Mengya Liu Ajad Chhatkuli Janis Postels Luc Van Gool Federico Tombari

Abstract Unsupervised template discovery via implicit representation in a category of shapes has recently shown strong performance. At the core, such methods deform input to common space which allows establishing correspondences as well shapes. In this work we investigate inherent assumption that neural field optimization naturally leads consistently warped shapes, thus providing both good shape reconstruction and correspondences. Contrary convenient assumption, practice observe is not case,...

10.1111/cgf.14745 article EN cc-by-nc Computer Graphics Forum 2023-05-01

Coming Soon ...