NFDI4DS | UHH-SEMS - Publication Details

Kevis-Kokitsi Maninis

ORCID: 0000-0003-3776-0049

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5038897190

Research Areas

Advanced Neural Network Applications
Advanced Image and Video Retrieval Techniques
Visual Attention and Saliency Detection
Advanced Vision and Imaging
Retinal Imaging and Analysis
3D Shape Modeling and Analysis
Human Pose and Action Recognition
Medical Image Segmentation Techniques
Domain Adaptation and Few-Shot Learning
Robotics and Sensor-Based Localization
3D Surveying and Cultural Heritage
Radiomics and Machine Learning in Medical Imaging
Retinal and Macular Surgery
Medical Imaging and Analysis
Glaucoma and retinal disorders
Automated Road and Building Extraction
COVID-19 diagnosis using AI
AI in cancer detection
Adversarial Robustness in Machine Learning
Intraocular Surgery and Lenses
Video Analysis and Summarization
Multimodal Machine Learning Applications
Computational Physics and Python Applications
Hepatocellular Carcinoma Treatment and Prognosis
Digital Imaging for Blood Diseases

Google (United States)
2023-2025

DeepMind (United Kingdom)
2025

Google (Switzerland)
2022

ETH Zurich
2016-2022

Board of the Swiss Federal Institutes of Technology
2019-2022

One-Shot Video Object Segmentation

OPENALEX - Publications

Sergi Caelles Kevis-Kokitsi Maninis Jordi Pont-Tuset Laura Leal-Taixé Daniel Cremers and 1 more

This paper tackles the task of semi-supervised video object segmentation, i.e., separation an from background in a video, given mask first frame. We present One-Shot Video Object Segmentation (OSVOS), based on fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned ImageNet, foreground and finally learning appearance single annotated test sequence (hence one-shot). Although all frames are processed independently, results...

10.1109/cvpr.2017.565 article EN 2017-07-01

The Liver Tumor Segmentation Benchmark (LiTS)

OPENALEX - Publications

Patrick Bilic Patrick Ferdinand Christ Hongwei Li Eugene Vorontsov Avi Ben-Cohen and 95 more

In this work, we report the set-up and results of Liver Tumor Segmentation Benchmark (LiTS), which was organized in conjunction with IEEE International Symposium on Biomedical Imaging (ISBI) 2017 Conferences Medical Image Computing Computer-Assisted Intervention (MICCAI) 2018. The image dataset is diverse contains primary secondary tumors varied sizes appearances various lesion-to-background levels (hyper-/hypo-dense), created collaboration seven hospitals research institutions. Seventy-five...

10.1016/j.media.2022.102680 article EN cc-by-nc-nd Medical Image Analysis 2022-11-17

Deep Extreme Cut: From Extreme Points to Object Segmentation

OPENALEX - Publications

Kevis-Kokitsi Maninis Sergi Caelles Jordi Pont-Tuset Luc Van Gool

This paper explores the use of extreme points in an object (left-most, right-most, top, bottom pixels) as input to obtain precise segmentation for images and videos. We do so by adding extra channel image a convolutional neural network (CNN), which contains Gaussian centered each points. The CNN learns transform this information into that matches those demonstrate usefulness approach guided (grabcut-style), interactive segmentation, video dense annotation. show we most results date, also...

10.1109/cvpr.2018.00071 article EN 2018-06-01

Video Object Segmentation without Temporal Information

OPENALEX - Publications

Kevis-Kokitsi Maninis Sergi Caelles Y. Chen Jordi Pont-Tuset Laura Leal-Taixé and 2 more

Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency redundancy consecutive frames. When smoothness is suddenly broken, such as when an object occluded, or some frames are missing a sequence, result of these can deteriorate significantly. This paper explores orthogonal approach each frame independently, i.e., disregarding information. In particular, it tackles task semi-supervised segmentation: separation...

10.1109/tpami.2018.2838670 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-05-23

The Liver Tumor Segmentation Benchmark (LiTS)

OPENALEX - Publications

Patrick Bilic Patrick Ferdinand Christ Hongwei Li Eugene Vorontsov Avi Ben-Cohen and 95 more

10.48550/arxiv.1901.04056 preprint EN cc-by-nc-nd arXiv (Cornell University) 2019-01-01

Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks

OPENALEX - Publications

Kevis-Kokitsi Maninis Jordi Pont-Tuset Pablo Arbeláez Luc Van Gool

We present Convolutional Oriented Boundaries (COB), which produces multiscale oriented contours and region hierarchies starting from generic image classification Neural Networks (CNNs). COB is computationally efficient, because it requires a single CNN forward pass for multi-scale contour detection uses novel sparse boundary representation hierarchical segmentation; gives significant leap in performance over the state-of-the-art, generalizes very well to unseen categories datasets....

10.1109/tpami.2017.2700300 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2017-05-02

Attentive Single-Tasking of Multiple Tasks

OPENALEX - Publications

Kevis-Kokitsi Maninis Ilija Radosavovic Iasonas Kokkinos

In this work we address task interference in universal networks by considering that a network is trained on multiple tasks, but performs one at time, an approach refer to as "single-tasking tasks". The thus modifies its behaviour through task-dependent feature adaptation, or attention. This gives the ability accentuate features are adapted task, while shunning irrelevant ones. We further reduce forcing gradients be statistically indistinguishable adversarial training, ensuring common...

10.1109/cvpr.2019.00195 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

The 2019 DAVIS Challenge on VOS: Unsupervised Multi-Object Segmentation

OPENALEX - Publications

Sergi Caelles Jordi Pont-Tuset Federico Perazzi Alberto Montes Kevis-Kokitsi Maninis and 1 more

We present the 2019 DAVIS Challenge on Video Object Segmentation, third edition of series, a public competition designed for task Segmentation (VOS). In addition to original semi-supervised track and interactive introduced in previous edition, new unsupervised multi-object will be featured this year. newly track, participants are asked provide non-overlapping object proposals each image, along with an identifier linking them between frames (i.e. video proposals), without any test-time human...

10.48550/arxiv.1905.00737 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Vid2CAD: CAD Model Alignment Using Multi-View Constraints From Videos

OPENALEX - Publications

Kevis-Kokitsi Maninis Stefan Popov Matthias Niesner Vittorio Ferrari

We address the task of aligning CAD models to a video sequence complex scene containing multiple objects. Our method can process arbitrary videos and fully automatically recover 9 DoF pose for each object appearing in it, thus them common 3D coordinate frame. The core idea our is integrate neural network predictions from individual frames with temporally global, multi-view constraint optimization formulation. This integration resolves scale depth ambiguities per-frame predictions, generally...

10.1109/tpami.2022.3146082 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-01-25

Probing the 3D Awareness of Visual Foundation Models

OPENALEX - Publications

Mohamed El Banani Amit Raj Kevis-Kokitsi Maninis Abhishek Kar Yuanzhen Li and 5 more

10.1109/cvpr52733.2024.02059 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Automatic Tool Landmark Detection for Stereo Vision in Robot-Assisted Retinal Surgery

OPENALEX - Publications

Thomas Probst Kevis-Kokitsi Maninis Ajad Chhatkuli Mouloud Ourak Emmanuel Vander Poorten and 1 more

Computer vision and robotics are being increasingly applied in medical interventions. Especially interventions where extreme precision is required, they could make a difference. One such application robot-assisted retinal microsurgery. In recent works, conducted under stereo-microscope, with robot-controlled surgical tool. The complementarity of computer has, however, not yet been fully exploited. order to improve the robot control, we interested three-dimensional (3-D) reconstruction...

10.1109/lra.2017.2778020 article EN IEEE Robotics and Automation Letters 2017-11-27

EgoCast: Forecasting Egocentric Human Pose in the Wild

OPENALEX - Publications

María Escobar Juanita Puentes Cristhian Forigua Jordi Pont-Tuset Kevis-Kokitsi Maninis and 1 more

10.1109/wacv61041.2025.00569 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

Detection-aided liver lesion segmentation using deep learning

OPENALEX - Publications

Míriam Bellver Kevis-Kokitsi Maninis Jordi Pont-Tuset Xavier Giró-i-Nieto Jordi Torres and 1 more

A fully automatic technique for segmenting the liver and localizing its unhealthy tissues is a convenient tool in order to diagnose hepatic diseases assess response according treatments. In this work we propose method segment lesions from Computed Tomography (CT) scans using Convolutional Neural Networks (CNNs), that have proven good results variety of computer vision tasks, including medical imaging. The network segments consists cascaded architecture, which first focuses on region it....

10.48550/arxiv.1711.11069 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Iterative Deep Learning for Road Topology Extraction

OPENALEX - Publications

Carles Ventura Jordi Pont-Tuset Sergi Caelles Kevis-Kokitsi Maninis Luc Van Gool

This paper tackles the task of estimating topology road networks from aerial images. Building on top a global model that performs dense semantical classification pixels image, we design Convolutional Neural Network (CNN) predicts local connectivity among central pixel an input patch and its border points. By iterating this sweep whole image infer network, inspired by human delineating complex network with tip their finger. We perform extensive comprehensive qualitative quantitative...

10.48550/arxiv.1808.09814 preprint EN other-oa arXiv (Cornell University) 2018-01-01

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

OPENALEX - Publications

Varun Jampani Kevis-Kokitsi Maninis Andreas Engelhardt Arjun Karpur Khanh-Nguyen Truong and 11 more

Recent advances in neural reconstruction enable high-quality 3D object from casually captured image collections. Current techniques mostly analyze their progress on relatively simple collections where Structure-from-Motion (SfM) can provide ground-truth (GT) camera poses. We note that SfM tend to fail in-the-wild such as search results with varying backgrounds and illuminations. To systematic research casual captures, we propose NAVI: a new dataset of category-agnostic objects scans along...

10.48550/arxiv.2306.09109 preprint EN other-oa arXiv (Cornell University) 2023-01-01

CAD-Estate: Large-scale CAD Model Annotation in RGB Videos

OPENALEX - Publications

Kevis-Kokitsi Maninis Stefan Popov Matthias Nießner Vittorio Ferrari

We propose a method for annotating videos of complex multi-object scenes with globally-consistent 3D representation the objects. annotate each object CAD model from database, and place it in coordinate frame scene 9-DoF pose transformation. Our is semi-automatic works on commonly-available RGB videos, without requiring depth sensor. Many steps are performed automatically, tasks by humans simple, well-specified, require only limited reasoning 3D. This makes them feasible crowd-sourcing has...

10.1109/iccv51070.2023.01847 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Coming Soon ...