Anton Milan

ORCID: 0000-0002-9719-3616
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Video Surveillance and Tracking Methods
  • Anomaly Detection Techniques and Applications
  • Advanced Neural Network Applications
  • Domain Adaptation and Few-Shot Learning
  • Human Pose and Action Recognition
  • Modular Robots and Swarm Intelligence
  • Machine Learning and Data Classification
  • Robotics and Sensor-Based Localization
  • Target Tracking and Data Fusion in Sensor Networks
  • Impact of Light on Environment and Health
  • Infrared Target Detection Methodologies
  • Robot Manipulation and Learning
  • Remote-Sensing Image Classification
  • Robotic Path Planning Algorithms
  • Visual Attention and Saliency Detection
  • Advanced Optical Sensing Technologies
  • Advanced Chemical Sensor Technologies
  • Soft Robotics and Applications
  • Face and Expression Recognition
  • Fire Detection and Safety Systems
  • Multimodal Machine Learning Applications
  • Advanced Image and Video Retrieval Techniques
  • Advanced Vision and Imaging
  • Machine Learning and Algorithms
  • Industrial Vision Systems and Defect Detection

Amazon (United States)
2021

Amazon (Germany)
2017-2020

The University of Adelaide
2015-2017

University of Bonn
2017

Technical University of Darmstadt
2013

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and also been the first choice for dense classification problems such as semantic segmentation. However, repeated subsampling operations like pooling or convolution striding CNNs lead to a significant decrease initial image resolution. Here, we present RefineNet, generic multi-path refinement network that explicitly exploits all information available along down-sampling process...

10.1109/cvpr.2017.549 article EN 2017-07-01

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, often provide most objective measure performance therefore important guides reseach. Recently, a new benchmark Multiple Object Tracking, MOTChallenge, was launched with goal collecting existing data creating framework standardized evaluation multiple object tracking methods. The first release focuses on people tracking, since pedestrians...

10.48550/arxiv.1603.00831 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Many recent advances in multiple target tracking aim at finding a (nearly) optimal set of trajectories within temporal window. To handle the large space possible trajectory hypotheses, it is typically reduced to finite by some form data-driven or regular discretization. In this work, we propose an alternative formulation multitarget as minimization continuous energy. Contrary approaches, focus on designing energy that corresponds more complete representation problem, rather than one amenable...

10.1109/tpami.2013.103 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2013-05-27

We present a novel approach to online multi-target tracking based on recurrent neural networks (RNNs). Tracking multiple objects in real-world scenes involves many challenges, including a) an a-priori unknown and time-varying number of targets, b) continuous state estimation all c) discrete combinatorial problem data association. Most previous methods involve complex models that require tedious tuning parameters. Here, we propose for the first time, end-to-end learning tracking. Existing...

10.1609/aaai.v31i1.11194 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2017-02-12

In the recent past, computer vision community has developed centralized benchmarks for performance evaluation of a variety tasks, including generic object and pedestrian detection, 3D reconstruction, optical flow, single-object short-term tracking, stereo estimation. Despite potential pitfalls such benchmarks, they have proved to be extremely helpful advance state art in respective area. Interestingly, there been rather limited work on standardization quantitative multiple target tracking....

10.48550/arxiv.1504.01942 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, often provide most objective measure performance therefore important guides research. The benchmark Multiple Object Tracking, MOTChallenge, was launched with goal to establish a standardized evaluation multiple object tracking methods. challenge focuses on people tracking, since pedestrians well studied in community, precise detection has...

10.48550/arxiv.2003.09003 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Existing systems for video-based pose estimation and tracking struggle to perform well on realistic videos with multiple people often fail output body-pose trajectories consistent over time. To address this shortcoming paper introduces PoseTrack which is a new large-scale benchmark human articulated tracking. Our encompasses three tasks focusing i) single-frame multi-person estimation, ii) in videos, iii) establish the benchmark, we collect, annotate release dataset that features labeled...

10.1109/cvpr.2018.00542 article EN 2018-06-01

In this paper, we revisit the joint probabilistic data association (JPDA) technique and propose a novel solution based on recent developments in finding m-best solutions to an integer linear program. The key advantage of approach is that it makes JPDA computationally tractable applications with high target and/or clutter density, such as spot tracking fluorescence microscopy sequences pedestrian surveillance footage. We also show our algorithm embedded simple framework surprisingly...

10.1109/iccv.2015.349 article EN 2015-12-01

Abstract Standardized benchmarks have been crucial in pushing the performance of computer vision algorithms, especially since advent deep learning. Although leaderboards should not be over-claimed, they often provide most objective measure and are therefore important guides for research. We present MOTChallenge , a benchmark single-camera Multiple Object Tracking (MOT) launched late 2014, to collect existing new data create framework standardized evaluation multiple object tracking methods....

10.1007/s11263-020-01393-0 article EN cc-by International Journal of Computer Vision 2020-12-23

In this work, we introduce the challenging problem of joint multi-person pose estimation and tracking an unknown number persons in unconstrained videos. Existing methods for images cannot be applied directly to problem, since it also requires solve person association over time addition each person. We therefore propose a novel method that jointly models single formulation. To end, represent body detections video by spatio-temporal graph integer linear program partition into sub-graphs...

10.1109/cvpr.2017.495 article EN 2017-07-01

The task of tracking multiple targets is often addressed with the so-called tracking-by-detection paradigm, where first step to obtain a set target hypotheses for each frame independently. Tracking can then be regarded as solving two separate, but tightly coupled problems. carry out data association, i.e., determine origin available observations. second problem reconstruct actual trajectories that describe spatio-temporal motion pattern individual target. former inherently discrete problem,...

10.1109/tpami.2015.2505309 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2015-12-03

Tracking-by-detection has proven to be the most successful strategy address task of tracking multiple targets in unconstrained scenarios [e.g. 40, 53, 55]. Traditionally, a set sparse detections, generated preprocessing step, serves as input high-level tracker whose goal is correctly associate these "dots" over time. An obvious short-coming this approach that information available image sequences simply ignored by thresholding weak detection responses and applying non-maximum suppression. We...

10.1109/cvpr.2015.7299178 article EN 2015-06-01

When tracking multiple targets in crowded scenarios, modeling mutual exclusion between distinct becomes important at two levels: (1) data association, each target observation should support most one trajectory and be assigned per frame, (2) estimation, trajectories remain spatially separated all times to avoid collisions. Yet, existing trackers often sidestep these constraints. We address this using a mixed discrete-continuous conditional random field (CRF) that explicitly models both types...

10.1109/cvpr.2013.472 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2013-06-01

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and also been the first choice for dense prediction problems such as semantic segmentation depth estimation. However, repeated subsampling operations like pooling or convolution striding CNNs lead to a significant decrease initial image resolution. Here, we present RefineNet, generic multi-path refinement network that explicitly exploits all information available along...

10.1109/tpami.2019.2893630 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2019-01-01

The Amazon Robotics Challenge enlisted sixteen teams to each design a pick-and-place robot for autonomous warehousing, addressing development in robotic vision and manipulation. This paper presents the of our custom-built, cost-effective, Cartesian system Cartman, which won first place competition finals by stowing 14 (out 16) picking all 9 items 27 minutes, scoring total 272 points. We highlight experience-centred methodology key aspects that contributed competitiveness. believe these are...

10.1109/icra.2018.8463191 preprint EN 2018-05-01

Autonomous robotic manipulation in clutter is challenging. A large variety of objects must be perceived complex scenes, where they are partially occluded and embedded among many distractors, often restricted spaces. To tackle these challenges, we developed a deep-learning approach that combines object detection semantic segmentation. The scenes captured with RGB-D cameras, for which depth fusion method. Employing pretrained features makes learning from small annotated datasets possible. We...

10.1177/0278364917713117 article EN The International Journal of Robotics Research 2017-06-20

People tracking in crowded real-world scenes is challenging due to frequent and long-term occlusions. Recent methods obtain the image evidence from object (people) detectors, but typically use off-the-shelf detectors treat them as black box components. In this paper we argue that for best performance one should explicitly train people on failure cases of overall tracker instead. To end, first propose a novel joint detector combines state-of-the-art single person with pairs people, which...

10.1109/iccv.2013.134 article EN 2013-12-01

Evaluating multi-target tracking based on ground truth data is a surprisingly challenging task. Erroneous or ambiguous annotations, numerous evaluation protocols, and the lack of standardized benchmarks make direct quantitative comparison different approaches rather difficult. The goal this paper to raise awareness common pitfalls related objective evaluation. We investigate influence software, training procedures using several publicly available resources, point out limitations current...

10.1109/cvprw.2013.111 article EN 2013-06-01

Part handling in warehouse automation is challenging if a large variety of items must be accommodated and are stored unordered piles. To foster research this domain, Amazon holds picking challenges. We present our system which achieved second third place the Picking Challenge 2016 tasks. The challenge required participants to pick list from shelf or stow into shelf. Using two deep-learning approaches for object detection semantic segmentation one item model registration method, localizes...

10.1109/icra.2017.7989348 article EN 2017-05-01

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, often provide most objective measure performance therefore important guides research. The benchmark Multiple Object Tracking, MOTChallenge, was launched with goal to establish a standardized evaluation multiple object tracking methods. challenge focuses on people tracking, since pedestrians well studied in community, precise detection has...

10.48550/arxiv.1906.04567 preprint EN other-oa arXiv (Cornell University) 2019-01-01
Coming Soon ...