Boris N. Oreshkin

ORCID: 0000-0002-1869-2004
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Domain Adaptation and Few-Shot Learning
  • Multimodal Machine Learning Applications
  • Target Tracking and Data Fusion in Sensor Networks
  • Time Series Analysis and Forecasting
  • Distributed Control Multi-Agent Systems
  • Human Pose and Action Recognition
  • Stock Market Forecasting Methods
  • Distributed Sensor Networks and Detection Algorithms
  • Energy Load and Power Forecasting
  • Advanced Neural Network Applications
  • Forecasting Techniques and Applications
  • Human Motion and Animation
  • 3D Shape Modeling and Analysis
  • Algorithms and Data Compression
  • Neural Networks Stability and Synchronization
  • Opportunistic and Delay-Tolerant Networks
  • Hydrological Forecasting Using AI
  • Cooperative Communication and Network Coding
  • Machine Learning and Data Classification
  • Image and Signal Denoising Methods
  • Cancer-related molecular mechanisms research
  • Gaussian Processes and Bayesian Inference
  • Traffic Prediction and Management Techniques
  • Direction-of-Arrival Estimation Techniques
  • Advanced Data Compression Techniques

Michigan State University
2021

University of Waterloo
2020

McGill University
2007-2019

Université de Montréal
2015-2019

Computer Research Institute of Montréal
2016

Few-shot learning has become essential for producing models that generalize from few examples. In this work, we identify metric scaling and task conditioning are important to improve the performance of few-shot algorithms. Our analysis reveals simple completely changes nature algorithm parameter updates. Metric provides improvements up 14% in accuracy certain metrics on mini-Imagenet 5-way 5-shot classification task. We further propose a effective way learner sample set, resulting...

10.48550/arxiv.1805.10123 preprint EN other-oa arXiv (Cornell University) 2018-01-01

We focus on solving the univariate times series point forecasting problem using deep learning. propose a neural architecture based backward and forward residual links very stack of fully-connected layers. The has number desirable properties, being interpretable, applicable without modification to wide array target domains, fast train. test proposed several well-known datasets, including M3, M4 TOURISM competition datasets containing time from diverse domains. demonstrate state-of-the-art...

10.48550/arxiv.1905.10437 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Recent progress in neural forecasting accelerated improvements the performance of large-scale systems. Yet, long-horizon remains a very difficult task. Two common challenges afflicting task are volatility predictions and their computational complexity. We introduce NHITS, model which addresses both by incorporating novel hierarchical interpolation multi-rate data sampling techniques. These techniques enable proposed method to assemble its sequentially, emphasizing components with different...

10.1609/aaai.v37i6.25854 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Deep learning has thrived by training on large-scale datasets. However, in robotics applications sample efficiency is critical. We propose a novel adaptive masked proxies method that constructs the final segmentation layer weights from few labelled samples. It utilizes multiresolution average pooling base embeddings with label to act as positive proxy for new class, while fusing it previously learned class signatures. Our evaluated PASCAL-5 <sup xmlns:mml="http://www.w3.org/1998/Math/MathML"...

10.1109/iccv.2019.00535 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Metric-based meta-learning techniques have successfully been applied to few-shot classification problems. In this paper, we propose leverage cross-modal information enhance metric-based learning methods. Visual and semantic feature spaces different structures by definition. For certain concepts, visual features might be richer more discriminative than text ones. While for others, the inverse true. Moreover, when support from is limited in image classification, representations (learned...

10.48550/arxiv.1902.07104 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Forecasting of multivariate time-series is an important problem that has applications in traffic management, cellular network configuration, and quantitative finance. A special case the arises when there a graph available captures relationships between time-series. In this paper we propose novel learning architecture achieves performance competitive with or better than best existing algorithms, without requiring knowledge graph. The key element our proposed learnable fully connected hard...

10.1609/aaai.v35i10.17114 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

In this paper, we demonstrate, both theoretically and by numerical examples, that adding a local prediction component to the update rule can significantly improve convergence rate of distributed averaging algorithms. We focus on case where predictor is linear combination node's two previous values (i.e., memory taps), our computes usual weighted received from neighbouring nodes. derive optimal mixing parameter for combining with neighbors' values, carry out theoretical analysis improvement...

10.1109/tsp.2010.2043127 article EN IEEE Transactions on Signal Processing 2010-02-17

Can meta-learning discover generic ways of processing time series (TS) from a diverse dataset so as to greatly improve generalization on new TS coming different datasets? This work provides positive evidence this using broad framework which we show subsumes many existing algorithms. Our theoretical analysis suggests that residual connections act adaptation mechanism, generating subset task-specific parameters based given input, thus gradually expanding the expressive power architecture...

10.1609/aaai.v35i10.17115 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

This paper proposes an approach to accelerate local, linear iterative network algorithms asymptotically achieving distributed average consensus. We focus on the class of in which each node initializes its ldquostate valuerdquo local measurement and then at iteration algorithm, updates this state value by adding a weighted sum own neighbors' values. Provided weight matrix satisfies certain convergence conditions, values converge measurements, but is generally slow, impeding practical...

10.1109/tsp.2008.2010376 article EN IEEE Transactions on Signal Processing 2008-12-08

Significant progress has been made recently in developing few-shot object segmentation methods. Learning is shown to be successful settings, using pixel-level, scribbles and bounding box supervision. This paper takes another approach, i.e., only requiring image-level label for segmentation. We propose a novel multi-modal interaction module that utilizes co-attention mechanism both visual word embedding. Our model labels achieves 4.8% improvement over previously proposed It also outperforms...

10.24963/ijcai.2020/120 article EN 2020-07-01

We present a distributed particle filtering algorithm for target tracking in sensor networks. Several existing algorithms rely on the establishment and maintenance of spanning path or tree. This is challenging networks with dynamic topologies induced by mobile nodes changing wireless conditions; are vulnerable to link node failure. More recent employ consensus improve robustness but they adopt suboptimal fusion rules leading significant deterioration performance. In our algorithm, run local...

10.1109/icif.2010.5712070 article EN 2010-07-01

We examine the requirements and available methods software to provide (or imitate) uniform random numbers in parallel computing environments. In this context, for great majority of applications, independent streams are required, each being computed on a single processing element at time. Sometimes, thousands or even millions such needed. explain how they can be produced managed. devote particular attention multiple GPU devices.

10.1016/j.matcom.2016.05.005 article EN cc-by Mathematics and Computers in Simulation 2016-06-07

We propose, develop, and compare new stochastic models for the daily arrival rate in a call center. Following standard practice, day is divided into time periods of equal length (e.g., 15 or 30 minutes), assumed random but constant each period, arrivals are from Poisson process, conditional on rate. The period taken as deterministic base (or expected rate) multiplied by busyness factor having mean 1. Models which factors independent across periods, common applies to all have been studied...

10.1287/opre.2016.1484 article EN Operations Research 2016-03-22

This paper presents <i xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">greedy gossip with eavesdropping</i> (GGE), a novel randomized algorithm for distributed computation of the average consensus problem. In algorithms, nodes in network randomly communicate their neighbors and exchange information iteratively. The algorithms are simple decentralized, making them attractive wireless applications. general, robust to unreliable conditions time varying...

10.1109/tsp.2010.2046593 article EN IEEE Transactions on Signal Processing 2010-03-26

We show that the task of synthesizing human motion conditioned on a set key frames can be solved more accurately and effectively if deep learning based interpolator operates in delta mode using spherical linear as baseline. empirically demonstrate strength our approach publicly available datasets achieving state-of-the-art performance. further generalize these results by showing <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML"...

10.1109/tvcg.2023.3309107 article EN IEEE Transactions on Visualization and Computer Graphics 2023-08-28

This paper proposes a novel framework for delay-tolerant particle filtering that is computationally efficient and has limited memory requirements. Within this the informativeness of delayed (out-of-sequence) measurement (OOSM) estimated using lightweight procedure uninformative measurements are immediately discarded. The requires identification threshold separates informative from uninformative; selection task formulated as constrained optimization problem, where goal to minimize tracking...

10.1109/tsp.2011.2140110 article EN IEEE Transactions on Signal Processing 2011-04-08

Recent progress in neural forecasting accelerated improvements the performance of large-scale systems. Yet, long-horizon remains a very difficult task. Two common challenges afflicting task are volatility predictions and their computational complexity. We introduce N-HiTS, model which addresses both by incorporating novel hierarchical interpolation multi-rate data sampling techniques. These techniques enable proposed method to assemble its sequentially, emphasizing components with different...

10.48550/arxiv.2201.12886 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Deep learning has thrived by training on large-scale datasets. However, in robotics applications sample efficiency is critical. We propose a novel adaptive masked proxies method that constructs the final segmentation layer weights from few labelled samples. It utilizes multi-resolution average pooling base embeddings with label to act as positive proxy for new class, while fusing it previously learned class signatures. Our evaluated PASCAL-$5^i$ dataset and outperforms state-of-the-art...

10.48550/arxiv.1902.11123 preprint EN cc-by arXiv (Cornell University) 2019-01-01
Coming Soon ...