Matthew R. Scott

ORCID: 0000-0003-1404-4437
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Dementia and Cognitive Impairment Research
  • Advanced Neural Network Applications
  • Alzheimer's disease research and treatments
  • Domain Adaptation and Few-Shot Learning
  • Advanced Image and Video Retrieval Techniques
  • Advanced Neuroimaging Techniques and Applications
  • Functional Brain Connectivity Studies
  • Generative Adversarial Networks and Image Synthesis
  • Medical Image Segmentation Techniques
  • Face recognition and analysis
  • Brain Tumor Detection and Classification
  • Human Pose and Action Recognition
  • Hand Gesture Recognition Systems
  • Machine Learning and Data Classification
  • Video Surveillance and Tracking Methods
  • Cannabis and Cannabinoid Research
  • Forensic Toxicology and Drug Analysis
  • Natural Language Processing Techniques
  • Multimodal Machine Learning Applications
  • Anomaly Detection Techniques and Applications
  • Video Analysis and Summarization
  • Gaze Tracking and Assistive Technology
  • Neurological and metabolic disorders
  • Image Retrieval and Classification Techniques
  • MRI in cancer diagnosis

Boston University
2005-2025

Massachusetts General Hospital
2017-2023

Harvard University
2019-2023

Framingham Heart Study
2023

Athinoula A. Martinos Center for Biomedical Imaging
2019-2022

Banner Alzheimer’s Institute
2022

University of Toronto
2022

Sunnybrook Research Institute
2022

Florey Institute of Neuroscience and Mental Health
2022

The University of Melbourne
2022

A family of loss functions built on pair-based computation have been proposed in the literature which provide a myriad solutions for deep metric learning. In this pa-per, we general weighting framework under-standing recent functions. Our contributions are three-fold: (1) establish General Pair Weighting (GPW) framework, casts sampling problem learning into unified view pair through gradient analysis, providing powerful tool understanding functions; (2) show that with GPW, various existing...

10.1109/cvpr.2019.00516 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

One-stage object detection is commonly implemented by optimizing two sub-tasks: classification and localization, using heads with parallel branches, which might lead to a certain level of spatial misalignment in predictions between the tasks. In this work, we propose Task-aligned Object Detection (TOOD) that explicitly aligns tasks learning-based manner. First, design novel Head (T-Head) offers better balance learning task-interactive task-specific features, as well greater flexibility learn...

10.1109/iccv48922.2021.00349 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Siamese-based trackers have achieved excellent performance on visual object tracking. However, the target template is not updated online, and features of search image are computed independently in a Siamese architecture. In this paper, we propose Deformable Attention Networks, referred to as SiamAttn, by introducing new attention mechanism that computes deformable self-attention cross-attention. The learns strong context information via spatial attention, selectively emphasizes...

10.1109/cvpr42600.2020.00676 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

<h3>Importance</h3> Mounting evidence suggests that sex differences exist in the pathologic trajectory of Alzheimer disease. Previous literature shows elevated levels cerebrospinal fluid tau women compared with men as a function apolipoprotein E (APOE) ε4 status and β-amyloid (Aβ). What remains unclear is association regional deposition clinically normal individuals. <h3>Objective</h3> To examine cross-sectional between Aβ measured positron emission tomography (PET). <h3>Design, Setting...

10.1001/jamaneurol.2018.4693 article EN JAMA Neurology 2019-02-04

Mining informative negative instances are of central importance to deep metric learning (DML). However, the hard-mining ability existing DML methods is intrinsically limited by mini-batch training, where only a accessible at each iteration. In this paper, we identify “slow drift” phenomena observing that embedding features drift exceptionally slow even as model parameters updating throughout training process. It suggests computed preceding iterations can considerably approximate their...

10.1109/cvpr42600.2020.00642 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

We present ClothFlow, an appearance-flow-based generative model to synthesize clothed person for posed-guided image generation and virtual try-on. By estimating a dense flow between source target clothing regions, ClothFlow effectively models the geometric changes naturally transfers appearance novel images as shown in Figure 1. achieve this with three-stage framework: 1) Conditioned on pose, we first estimate semantic layout provide richer guidance process. 2) Built two feature pyramid...

10.1109/iccv.2019.01057 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Recent progress has been made on developing a unified framework for joint text detection and recognition in natural images, but existing models were mostly built two-stage by involving ROI pooling, which can degrade the performance task. In this work, we propose convolutional character networks, referred as CharNet, is an one-stage model that process two tasks simultaneously one pass. CharNet directly outputs bounding boxes of words characters, with corresponding labels. We utilize basic...

10.1109/iccv.2019.00922 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Fine-grained image categorization is challenging due to the subtle inter-class differences. We posit that exploiting rich relationships between channels can help capture such differences since different correspond semantics. In this paper, we propose a channel interaction network (CIN), which models channel-wise interplay both within an and across images. For single image, self-channel (SCI) module proposed explore correlation image. This allows model learn complementary features from...

10.1609/aaai.v34i07.6712 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Objective The goal of this study was to examine sex differences in tau distribution across the brain older adults, using positron emission tomography (PET), and investigate how these might associate with cognitive trajectories. Methods Participants were 343 clinically normal individuals (women, 58%; 73.8 [8.5] years) 55 mild impairment (MCI; women, 38%; 76.9 [7.3] from Harvard Aging Brain Study Alzheimer's Disease Neuroimaging Initiative. We examined 18 F‐Flortaucipir (FTP)‐positron (PET)...

10.1002/ana.25878 article EN Annals of Neurology 2020-08-16

Visual compatibility is critical for fashion analysis, yet missing in existing image synthesis systems. In this paper, we propose to explicitly model visual through inpainting. We present Fashion Inpainting Networks (FiNet), a two-stage image-to-image generation framework that able perform compatible and diverse Disentangling the of shape appearance ensure photorealistic results, our consists network an network. More importantly, each network, introduce two encoders interacting with one...

10.1109/iccv.2019.00458 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Many Large-scale image databases such as ImageNet have significantly advanced classification and other visual recognition tasks. However much of these datasets are constructed only for single-label coarse object-level classification. For real-world applications, multiple labels fine-grained categories often needed, yet very few exist publicly, especially those large-scale high quality. In this work, we contribute to the community a new dataset called iMaterialist Fashion Attribute...

10.1109/iccvw.2019.00377 preprint EN 2019-10-01

Training an object detector on a data-rich domain and applying it to data-poor one with limited performance drop is highly attractive in industry, because saves huge annotation cost. Recent research unsupervised adaptive detection has verified that aligning data distributions between source target images through adversarial learning very useful. The key when, where how use achieve best practice. We propose Image-Instance Full Alignment Networks (iFAN) tackle this problem by precisely feature...

10.1609/aaai.v34i07.7015 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Abstract Animal and human imaging research reported that the presence of cortical Alzheimer’s Disease’s (AD) neuropathology, beta-amyloid neurofibrillary tau, is associated with altered neuronal activity circuitry failure, together facilitating clinical progression. The locus coeruleus (LC), one initial subcortical regions harboring pretangle hyperphosphorylated has widespread connections to cortex modulating cognition. Here we investigate whether LC’s in-vivo functional connectivity (FC)...

10.1038/s41467-022-28986-2 article EN cc-by Nature Communications 2022-03-23

Weakly-supervised instance segmentation aims to detect and segment object instances precisely, given image-level labels only. Unlike previous methods which are composed of multiple offline stages, we propose Sequential Label Propagation Enhancement Networks (referred as Label-PEnet) that progressively transforms pixel-wise in a coarse-to-fine manner. We design four cascaded modules including multi-label classification, detection, refinement segmentation, implemented sequentially by sharing...

10.1109/iccv.2019.00344 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Unawareness, or anosognosia, of memory deficits is a challenging manifestation Alzheimer's disease (AD) that adversely affects patient's safety and decision-making. However, there lack consensus regarding the presence, as well evolution, altered awareness function across preclinical prodromal stages AD. Here, we aimed to characterize change in abilities its relationship beta-amyloid (Aβ) burden large cohort (N = 1,070) individuals spectrum.

10.1002/ana.25649 article EN Annals of Neurology 2019-11-21

A family of loss functions built on pair-based computation have been proposed in the literature which provide a myriad solutions for deep metric learning. In this paper, we general weighting framework understanding recent functions. Our contributions are three-fold: (1) establish General Pair Weighting (GPW) framework, casts sampling problem learning into unified view pair through gradient analysis, providing powerful tool functions; (2) show that with GPW, various existing methods can be...

10.48550/arxiv.1904.06627 preprint EN other-oa arXiv (Cornell University) 2019-01-01

MR images (MRIs) accurate segmentation of brain lesions is important for improving cancer diagnosis, surgical planning, and prediction outcome. However, manual from 3D MRIs highly expensive, time-consuming, prone to user biases. We present an efficient yet conceptually simple network (referred as Brain SegNet), which a residual framework automatic voxel-wise lesion. Our model able directly predict dense voxel tumor or ischemic stroke regions in MRIs. The proposed can run at about 0.5s per -...

10.1186/s12880-020-0409-2 article EN cc-by BMC Medical Imaging 2020-02-11

Most existing 3D CNNs for video representation learning are clip-based methods, and thus do not consider video-level temporal evolution of spatio-temporal features. In this paper, we propose Video-level 4D Convolutional Neural Networks, referred as V4D, to model the long-range with convolutions, at same time, preserve strong residual connections. Specifically, design a new block able capture inter-clip interactions, which could enhance power original clip-level CNNs. The blocks can be easily...

10.48550/arxiv.2002.07442 preprint EN other-oa arXiv (Cornell University) 2020-01-01

The ability to synthesize multi-modality data is highly desirable for many computer-aided medical applications, e.g. clinical diagnosis and neuroscience research, since rich imaging cohorts offer diverse complementary information unraveling human tissues. However, collecting acquisitions can be limited by adversary factors such as patient discomfort, expensive cost scanner unavailability. In this paper, we propose a multi-task coherent modality transferable GAN (MCMT-GAN) address issue brain...

10.1109/tip.2020.3011557 article EN IEEE Transactions on Image Processing 2020-01-01
Coming Soon ...