- Domain Adaptation and Few-Shot Learning
- Adversarial Robustness in Machine Learning
- Advanced Neural Network Applications
- Multimodal Machine Learning Applications
- Advanced Image and Video Retrieval Techniques
- Natural Language Processing Techniques
- Topic Modeling
- Stochastic Gradient Optimization Techniques
- Digital Media Forensic Detection
- Generative Adversarial Networks and Image Synthesis
- Anomaly Detection Techniques and Applications
- Medical Image Segmentation Techniques
- Face and Expression Recognition
- Digital Rights Management and Security
- Bacillus and Francisella bacterial research
- Machine Learning and Data Classification
- Subtitles and Audiovisual Media
- Physical Unclonable Functions (PUFs) and Hardware Security
- Video Surveillance and Tracking Methods
- Advanced Vision and Imaging
- Reinforcement Learning in Robotics
- Hand Gesture Recognition Systems
- Handwritten Text Recognition Techniques
- Semantic Web and Ontologies
- Advanced Data Compression Techniques
Apple (United Kingdom)
2023-2024
University of Toronto
2015-2021
Sharif University of Technology
2012
We present a new technique for learning visual-semantic embeddings for cross-modal retrieval. Inspired by hard negative mining, the use of hard negatives in structured prediction, and ranking loss functions, we introduce a simple change to common loss functions used for multi-modal embeddings. That, combined with fine-tuning and the use of augmented data, yields significant gains in retrieval performance. We showcase our approach, VSE++, on the MS-COCO and Flickr30K datasets, using ablation studies and comparisons with existing methods. On MS-COCO, our approach...
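The core change is a max-hinge ranking loss that keeps only the hardest in-batch negative instead of summing over all negatives. A minimal PyTorch sketch, assuming L2-normalized embeddings and one matching caption per image in the batch:

```python
import torch

def vse_hard_negative_loss(im, s, margin=0.2):
    """Max-hinge ranking loss with in-batch hard negatives (VSE++-style sketch).

    im: (B, D) L2-normalized image embeddings
    s:  (B, D) L2-normalized caption embeddings; s[i] matches im[i]
    """
    scores = im @ s.t()                      # (B, B) cosine similarities
    diag = scores.diag().view(-1, 1)         # positive-pair scores

    # Hinge cost of every in-batch negative, in both retrieval directions.
    cost_s = (margin + scores - diag).clamp(min=0)       # negative captions per image
    cost_im = (margin + scores - diag.t()).clamp(min=0)  # negative images per caption

    # Zero out the positive pairs on the diagonal.
    mask = torch.eye(scores.size(0), dtype=torch.bool, device=im.device)
    cost_s = cost_s.masked_fill(mask, 0)
    cost_im = cost_im.masked_fill(mask, 0)

    # The key change: take the hardest negative rather than the sum over all.
    return cost_s.max(dim=1)[0].mean() + cost_im.max(dim=0)[0].mean()
```

Taking the max rather than the sum focuses the gradient on the single most violating negative in the batch, which is what makes the otherwise common ranking loss behave like hard negative mining.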
CleverHans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. The library may be used to develop more robust machine learning models and to provide standardized benchmarks of models' performance in the adversarial setting. Benchmarks constructed without a standardized implementation are not comparable to each other, because a good result may indicate a robust model or it may merely indicate a weak attack procedure. This technical report is structured as follows. Section 1 provides an overview of adversarial examples and the CleverHans software. Section 2...
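For context, the kind of attack such a library standardizes can be illustrated with the fast gradient sign method (FGSM). The sketch below is generic PyTorch, not CleverHans's own API, and assumes inputs scaled to [0, 1]:

```python
import torch

def fgsm(model, x, y, eps):
    """Fast Gradient Sign Method: one-step L_inf adversarial perturbation."""
    x = x.clone().detach().requires_grad_(True)
    loss = torch.nn.functional.cross_entropy(model(x), y)
    loss.backward()
    # Step in the direction that increases the loss, then clip to valid pixels.
    x_adv = x + eps * x.grad.sign()
    return x_adv.clamp(0, 1).detach()
```

The report's point is that subtle implementation choices in code like this (step size, clipping, gradient handling) change the measured robustness, which is why a shared reference implementation matters.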
We show that the representation of an image in a deep neural network (DNN) can be manipulated to mimic those of other natural images, with only minor, imperceptible perturbations to the original image. Previous methods for generating adversarial images focused on perturbations designed to produce erroneous class labels, while we concentrate on the internal layers of DNN representations. In this way our new class of adversarial images differs qualitatively from others. While the adversary is perceptually similar to one image, its internal representation appears remarkably similar to a different...
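A minimal sketch of this kind of feature-space attack: optimize a small perturbation so that an intermediate layer's activations for the source image approach those of a target image. The hyperparameters and Adam-based optimization here are illustrative assumptions:

```python
import torch

def feature_adversary(phi, x_src, x_tgt, eps=8 / 255, steps=200, lr=0.01):
    """Perturb x_src so an internal representation phi(.) mimics that of x_tgt.

    phi: a function returning an intermediate DNN layer's activations.
    Sketch only; the paper constrains the perturbation to stay imperceptible.
    """
    delta = torch.zeros_like(x_src, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    target = phi(x_tgt).detach()              # fixed target features
    for _ in range(steps):
        opt.zero_grad()
        loss = (phi(x_src + delta) - target).pow(2).sum()  # match target features
        loss.backward()
        opt.step()
        with torch.no_grad():                 # keep the perturbation in an L_inf ball
            delta.clamp_(-eps, eps)
    return (x_src + delta).detach()
```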
Many communication-efficient variants of SGD use gradient quantization schemes. These schemes are often heuristic and fixed over the course of training. We empirically observe that the statistics of gradients of deep models change during training. Motivated by this observation, we introduce two adaptive quantization schemes, ALQ and AMQ. In both schemes, processors update their compression schemes in parallel by efficiently computing sufficient statistics of a parametric distribution. We improve the validation accuracy by almost 2% on CIFAR-10 and 1% on ImageNet in challenging...
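As background, communication-efficient SGD typically quantizes each gradient stochastically onto a small set of normalized levels; adaptive schemes like ALQ and AMQ update those levels from gradient statistics during training. A sketch of an unbiased stochastic quantizer, with the level set left as an input (the adaptation itself is not shown):

```python
import torch

def quantize_stochastic(g, levels):
    """Stochastically quantize gradient g onto sorted levels spanning [0, 1].

    Fixed uniform levels (e.g., torch.linspace(0, 1, 8)) give a standard
    heuristic scheme; adaptive methods instead fit `levels` to the observed
    gradient distribution as training progresses.
    """
    norm = g.norm()
    r = (g.abs() / norm).clamp(0, 1)                 # normalized magnitudes
    idx = torch.bucketize(r, levels).clamp(1, len(levels) - 1)
    lo, hi = levels[idx - 1], levels[idx]            # enclosing level pair
    # Round up with probability proportional to position in the bin (unbiased).
    p = (r - lo) / (hi - lo + 1e-12)
    q = torch.where(torch.rand_like(r) < p, hi, lo)
    return norm * g.sign() * q
```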
State-of-the-art computer vision models have been shown to be vulnerable to small adversarial perturbations of the input. In other words, most images in the data distribution are both correctly classified by the model and very close to a visually similar misclassified image. Despite substantial research interest, the cause of this phenomenon is still poorly understood and remains unsolved. We hypothesize that this counterintuitive behavior is a naturally occurring result of the high-dimensional geometry of the data manifold. As a first step towards...
The landscape of publicly available vision foundation models (VFMs), such as CLIP and Segment Anything Model (SAM), is expanding rapidly. VFMs are endowed with distinct capabilities stemming from their pre-training objectives. For instance, CLIP excels in semantic understanding, while SAM specializes in spatial understanding for segmentation. In this work, we introduce a simple recipe to efficiently merge VFMs into a unified model that absorbs their expertise. Our method integrates techniques of multi-task...
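One plausible shape for such a merging recipe is multi-task distillation from the frozen teachers into a shared student backbone with task-specific heads, rehearsing both tasks to limit forgetting. The sketch below is schematic; the function names, cosine objective, and equal task weighting are assumptions, not the paper's exact recipe:

```python
import torch
import torch.nn.functional as F

def merge_distill_step(student, clip_teacher, sam_teacher, clip_head, sam_head,
                       x_clip, x_sam, opt, alpha=0.5):
    """One multi-task distillation step merging two frozen teachers into
    one student backbone with lightweight task-specific heads."""
    opt.zero_grad()
    with torch.no_grad():
        t_clip = clip_teacher(x_clip)        # frozen CLIP features
        t_sam = sam_teacher(x_sam)           # frozen SAM features
    s_clip = clip_head(student(x_clip))      # student mimics CLIP on one batch
    s_sam = sam_head(student(x_sam))         # ... and SAM on another (rehearsal)
    loss = alpha * (1 - F.cosine_similarity(s_clip, t_clip, dim=-1).mean()) \
         + (1 - alpha) * (1 - F.cosine_similarity(s_sam, t_sam, dim=-1).mean())
    loss.backward()
    opt.step()
    return loss.item()
```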
We propose Dataset Reinforcement, a strategy to improve a dataset once such that the accuracy of any model architecture trained on the reinforced dataset is improved at no additional training cost for users. Dataset Reinforcement is based on data augmentation and knowledge distillation. Our generic strategy is designed through extensive analysis across CNN- and transformer-based models, performing a large-scale study of distillation with state-of-the-art models and various augmentations. We create a reinforced version of the ImageNet training dataset, called ImageNet+...
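The reinforcement step might look like the following: run the teacher once over stored augmentations and save sparse soft labels, so later student training just reads them back at no extra cost. The function names and top-10 sparsification are illustrative assumptions:

```python
import torch

def reinforce_dataset(teacher, dataset, augment, samples_per_image=10):
    """Precompute (augmentation params, teacher soft-label) records once.

    `augment` is assumed to return (params, view) so the augmentation can be
    reproduced at student-training time; students then train against the
    stored sparse soft labels without ever running the teacher.
    """
    teacher.eval()
    reinforced = []
    with torch.no_grad():
        for x, _ in dataset:
            for _ in range(samples_per_image):
                params, view = augment(x)                  # reproducible augmentation
                probs = teacher(view.unsqueeze(0)).softmax(-1)[0]
                top_p, top_i = probs.topk(10)              # sparse soft labels
                reinforced.append((params, top_i, top_p))
    return reinforced
```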
The impact of gradient noise on training deep models is widely acknowledged but not well understood. In this context, we study the distribution of gradients during training. We introduce a method, Gradient Clustering, to minimize the variance of the average mini-batch gradient with stratified sampling. We prove that the variance is minimized if the elements are sampled from a weighted clustering in the gradient space. We measure the gradient variance on common deep learning benchmarks and observe that, contrary to common assumptions, gradient variance increases during training, and smaller learning rates coincide with higher variance...
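A toy version of the stratified sampling idea: cluster per-example gradient features, then fill the mini-batch from each cluster. The tiny k-means and the norm-based cluster weighting below are illustrative, not the paper's exact algorithm:

```python
import torch

def stratified_minibatch(grads, batch_size, n_clusters=8, iters=10):
    """Sample a mini-batch stratified by clusters in gradient space.

    grads: (N, D) per-example gradient features (e.g., last-layer gradients).
    """
    # Tiny k-means in gradient space.
    centers = grads[torch.randperm(len(grads))[:n_clusters]].clone()
    for _ in range(iters):
        assign = torch.cdist(grads, centers).argmin(dim=1)
        for k in range(n_clusters):
            members = grads[assign == k]
            if len(members) > 0:
                centers[k] = members.mean(dim=0)
    # Allocate the batch across clusters proportionally to total gradient norm.
    weights = torch.stack([grads[assign == k].norm(dim=1).sum()
                           for k in range(n_clusters)])
    alloc = (batch_size * weights / weights.sum()).round().long()
    picks = []
    for k in range(n_clusters):
        members = torch.nonzero(assign == k).flatten()
        if len(members) > 0 and alloc[k] > 0:   # sample within the stratum
            picks.append(members[torch.randint(len(members), (int(alloc[k]),))])
    return torch.cat(picks)
```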
We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effective pretraining recipes based on the OpenLM framework, and a broad suite of 53 downstream evaluations. Participants in the DCLM benchmark can experiment with data curation strategies such as deduplication, filtering, and data mixing at model scales ranging from 412M to 7B parameters. As a baseline for DCLM, we conduct...
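The curation strategies participants iterate on can be pictured as passes over the corpus. A toy sketch of exact deduplication followed by a quality filter; real pipelines use fuzzy deduplication, model-based filters, and source mixing on top of this:

```python
import hashlib

def curate(docs, keep_fn):
    """Toy curation pass: exact dedup by content hash, then a quality filter.

    docs: iterable of text documents.
    keep_fn: predicate such as a length or classifier-score threshold.
    """
    seen, kept = set(), []
    for doc in docs:
        h = hashlib.sha256(doc.encode("utf-8")).hexdigest()
        if h in seen:
            continue                 # drop exact duplicates
        seen.add(h)
        if keep_fn(doc):             # keep only documents passing the filter
            kept.append(doc)
    return kept
```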
CLIP models perform remarkably well on zero-shot classification and retrieval tasks. But recent studies have shown that the learnt representations in CLIP are not well suited for dense prediction tasks like object detection, semantic segmentation or depth estimation. More recently, multi-stage training methods have been introduced to mitigate the weak performance of CLIP on downstream tasks. In this work, we find that simply improving the quality of captions in image-text datasets improves the quality of CLIP's visual representations, resulting...
Adversarial training is a common approach to improving the robustness of deep neural networks against adversarial examples. In this work, we propose a novel regularization approach as an alternative. To derive the regularizer, we formulate the adversarial robustness problem under the robust optimization framework and approximate the loss function using a second-order Taylor series expansion. Our proposed second-order adversarial regularizer (SOAR) is an upper bound based on this approximation of the inner max in the robust optimization objective. We empirically show that the method significantly improves...
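In simplified form, the second-order Taylor expansion bounds the inner maximization by gradient- and curvature-norm terms, with the Hessian-vector product computed via double backpropagation. The sketch below captures that spirit but is not the paper's exact bound; it assumes an NCHW image batch:

```python
import torch

def soar_like_penalty(model, x, y, eps=8 / 255):
    """Simplified second-order penalty in the spirit of SOAR.

    Second-order Taylor: max_{||d|| <= eps} L(x+d) is roughly bounded by
    L(x) + eps*||g|| + (eps^2/2)*||Hz|| for a probe direction z, where the
    Hessian-vector product Hz comes from double backprop.
    """
    x = x.clone().requires_grad_(True)
    loss = torch.nn.functional.cross_entropy(model(x), y)
    (g,) = torch.autograd.grad(loss, x, create_graph=True)
    z = torch.randn_like(x)                           # random probe direction
    z = z / z.flatten(1).norm(dim=1).view(-1, 1, 1, 1).clamp_min(1e-12)
    hz = torch.autograd.grad((g * z).sum(), x, create_graph=True)[0]
    first = eps * g.flatten(1).norm(dim=1)            # gradient-norm term
    second = 0.5 * eps ** 2 * hz.flatten(1).norm(dim=1)  # curvature term
    return loss + (first + second).mean()
```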
Contrastive learning has emerged as a transformative method for learning effective visual representations through the alignment of image and text embeddings. However, the pairwise similarity computation in contrastive loss between image and text pairs poses computational challenges. This paper presents a novel weakly supervised pre-training of vision models on web-scale image-text data. The proposed method reframes pre-training on image-text data as a classification task. Consequently, it eliminates the need for pairwise similarity computations in contrastive loss, achieving a remarkable $2.7\times$...
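The reframing can be sketched as multi-label classification over a vocabulary derived from caption words (e.g., nouns): each image's targets are the vocabulary entries appearing in its caption, trained with binary cross-entropy instead of a pairwise contrastive loss. Names below are illustrative, and the noun extraction is assumed done upstream:

```python
import torch
import torch.nn.functional as F

def classification_pretrain_loss(image_encoder, classifier, images,
                                 caption_noun_ids):
    """Weakly supervised pre-training as multi-label classification.

    caption_noun_ids: per-image lists of vocabulary ids found in the caption.
    No image-text pairwise similarity matrix is ever formed.
    """
    logits = classifier(image_encoder(images))   # (B, vocab_size)
    targets = torch.zeros_like(logits)
    for i, ids in enumerate(caption_noun_ids):   # build multi-hot targets
        targets[i, ids] = 1.0
    return F.binary_cross_entropy_with_logits(logits, targets)
```

Because the loss is per-image rather than over all image-text pairs in the batch, the quadratic similarity computation disappears, which is where the reported speedup comes from.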
Large Language Models (LLMs) are frequently updated due to data or architecture changes that improve their performance. When updating models, developers often focus on increasing overall performance metrics, with less emphasis on staying compatible with previous model versions. However, users build a mental model of the functionality and capabilities of a particular machine learning model they are interacting with. They have to adapt their mental model with every update -- a draining task that can lead to user dissatisfaction. In practice, fine-tuned...
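A common way to quantify this kind of incompatibility is the negative flip rate: the fraction of inputs the old model handled correctly that the updated model gets wrong. A minimal sketch for classification, illustrative of the problem the abstract describes rather than its exact metric:

```python
import torch

def negative_flip_rate(old_model, new_model, x, y):
    """Fraction of examples correct under the old model but broken by the update."""
    with torch.no_grad():
        old_ok = old_model(x).argmax(-1) == y
        new_ok = new_model(x).argmax(-1) == y
    return (old_ok & ~new_ok).float().mean().item()
```

Even when aggregate accuracy rises, a nonzero flip rate means some previously working behavior regressed, which is exactly what forces users to re-learn the model.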
The pre-training phase of language models often begins with randomly initialized parameters. With the current trends in scaling models, training their large number of parameters can be extremely slow and costly. In contrast, small models are less expensive to train, but they cannot achieve the accuracy of large models. In this paper, we explore an intriguing idea to connect these two regimes: Can we develop a method to initialize large language models using smaller pre-trained models? Will such initialization bring any benefits in terms...
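One function-preserving way to realize such an initialization is to tile the small model's weights into a wider layer so the expanded network initially reproduces the small model's outputs. The sketch below shows the idea for a single linear layer; the paper's exact expansion may differ:

```python
import torch

def clone_linear(w, b, expand=2):
    """Function-preserving width expansion of a linear layer (sketch).

    Output units are tiled; input columns are tiled and scaled by 1/expand,
    so if the incoming activations are themselves tiled copies, the expanded
    layer reproduces the small layer's outputs.
    """
    w_big = w.repeat(expand, expand) / expand   # (out*e, in*e)
    b_big = b.repeat(expand)
    return w_big, b_big

# Quick check: tiled inputs give tiled (identical) outputs.
w, b = torch.randn(4, 3), torch.randn(4)
x = torch.randn(3)
w2, b2 = clone_linear(w, b)
assert torch.allclose(w2 @ x.repeat(2) + b2, (w @ x + b).repeat(2), atol=1e-6)
```

Because every expanded block starts as an exact copy of the trained small model, the large model begins pre-training from the small model's loss rather than from a random initialization.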