NFDI4DS | UHH-SEMS - Publication Details

Visual Search at Pinterest

OPENALEX - Publications

Yushi Jing David Liu Dmitry Kislyuk Andrew Zhai Jiajing Xu and 2 more

We demonstrate that, with the availability of distributed computation platforms such as Amazon Web Services and open-source tools, it is possible for a small engineering team to build, launch maintain cost-effective, large-scale visual search system. also demonstrate, through comprehensive set live experiments at Pinterest, that content recommendation powered by improves user engagement. By sharing our implementation details learnings from launching commercial engine scratch, we hope becomes...

10.1145/2783258.2788621 article EN 2015-08-07

Classification is a Strong Baseline for Deep Metric Learning

OPENALEX - Publications

Andrew Zhai Haoyu Wu

Deep metric learning aims to learn a function mapping image pixels embedding feature vectors that model the similarity between images. Two major applications of are content-based retrieval and face verification. For tasks, majority current state-of-the-art (SOTA) approaches triplet-based non-parametric training. verification however, recent SOTA have adopted classification-based parametric In this paper, we look into effectiveness classification based on datasets. We evaluate several...

10.48550/arxiv.1811.12649 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Toward Transformer-Based Object Detection

OPENALEX - Publications

Josh Beal Eric Kim Eric Tzeng Dong Huk Park Andrew Zhai and 1 more

Transformers have become the dominant model in natural language processing, owing to their ability pretrain on massive amounts of data, then transfer smaller, more specific tasks via fine-tuning. The Vision Transformer was first major attempt apply a pure transformer directly images as input, demonstrating that compared convolutional networks, transformer-based architectures can achieve competitive results benchmark classification tasks. However, computational complexity attention operator...

10.48550/arxiv.2012.09958 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Visual Discovery at Pinterest

OPENALEX - Publications

Andrew Zhai Dmitry Kislyuk Yushi Jing Michael Feng Eric Tzeng and 3 more

Over the past three years Pinterest has experimented with several visual search and recommendation systems, from enhancing existing products such as Related Pins (2014), to powering new Similar Looks (2015), Flashlight (2016), Lens (2017). This paper presents an overview of our discovery engine these services, shares rationales behind technical product decisions use object detection interactive user interfaces. We conclude that this significantly improves engagement in both tasks.

10.1145/3041021.3054201 article EN 2017-01-01

Structure and dynamics of a pentameric KCTD5/CUL3/Gβγ E3 ubiquitin ligase complex

OPENALEX - Publications

Duc Minh Nguyen D. Rath Dominic Devost Darlaine Pétrin Robert Rizk and 13 more

Heterotrimeric G proteins can be regulated by posttranslational modifications, including ubiquitylation. KCTD5, a pentameric substrate receptor protein consisting of an N-terminal BTB domain and C-terminal domain, engages CUL3 to form the central scaffold cullin-RING E3 ligase complex (CRL3 KCTD5 ) that ubiquitylates Gβγ reduces levels in cells. The cryo-EM structure 5:5:5 KCTD5/CUL3 NTD /Gβ 1 γ 2 assembly reveals highly dynamic with rotations over 60° between /CUL3 CTD /Gβγ moieties...

10.1073/pnas.2315018121 article EN cc-by-nc-nd Proceedings of the National Academy of Sciences 2024-04-16

MultiSage

OPENALEX - Publications

Carl Yang Aditya Pal Andrew Zhai Nikil Pancha Jiawei Han and 2 more

Graph convolutional networks (GCNs) are a powerful class of graph neural networks. Trained in semi-supervised end-to-end fashion, GCNs can learn to integrate node features and structures generate high-quality embeddings that be used for various downstream tasks like search recommendation. However, existing mostly work on homogeneous graphs consider single embedding each node, which do not sufficiently model the multi-facet nature complex interaction nodes real-world Here, we present...

10.1145/3394486.3403293 article EN 2020-08-20

ItemSage: Learning Product Embeddings for Shopping Recommendations at Pinterest

OPENALEX - Publications

Paul Baltescu Haoyu Chen Nikil Pancha Andrew Zhai Jure Leskovec and 1 more

Learned embeddings for products are an important building block web-scale e-commerce recommendation systems. At Pinterest, we build a single set of product called ItemSage to provide relevant recommendations in all shopping use cases including user, image and search based recommendations. This approach has led significant improvements engagement conversion metrics, while reducing both infrastructure maintenance cost. While most prior work focuses on from features coming modality, introduce...

10.1145/3534678.3539170 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022-08-12

Learning a Unified Embedding for Visual Search at Pinterest

OPENALEX - Publications

Andrew Zhai Haoyu Wu Eric Tzeng Dong Huk Park Charles Rosenberg

At Pinterest, we utilize image embeddings throughout our search and recommendation systems to help users navigate through visual content by powering experiences like browsing of related searching for exact products shopping. In this work describe a multi-task deep metric learning system learn single unified embedding which can be used power multiple products. The solution present not only allows us train application objectives in neural network architecture, but takes advantage correlated...

10.1145/3292500.3330739 article EN 2019-07-25

User-Driven Geolocation of Untagged Desert Imagery Using Digital Elevation Models

OPENALEX - Publications

Eric Tzeng Andrew Zhai Matthew Clements Raphael J.L. Townshend Avideh Zakhor

We propose a system for user-aided visual localization of desert imagery without the use any metadata such as GPS readings, camera focal length, or field-of-view. The makes only publicly available digital elevation models (DEMs) to rapidly and accurately locate photographs in non-urban environments deserts. Our generates synthetic skyline views from DEM extracts stable concavity-based features these skylines form database. To localize queries, user manually traces on an input photograph. is...

10.1109/cvprw.2013.42 article EN 2013-06-01

Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations

OPENALEX - Publications

Josh Beal Haoyu Wu Dong Huk Park Andrew Zhai Dmitry Kislyuk

Large-scale pretraining of visual representations has led to state-of-the-art performance on a range benchmark computer vision tasks, yet the benefits these techniques at extreme scale in complex production systems been relatively unexplored. We consider case popular discovery product, where are trained with multi-task learning, from use-case specific understanding (e.g. skin tone classification) general representation learning for all content embeddings retrieval). In this work, we describe...

10.1109/wacv51458.2022.00150 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022-01-01

PinnerFormer

OPENALEX - Publications

Nikil Pancha Andrew Zhai Jure Leskovec Charles Rosenberg

Sequential models have become increasingly popular in powering personalized recommendation systems over the past several years. These approaches traditionally model a user's actions on website as sequence to predict next action. While theoretically simplistic, these are quite challenging deploy production, commonly requiring streaming infrastructure reflect latest user activity and potentially managing mutable data for encoding hidden state. Here we introduce PinnerFormer, representation...

10.1145/3534678.3539156 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022-08-12

TransAct: Transformer-based Realtime User Action Model for Recommendation at Pinterest

OPENALEX - Publications

X.-G. Xia Pong Eksombatchai Nikil Pancha Dhruvil Badani Po-Wei Wang and 5 more

Sequential models that encode user activity for next action prediction have become a popular design choice building web-scale personalized recommendation systems. Traditional methods of sequential either utilize end-to-end learning on realtime actions, or learn representations separately in an offline batch-generated manner. This paper (1) presents Pinterest's ranking architecture Homefeed, our product and the largest engagement surface; (2) proposes TransAct, model extracts users'...

10.1145/3580305.3599918 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2023-08-04

Shop The Look

OPENALEX - Publications

Raymond Shiau Haoyu Wu Eric Kim Yue Li Du Anqi Guo and 5 more

As online content becomes ever more visual, the demand for searching by visual queries grows correspondingly stronger. Shop The Look is an shopping discovery service at Pinterest, leveraging search to enable users find and buy products within image. In this work, we provide a holistic view of how built Look, oriented system, along with lessons learned from addressing needs. We discuss topics including core technology across object detection embeddings, serving infrastructure realtime...

10.1145/3394486.3403372 preprint EN 2020-08-20

Structure and dynamics of a pentameric KCTD5/Cullin3/Gβγ E3 ubiquitin ligase complex

OPENALEX - Publications

Duc Minh Nguyen D. Rath Dominic Devost Darlaine Pétrin Robert Rizk and 13 more

Abstract Heterotrimeric G proteins can be regulated by post-translational modifications, including ubiquitylation. KCTD5, a pentameric substrate receptor protein consisting of an N-terminal BTB domain and C-terminal (CTD), engages CUL3 to form the central scaffold cullin- RING E3 ligase complex (CRL3 KCTD5 ) that ubiquitylates Gβγ reduces levels in cells. The cryo-EM structure 5:5:5 KCTD5/CUL3 NTD /Gβ 1 γ 2 assembly reveals highly dynamic with rotations over 60° between /CUL3 CTD /Gβγ...

10.1101/2023.09.20.558662 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-09-20

Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

OPENALEX - Publications

Jiajing Xu Andrew Zhai Charles Rosenberg

In this work, we present our journey to revolutionize the personalized recommendation engine through end-to-end learning from raw user actions. We encode user's long-term interest in PinnerFormer, a embedding optimized for future actions via new dense all-action loss, and capture short-term intention by directly real-time action sequences. conducted both offline online experiments validate performance of model architecture, also address challenge serving such complex using mixed CPU/GPU...

10.1145/3523227.3547394 preprint EN 2022-09-13

MultiBiSage

OPENALEX - Publications

Saket Gurukar Nikil Pancha Andrew Zhai Eric Kim Samson Hu and 3 more

Graph Convolutional Networks (GCN) can efficiently integrate graph structure and node features to learn high-quality embeddings. At Pinterest, we have developed deployed PinSage, a data-efficient GCN that learns pin embeddings from the Pin-Board graph. Pinterest relies heavily on PinSage which in turn only leverages However, there exist several entities at heterogeneous interactions among these entities. These diverse provide important signal for recommendations modeling. In this work, show...

10.14778/3574245.3574262 article EN Proceedings of the VLDB Endowment 2022-12-01

Bootstrapping Complete The Look at Pinterest

OPENALEX - Publications

Eileen Li Eric Kim Andrew Zhai Josh Beal Kunlong Gu

Putting together an ideal outfit is a process that involves creativity and style intuition. This makes it particularly difficult task to automate. Existing styling products generally involve human specialists highly curated set of fashion items. In this paper, we will describe how bootstrapped the Complete The Look (CTL) system at Pinterest. technology aims learn subjective "style compatibility" in order recommend complementary items complete outfit. particular, want show recommendations...

10.1145/3394486.3403382 preprint EN 2020-08-20

PinnerFormer: Sequence Modeling for User Representation at Pinterest

OPENALEX - Publications

Nikil Pancha Andrew Zhai Jure Leskovec Charles Rosenberg

Sequential models have become increasingly popular in powering personalized recommendation systems over the past several years. These approaches traditionally model a user's actions on website as sequence to predict next action. While theoretically simplistic, these are quite challenging deploy production, commonly requiring streaming infrastructure reflect latest user activity and potentially managing mutable data for encoding hidden state. Here we introduce PinnerFormer, representation...

10.48550/arxiv.2205.04507 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Large Area Cell Based Image Localization

OPENALEX - Publications

Andrew Zhai Matthew Clements Avideh Zakhor

We present a memory scalable image localization system that uses distributed kd-trees created on overlapping geographic cells using database of 10 million Google Street View images for an area approximately 10,000 square kilometers in Taiwan. Given collection over region interest (ROI), we generate by dynamically creating are optimized so each cell contains roughly the same number images. then create from SIFT features extracted cell. When querying system, run traditional feature matching...

10.1109/ism.2014.79 article EN 2014-12-01

Visual Discovery at Pinterest

OPENALEX - Publications

Andrew Zhai Dmitry Kislyuk Yushi Jing Michael Feng Eric Tzeng and 3 more

Over the past three years Pinterest has experimented with several visual search and recommendation services, including Related Pins (2014), Similar Looks (2015), Flashlight (2016) Lens (2017). This paper presents an overview of our discovery engine powering these shares rationales behind technical product decisions such as use object detection interactive user interfaces. We conclude that this significantly improves engagement in both tasks.

10.48550/arxiv.1702.04680 preprint EN cc-by arXiv (Cornell University) 2017-01-01

MultiBiSage: A Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest

OPENALEX - Publications

Saket Gurukar Nikil Pancha Andrew Zhai Eric Kim Samson Hu and 3 more

Graph Convolutional Networks (GCN) can efficiently integrate graph structure and node features to learn high-quality embeddings. These embeddings then be used for several tasks such as recommendation search. At Pinterest, we have developed deployed PinSage, a data-efficient GCN that learns pin from the Pin-Board graph. The contains board entities captures belongs interaction. However, there exist at Pinterest users, idea pins, creators, heterogeneous interactions among these add-to-cart,...

10.48550/arxiv.2205.10666 preprint EN cc-by arXiv (Cornell University) 2022-01-01