NFDI4DS | UHH-SEMS - Publication Details

Sami Abu-El-Haija

ORCID: 0009-0001-8617-3193

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5069379894

Research Areas

Advanced Graph Neural Networks
Complex Network Analysis Techniques
Topic Modeling
Domain Adaptation and Few-Shot Learning
Graph Theory and Algorithms
Generative Adversarial Networks and Image Synthesis
Advanced Neural Network Applications
Anomaly Detection Techniques and Applications
Human Pose and Action Recognition
Video Analysis and Summarization
Functional Brain Connectivity Studies
Multimodal Machine Learning Applications
Spam and Phishing Detection
Software Engineering Research
Epigenetics and DNA Methylation
Natural Language Processing Techniques
Artificial Intelligence in Healthcare
Advanced Image Processing Techniques
Recommender Systems and Techniques
Bioinformatics and Genomic Networks
Data Mining Algorithms and Applications
Advanced Image and Video Retrieval Techniques
Misinformation and Its Impacts
Machine Learning and Data Classification
Brain Tumor Detection and Classification

Google (United States)
2016-2023

University of Southern California
2019-2021

Southern California University for Professional Studies
2020-2021

Integrated Systems Incorporation (United States)
2021

Stanford University
2013-2020

YouTube-8M: A Large-Scale Video Classification Benchmark

OPENALEX - Publications

Sami Abu-El-Haija Nisarg Kothari Joonseok Lee Apostol Natsev George Toderici and 2 more

Many recent advancements in Computer Vision are attributed to large datasets. Open-source software packages for Machine Learning and inexpensive commodity hardware have reduced the barrier of entry exploring novel approaches at scale. It is possible train models over millions examples within a few days. Although large-scale datasets exist image understanding, such as ImageNet, there no comparable size video classification In this paper, we introduce YouTube-8M, largest multi-label dataset,...

10.48550/arxiv.1609.08675 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Detecting Events and Key Actors in Multi-person Videos

OPENALEX - Publications

Vignesh Ramanathan Jonathan Huang Sami Abu-El-Haija Alexander N. Gorban Kevin Murphy and 1 more

Multi-person event recognition is a challenging task, often with many people active in the scene but only small subset contributing to an actual event. In this paper, we propose model which learns detect events such videos while automatically "attending" responsible for Our does not use explicit annotations regarding who or where those are during training and testing. particular, track recurrent neural network (RNN) represent features. We learn time-varying attention weights combine these...

10.1109/cvpr.2016.332 article EN 2016-06-01

MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing

OPENALEX - Publications

Sami Abu-El-Haija Bryan Perozzi Amol Kapoor Nazanin Alipourfard Kristina Lerman and 3 more

Existing popular methods for semi-supervised learning with Graph Neural Networks (such as the Convolutional Network) provably cannot learn a general class of neighborhood mixing relationships. To address this weakness, we propose new model, MixHop, that can these relationships, including difference operators, by repeatedly feature representations neighbors at various distances. Mixhop requires no additional memory or computational complexity, and outperforms on challenging baselines. In...

10.48550/arxiv.1905.00067 preprint EN cc-by arXiv (Cornell University) 2019-01-01

Machine Learning on Graphs: A Model and Comprehensive Taxonomy

OPENALEX - Publications

Ines Chami Sami Abu-El-Haija Bryan Perozzi Christopher Ré Kevin Murphy

There has been a surge of recent interest in learning representations for graph-structured data. Graph representation methods have generally fallen into three main categories, based on the availability labeled The first, network embedding (such as shallow graph or auto-encoders), focuses unsupervised relational structure. second, regularized neural networks, leverages graphs to augment losses with regularization objective semi-supervised learning. third, aims learn differentiable functions...

10.48550/arxiv.2005.03675 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Watch Your Step: Learning Node Embeddings via Graph Attention

OPENALEX - Publications

Sami Abu-El-Haija Bryan Perozzi Rami Al‐Rfou Alex Alemi

Graph embedding methods represent nodes in a continuous vector space, preserving information from the graph (e.g. by sampling random walks). There are many hyper-parameters to these (such as walk length) which have be manually tuned for every graph. In this paper, we replace with trainable parameters that automatically learn via backpropagation. particular, novel attention model on power series of transition matrix, guides optimize an upstream objective. Unlike previous approaches models,...

10.48550/arxiv.1710.09599 preprint EN cc-by arXiv (Cornell University) 2017-01-01

Identifying and Analyzing Cryptocurrency Manipulations in Social Media

OPENALEX - Publications

Mehrnoosh Mirtaheri Sami Abu-El-Haija Fred Morstatter Greg Ver Steeg Aram Galstyan

Interest surrounding cryptocurrencies, digital or virtual currencies that are used as a medium for financial transactions, has grown tremendously in the recent years. The anonymity these makes investors particularly susceptible to fraudity-such “pump and dump” scams-where goal is artificially inflate perceived worth of currency, luring victims into investing before fraudsters can sell their holdings. Because speed relative offered by social platforms such Twitter Telegram, media become...

10.1109/tcss.2021.3059286 article EN IEEE Transactions on Computational Social Systems 2021-03-05

Learning Edge Representations via Low-Rank Asymmetric Projections

OPENALEX - Publications

Sami Abu-El-Haija Bryan Perozzi Rami Al‐Rfou

We propose a new method for embedding graphs while preserving directed edge information. Learning such continuous-space vector representations (or embeddings) of nodes in graph is an important first step using network information (from social networks, user-item graphs, knowledge bases, etc.) many machine learning tasks. Unlike previous work, we (1) explicitly model as function node embeddings, and (2) novel objective, the likelihood, which contrasts from sampled random walks with...

10.1145/3132847.3132959 preprint EN 2017-11-06

Collaborative Deep Metric Learning for Video Understanding

OPENALEX - Publications

Joonseok Lee Sami Abu-El-Haija Balakrishnan Varadarajan Apostol Natsev

The goal of video understanding is to develop algorithms that enable machines understand videos at the level human experts. Researchers have tackled various domains including classification, search, personalized recommendation, and more. However, there a research gap in combining these one unified learning framework. Towards that, we propose deep network embeds using their audio-visual content, onto metric space which preserves video-to-video relationships. Then, use trained embedding tackle...

10.1145/3219819.3219856 article EN 2018-07-19

End-to-End Learning of Compressible Features

OPENALEX - Publications

Saurabh Singh Sami Abu-El-Haija Nick Johnston Johannes Ballé Abhinav Shrivastava and 1 more

Pre-trained convolutional neural networks (CNNs) are powerful off-the-shelf feature generators and have been shown to perform very well on a variety of tasks. Unfortunately, the generated features high dimensional expensive store: potentially hundreds thousands floats per example when processing videos. Traditional entropy based lossless compression methods little help as they do not yield desired level compression, while general purpose lossy energy compaction (e.g. PCA followed by...

10.1109/icip40778.2020.9190860 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2020-09-30

Large-Scale Content-Only Video Recommendation

OPENALEX - Publications

Joonseok Lee Sami Abu-El-Haija

Traditional recommendation systems using collaborative filtering (CF) approaches work relatively well when the candidate videos are sufficiently popular With increase of user-created videos, however, recommending fresh gets more and important, but pure CF-based may not perform in such cold-start situation. In this paper, we model as a video content-based similarity learning problem, learn deep embeddings trained to predict relationships identified by co-watch-based system only visual audial...

10.1109/iccvw.2017.121 article EN 2017-10-01

N-GCN: Multi-scale Graph Convolution for Semi-supervised Node Classification

OPENALEX - Publications

Sami Abu-El-Haija Amol Kapoor Bryan Perozzi Joonseok Lee

Graph Convolutional Networks (GCNs) have shown significant improvements in semi-supervised learning on graph-structured data. Concurrently, unsupervised of graph embeddings has benefited from the information contained random walks. In this paper, we propose a model: Network GCNs (N-GCN), which marries these two lines work. At its core, N-GCN trains multiple instances over node pairs discovered at different distances walks, and learns combination instance outputs optimizes classification...

10.48550/arxiv.1802.08888 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Identifying and Analyzing Cryptocurrency Manipulations in Social Media

OPENALEX - Publications

Mehrnoosh Mirtaheri Sami Abu-El-Haija Fred Morstatter Greg Ver Steeg Aram Galstyan

Interest surrounding cryptocurrencies, digital or virtual currencies that are used as a medium for financial transactions, has grown tremendously in recent years. The anonymity these makes investors particularly susceptible to fraud---such ``pump and dump'' scams---where the goal is artificially inflate perceived worth of currency, luring victims into investing before fraudsters can sell their holdings. Because speed relative offered by social platforms such Twitter Telegram, media become...

10.31219/osf.io/dqz89 preprint EN 2019-02-03

TF-GNN: Graph Neural Networks in TensorFlow

OPENALEX - Publications

Oleksandr Ferludin Arno Eigenwillig Martin Blais Dustin Zelle Jan Pfeifer and 19 more

TensorFlow-GNN (TF-GNN) is a scalable library for Graph Neural Networks in TensorFlow. It designed from the bottom up to support kinds of rich heterogeneous graph data that occurs today's information ecosystems. In addition enabling machine learning researchers and advanced developers, TF-GNN offers low-code solutions empower broader developer community learning. Many production models at Google use TF-GNN, it has been recently released as an open source project. this paper we describe...

10.48550/arxiv.2207.03522 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Detecting events and key actors in multi-person videos

OPENALEX - Publications

Vignesh Ramanathan Jonathan Huang Sami Abu-El-Haija Alexander N. Gorban Kevin Murphy and 1 more

10.48550/arxiv.1511.02917 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Toward collaboration sensing

OPENALEX - Publications

Bertrand Schneider Sami Abu-El-Haija Jim Reesman Roy Pea

In this paper we describe preliminary applications of network analysis techniques to eye-tracking data. a previous study, the first author conducted collaborative learning experiment in which subjects had access (or not) gaze-awareness tool: their task was learn from neuroscience diagrams remote collaboration. treatment group, they could see gaze partner displayed on screen real-time. control not. Dyads group achieved higher quality collaboration and gain. paper, how can further illuminate...

10.1145/2460296.2460317 article EN 2013-04-08

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

OPENALEX - Publications

Phitchaya Mangpo Phothilimthana Sami Abu-El-Haija Kaidi Cao Bahare Fatemi Charith Mendis and 1 more

Precise hardware performance models play a crucial role in code optimizations. They can assist compilers making heuristic decisions or aid autotuners identifying the optimal configuration for given program. For example, autotuner XLA, machine learning compiler, discovered 10-20% speedup on state-of-the-art serving substantial production traffic at Google. Although there exist few datasets program prediction, they target small sub-programs such as basic blocks kernels. This paper introduces...

10.48550/arxiv.2308.13490 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Coming Soon ...