- Topic Modeling
- Natural Language Processing Techniques
- Multimodal Machine Learning Applications
- Blind Source Separation Techniques
- Human Pose and Action Recognition
- Text Readability and Simplification
- Adversarial Robustness in Machine Learning
- Neural Networks and Applications
- Domain Adaptation and Few-Shot Learning
- Sparse and Compressive Sensing Techniques
- Image and Signal Denoising Methods
- Explainable Artificial Intelligence (XAI)
- Advanced Image and Video Retrieval Techniques
- EEG and Brain-Computer Interfaces
- Advanced Adaptive Filtering Techniques
- Neural Networks and Reservoir Computing
- Inertial Sensor and Navigation
- Microwave Imaging and Scattering Analysis
- Speech and Dialogue Systems
- Anomaly Detection Techniques and Applications
- GNSS Positioning and Interference
- Software Engineering Research
- Speech and Audio Processing
- Currency Recognition and Detection
- Video Analysis and Summarization
Microsoft Research (United Kingdom)
2018-2023
Microsoft (United States)
2017-2023
University of North Texas
2023
Massachusetts Institute of Technology
2022
Carnegie Mellon University
2022
Allen Institute
2022
Allen Institute for Artificial Intelligence
2022
University of Washington
2022
University of British Columbia
2013-2016
Sharif University of Technology
2009-2012
Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT-4 is part of a new cohort of LLMs (along with ChatGPT...
This paper presents a unified Vision-Language Pre-training (VLP) model. The model is unified in that (1) it can be fine-tuned for either vision-language generation (e.g., image captioning) or understanding (e.g., visual question answering) tasks, and (2) it uses a shared multi-layer transformer network for both encoding and decoding, which differs from many existing methods where the encoder and decoder are implemented using separate models. VLP is pre-trained on a large amount of image-text pairs using unsupervised learning...
This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks with Long Short-Term Memory (LSTM) cells. Due to its ability to capture long-term memory, the LSTM-RNN accumulates increasingly richer information as it goes through the sentence, and when it reaches the last word, the hidden layer of the network provides a semantic representation of the whole sentence. In this paper, the model is trained in a weakly supervised manner on user...
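The last-word readout described above can be sketched in a few lines of NumPy: run an LSTM cell over the word vectors and take the final hidden state as the sentence embedding. This is a generic, untrained LSTM for illustration, not the paper's model; all dimensions and weights are invented.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_sentence_embedding(word_vecs, params):
    """Run an LSTM over a sentence (a list of word vectors) and return the
    hidden state after the last word as the sentence embedding."""
    W, U, b = params  # stacked gate weights (i, f, o, g): input, recurrent, bias
    d = U.shape[1]
    h = np.zeros(d)
    c = np.zeros(d)
    for x in word_vecs:
        z = W @ x + U @ h + b
        i, f, o, g = np.split(z, 4)
        i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)
        c = f * c + i * np.tanh(g)   # cell accumulates sentence information
        h = o * np.tanh(c)
    return h  # semantic representation of the whole sentence

# toy usage: 5-word sentence, 8-dim word vectors, 16-dim sentence embedding
rng = np.random.default_rng(0)
d_in, d_h = 8, 16
params = (rng.normal(0, 0.1, (4 * d_h, d_in)),
          rng.normal(0, 0.1, (4 * d_h, d_h)),
          np.zeros(4 * d_h))
sentence = [rng.normal(size=d_in) for _ in range(5)]
emb = lstm_sentence_embedding(sentence, params)
print(emb.shape)  # (16,)
```

In the paper's weakly supervised setting, the weights would be trained from click-through-style signals rather than drawn at random.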
Thomas Hartvigsen, Saadia Gabriel, Hamid Palangi, Maarten Sap, Dipankar Ray, Ece Kamar. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022.
Recent research has focused on enhancing the capability of smaller models through imitation learning, drawing on outputs generated by large foundation models (LFMs). A number of issues impact the quality of these models, ranging from limited imitation signals from shallow LFM outputs; small-scale homogeneous training data; and most notably a lack of rigorous evaluation, resulting in overestimating the small model's capability, as they tend to learn to imitate the style, but not the reasoning process, of LFMs. To address these challenges, we develop Orca (We are working...
Various studies that address the compressed sensing problem with Multiple Measurement Vectors (MMVs) have been recently carried out. These studies assume the vectors of the different channels to be jointly sparse. In this paper, we relax this condition. Instead, we assume that these sparse vectors depend on each other, but that this dependency is unknown. We capture this dependency by computing the conditional probability of each entry in each vector being non-zero, given the "residuals" of all previous vectors. To estimate these probabilities, we propose the use of the Long Short-Term Memory (LSTM) [1], a data...
Orca 1 learns from rich signals, such as explanation traces, allowing it to outperform conventional instruction-tuned models on benchmarks like BigBench Hard and AGIEval. In Orca 2, we continue exploring how improved training signals can enhance smaller LMs' reasoning abilities. Research on small LMs has often relied on imitation learning to replicate the output of more capable models. We contend that excessive emphasis on imitation may restrict the potential of smaller models. We seek to teach small LMs to employ different solution strategies for different tasks,...
Epilepsy is the second most common brain disorder after migraine. Automatic detection of epileptic seizures can considerably improve patients' quality of life. Current Electroencephalogram (EEG)-based seizure detection systems encounter many challenges in real-life situations. The EEGs are non-stationary signals and seizure patterns vary across patients and recording sessions. Moreover, EEG data are prone to numerous noise types that negatively affect the detection accuracy of epileptic seizures. To address these challenges, we introduce the use of a...
In this paper we address the following problem in web document and information retrieval (IR): How can we use long-term context information to gain better IR performance? Unlike common IR methods that use a bag-of-words representation for queries and documents, we treat them as sequences of words and use long short-term memory (LSTM) to capture contextual dependencies. To the best of our knowledge, this is the first time LSTM is applied to IR tasks. Unlike the training of traditional LSTMs, our training strategy is different due to the special nature of the IR problem. Experimental evaluation on an IR task derived...
We have seen great progress in video action recognition in recent years. There are several models based on convolutional neural networks (CNNs) and some transformer-based approaches which provide top performance on existing benchmarks. In this work, we perform a large-scale robustness analysis of these existing models for video action recognition. We focus on robustness against real-world distribution shift perturbations instead of adversarial perturbations. We propose four different benchmark datasets, HMDB51-P, UCF101-P, Kinetics400-P, and SSv2-P, to...
Recently an influx of studies claims emergent cognitive abilities in large language models (LLMs). Yet, most rely on anecdotes, overlook contamination of training sets, or lack systematic evaluation involving multiple tasks, control conditions, multiple iterations, and statistical robustness tests. Here we make two major contributions. First, we propose CogEval, a cognitive-science-inspired protocol for the systematic evaluation of cognitive capacities in Large Language Models. The CogEval protocol can be followed for the evaluation of various abilities. Second, here we follow...
We propose Heterogeneous Swarms, an algorithm to design multi-LLM systems by jointly optimizing model roles and weights. We represent multi-LLM systems as directed acyclic graphs (DAGs) of LLMs with topological message passing for collaborative generation. Given a pool of LLM experts and a utility function, Heterogeneous Swarms employs two iterative steps: role-step and weight-step. For role-step, we interpret model roles as learning a DAG that specifies the flow of inputs and outputs between LLMs. Starting from a swarm of random continuous adjacency matrices, we decode...
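The DAG-with-topological-message-passing idea can be sketched with plain Python functions standing in for LLM experts. The expert names, the `{node: predecessors}` graph, and the prompt-concatenation scheme below are all invented for illustration; the paper learns the DAG from continuous adjacency matrices rather than fixing it by hand.

```python
from graphlib import TopologicalSorter

# hypothetical stand-ins for LLM experts: each maps a prompt string to a string
experts = {
    "summarizer": lambda text: "summary(" + text + ")",
    "critic":     lambda text: "critique(" + text + ")",
    "writer":     lambda text: "draft(" + text + ")",
}

# DAG as {node: set of predecessor nodes}
dag = {"summarizer": set(),
       "critic": {"summarizer"},
       "writer": {"summarizer", "critic"}}

def run_swarm(dag, experts, user_input):
    """Topological message passing: each node consumes the user input plus
    the outputs of its predecessors, and emits its own output."""
    outputs = {}
    for node in TopologicalSorter(dag).static_order():
        upstream = " | ".join(outputs[p] for p in sorted(dag[node]))
        prompt = user_input if not upstream else user_input + " | " + upstream
        outputs[node] = experts[node](prompt)
    return outputs

result = run_swarm(dag, experts, "q")
print(result["writer"])
```

Because the graph is acyclic, `TopologicalSorter` guarantees every node runs only after all of its inputs are available; the role-step in the paper amounts to searching over which edges exist in `dag`.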
We introduce an architecture, the Tensor Product Recurrent Network (TPRN). In our application of TPRN, internal representations—learned by end-to-end optimization in a deep neural network performing a textual question-answering (QA) task—can be interpreted using basic concepts from linguistic theory. No performance penalty need be paid for this increased interpretability: the proposed model performs comparably to a state-of-the-art system on the SQuAD QA task. The interpreted representation is a Tensor Product Representation: each...
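The Tensor Product Representation underlying the TPRN can be illustrated directly: fillers are bound to roles via outer products, summed into one tensor, and recovered exactly by contracting with a role vector when the roles are orthonormal. This is a generic TPR demo with invented dimensions, not the TPRN model itself.

```python
import numpy as np

rng = np.random.default_rng(1)
d_f, d_r, n = 6, 4, 3  # filler dim, role dim, number of bindings

fillers = rng.normal(size=(n, d_f))
Q, _ = np.linalg.qr(rng.normal(size=(d_r, n)))  # orthonormal columns
roles = Q.T                                     # n orthonormal role vectors (rows)

# bind: T = sum_i f_i (outer) r_i
T = sum(np.outer(fillers[i], roles[i]) for i in range(n))

# unbind filler i by contracting T with its role vector
recovered = T @ roles[0]
print(np.allclose(recovered, fillers[0]))  # True
```

Unbinding is exact here because the role vectors are mutually orthogonal; with merely linearly independent roles one would contract with the dual (pseudoinverse) roles instead.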
Grounding language to visual relations is critical to various language-and-vision applications. In this work, we tackle two fundamental tasks: image-text matching and image captioning, and demonstrate that neural scene graph generators can learn effective relation features to facilitate grounding and subsequently improve the two end tasks. By combining relation features with state-of-the-art models, our experiments show significant improvement on the standard Flickr30K and MSCOCO benchmarks. Our experimental results and analysis show that relation features improve downstream...
Spatial understanding is a fundamental aspect of computer vision and integral for human-level reasoning about images, making it an important component of grounded language understanding. While recent text-to-image synthesis (T2I) models have shown unprecedented improvements in photorealism, it is unclear whether they have reliable spatial understanding capabilities. We investigate the ability of T2I models to generate correct spatial relationships among objects and present VISOR, an evaluation metric that captures how accurately the spatial relationship...
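A metric in VISOR's spirit can be sketched by reducing each detected object to a bounding box and testing the stated relation on centroids. The centroid-comparison rule below is one plausible decision rule chosen for illustration, not necessarily VISOR's exact definition.

```python
def centroid(box):
    """box = (x0, y0, x1, y1) in image coordinates (x grows right, y grows down)."""
    x0, y0, x1, y1 = box
    return ((x0 + x1) / 2, (y0 + y1) / 2)

def relation_correct(box_a, box_b, relation):
    """Does object A stand in the stated spatial relation to object B?
    Decided by comparing object centroids."""
    (ax, ay), (bx, by) = centroid(box_a), centroid(box_b)
    return {
        "left of":  ax < bx,
        "right of": ax > bx,
        "above":    ay < by,
        "below":    ay > by,
    }[relation]

# "a dog to the left of a cat": the dog's box sits left of the cat's box
print(relation_correct((10, 40, 60, 90), (100, 40, 150, 90), "left of"))  # True
```

Averaging this boolean over many generated images per prompt yields a score of how often the model renders the requested relationship correctly.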
Robust detection of epileptic seizures in the presence of the inevitable artifacts in Electroencephalogram (EEG) signals is addressed. The EEG dataset considered contains 300 seizures recorded from 15 volunteers. Current seizure detection systems achieve good performance when the data are entirely free of noise. However, their performance drastically decays with authentic data polluted by real artifacts. We introduce a robust detection method that can address both clean and noisy data. The proposed method uses Long Short-Term Memory (LSTM) neural networks to extract...
A longstanding question in cognitive science concerns the learning mechanisms underlying compositionality in human cognition. Humans can infer structured relationships (e.g., grammatical rules) implicit in their sensory observations (e.g., auditory speech), and use this knowledge to guide the composition of simpler meanings into complex wholes. Recent progress in artificial neural networks has shown that when large models are trained on enough linguistic data, grammatical structure emerges in their representations. We extend this work...
Deployed language models decay over time due to shifting inputs, changing user needs, or emergent world-knowledge gaps. When such problems are identified, we want to make targeted edits while avoiding expensive retraining. However, current model editors, which modify the behaviors of pre-trained models, degrade performance quickly across multiple, sequential edits. We propose GRACE, a lifelong model editing method, which implements spot-fixes on streaming errors of a deployed model, ensuring minimal impact...
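The spot-fix idea can be sketched as a codebook sitting in front of a frozen layer: keys are cached activations, values are corrected outputs, and a deferral radius decides when the cache fires. This is a simplified illustration of the mechanism under assumed semantics (fixed radius, no key splitting), not the released GRACE implementation.

```python
import numpy as np

class SpotFixLayer:
    """A codebook of (key, corrected value) entries in front of a frozen base
    layer. Inputs landing within `radius` of a key get the stored corrected
    output; everything else passes through to the base layer untouched.
    (GRACE additionally adjusts radii when new edits conflict with old ones.)"""
    def __init__(self, base_fn, radius=1.0):
        self.base_fn = base_fn
        self.radius = radius
        self.codebook = []  # list of (key, corrected_value)

    def edit(self, key, corrected_value):
        """Record one targeted fix for a streaming error."""
        self.codebook.append((np.asarray(key), np.asarray(corrected_value)))

    def __call__(self, x):
        x = np.asarray(x)
        for key, value in self.codebook:
            if np.linalg.norm(x - key) <= self.radius:
                return value         # edit fires
        return self.base_fn(x)       # defer to the frozen model

layer = SpotFixLayer(base_fn=lambda x: x * 2, radius=0.5)
layer.edit(key=[1.0, 1.0], corrected_value=[0.0, 0.0])
print(layer([1.0, 1.1]))   # within radius -> patched value
print(layer([3.0, 3.0]))   # far away -> base layer output
```

Because edits only fire inside their radius, unrelated inputs keep the original model behavior, which is how sequential edits avoid degrading overall performance.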
Data-driven predictive solutions predominant in commercial applications tend to suffer from biases and stereotypes, which raises equity concerns. Prediction models may discover, use, or amplify spurious correlations based on gender or other protected personal characteristics, thus discriminating against marginalized groups. Mitigating gender bias has become an important research focus in natural language processing (NLP) and is an area where annotated corpora are available. Data augmentation reduces gender bias by adding...
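The augmentation step can be illustrated with a toy counterfactual swap. The lexicon and the `gender_swap` helper below are invented for illustration; real systems use far larger word lists and must handle names, coreference, and ambiguous forms such as "her" (possessive vs. object).

```python
import re

# a tiny illustrative swap lexicon
SWAPS = {"he": "she", "she": "he", "him": "her", "her": "him",
         "his": "her", "hers": "his", "man": "woman", "woman": "man"}

def gender_swap(sentence):
    """Counterfactual data augmentation: produce a gender-swapped copy of a
    sentence by replacing each gendered word with its counterpart,
    preserving capitalization."""
    def repl(match):
        word = match.group(0)
        swapped = SWAPS[word.lower()]
        return swapped.capitalize() if word[0].isupper() else swapped
    pattern = re.compile(r"\b(" + "|".join(SWAPS) + r")\b", re.IGNORECASE)
    return pattern.sub(repl, sentence)

print(gender_swap("He finished his shift."))  # She finished her shift.
```

Training on the union of original and swapped sentences weakens the spurious correlation between gender terms and the prediction target.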
We study the MMV (Multiple Measurement Vectors) compressive sensing setting with a specific sparse structured support. The locations of the non-zero rows in the sparse matrix are not known. All that is known is that their probabilities of being non-zero vary from one group to another. We propose two novel greedy algorithms for exact recovery in this problem. The first algorithm models the structure using a shallow non-linear neural network. The input to the network is the residual after prediction and the output is the vector to be recovered. The second algorithm improves on the first by stacking this operation to form...
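The greedy recovery loop can be sketched in Orthogonal-Matching-Pursuit style, with a pluggable `score_fn` where the paper's residual-to-prediction network would sit. Here the classical correlation score stands in for that network; the problem sizes are invented for a toy run.

```python
import numpy as np

def greedy_recover(A, y, k, score_fn):
    """Greedy sparse recovery: at each step, score candidate entries from the
    current residual, add the best one to the support, and re-fit the chosen
    entries by least squares. The described method would replace the
    correlation score with a learned network mapping residuals to predictions."""
    m, n = A.shape
    support, residual = [], y.copy()
    for _ in range(k):
        scores = np.abs(score_fn(residual))
        scores[support] = -np.inf          # never re-pick a chosen entry
        support.append(int(np.argmax(scores)))
        x_s, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ x_s
    x = np.zeros(n)
    x[support] = x_s
    return x

# toy problem: recover a 2-sparse vector from 12 random measurements
rng = np.random.default_rng(2)
A = rng.normal(size=(12, 20))
x_true = np.zeros(20); x_true[[3, 17]] = [1.5, -2.0]
y = A @ x_true
x_hat = greedy_recover(A, y, k=2, score_fn=lambda r: A.T @ r)
print(np.nonzero(x_hat)[0])
```

The least-squares re-fit after each selection is what keeps the residual orthogonal to the columns already chosen, so each new score reflects only the unexplained part of `y`.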