NFDI4DS | UHH-SEMS - Publication Details

MultiWOZ 2.1: A Consolidated Multi-Domain Dialogue Dataset with State Corrections and State Tracking Baselines

OPENALEX - Publications

Mihail Eric Rahul Goel Shachi Paul Adarsh Kumar Abhishek Sethi and 5 more

MultiWOZ 2.0 (Budzianowski et al., 2018) is a recently released multi-domain dialogue dataset spanning 7 distinct domains and containing over 10,000 dialogues. Though immensely useful one of the largest resources its kind to-date, has few shortcomings. Firstly, there substantial noise in state annotations utterances which negatively impact performance state-tracking models. Secondly, follow-up work (Lee 2019) augmented original with user acts. This leads to multiple co-existent versions same...

10.48550/arxiv.1907.01669 preprint EN cc-by arXiv (Cornell University) 2019-01-01

Interactive Segmentation of Radiance Fields

OPENALEX - Publications

Rahul Goel Dhawal Sirikonda Saurabh Saini P. J. Narayanan

Radiance Fields (RF) are popular to represent casually-captured scenes for new view synthesis and several applications beyond it. Mixed reality on personal spaces needs understanding manipulating represented as RFs, with semantic segmentation of objects an important step. Prior efforts show promise but don't scale complex diverse appearance. We present the ISRF method interactively segment fine structure Nearest neighbor feature matching using distilled features identifies high-confidence...

10.1109/cvpr52729.2023.00409 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Advances in Magnetic Materials: Classification and Synthesis

OPENALEX - Publications

Anjali Yadav Aruna Sharma Meenu Devi Bhawana Jangir Rahul Goel

Nanoscience plays a pivotal role in mitigating lethal pollutants, contributing to environmental rejuvenation. The rising demand for magnetic nanomaterials is driven by their extensive applications drug delivery, biosensors, remediation, resonance imaging (MRI), catalysis and cell separation. Various synthesis methods including solvothermal, co-precipitation, thermal decomposition, hydrothermal microemulsion processes, have been developed prepare these materials. This study highlights the...

10.25303/295rjce1770184 article EN Research Journal of Chemistry and Environment 2025-03-31

PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs

OPENALEX - Publications

Rahul Goel Waleed Ammar Aditya Gupta Siddharth Vashishtha Motoki Sano and 11 more

Rahul Goel, Waleed Ammar, Aditya Gupta, Siddharth Vashishtha, Motoki Sano, Faiz Surani, Max Chang, HyunJeong Choe, David Greene, Chuan He, Rattima Nitisaroj, Anna Trukhina, Shachi Paul, Pararth Shah, Rushin Zhou Yu. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023.

10.18653/v1/2023.emnlp-main.667 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2023-01-01

On Evaluating and Comparing Open Domain Dialog Systems

OPENALEX - Publications

Anu Venkatesh Chandra Khatri Ashwin Ram Fenfei Guo Raefer Gabriel and 8 more

Conversational agents are exploding in popularity. However, much work remains the area of non goal-oriented conversations, despite significant growth research interest over recent years. To advance state art conversational AI, Amazon launched Alexa Prize, a 2.5-million dollar university competition where sixteen selected teams built to deliver best social experience. Prize provided academic community with unique opportunity perform live system used by millions users. The subjectivity...

10.48550/arxiv.1801.03625 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Detecting Offensive Content in Open-domain Conversations using Two Stage Semi-supervision

OPENALEX - Publications

Chandra Khatri Behnam Hedayatnia Rahul Goel A. G. Venkatesh Raefer Gabriel and 1 more

As open-ended human-chatbot interaction becomes commonplace, sensitive content detection gains importance. In this work, we propose a two stage semi-supervised approach to bootstrap large-scale data for automatic language from publicly available web resources. We explore various selection methods including 1) using blacklist rank online discussion forums by the level of their sensitiveness followed randomly sampling utterances and 2) training weakly supervised model in conjunction with...

10.48550/arxiv.1811.12900 preprint EN other-oa arXiv (Cornell University) 2018-01-01

GSN: Generalisable Segmentation in Neural Radiance Field

OPENALEX - Publications

Vinayak Gupta Rahul Goel Sirikonda Dhawal P. J. Narayanan

Traditional Radiance Field (RF) representations capture details of a specific scene and must be trained afresh on each scene. Semantic feature fields have been added to RFs facilitate several segmentation tasks. Generalised RF learn the principles view interpolation. A generalised can render new views an unknown untrained scene, given few views. We present way distil into GNT representation. Our GSN representation generates unseen scenes fly along with consistent, per-pixel semantic...

10.1609/aaai.v38i3.27972 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

CST5: Data Augmentation for Code-Switched Semantic Parsing

OPENALEX - Publications

Anmol Agarwal Jigar Gupta Rahul Goel Shyam Upadhyay Pankaj Joshi and 1 more

Extending semantic parsers to code-switched input has been a challenging problem, primarily due lack of supervised training data. In this work, we introduce CST5, new data augmentation technique that finetunes T5 model using small seed set ($\approx$100 utterances) generate utterances from English utterances. We show CST5 generates high quality data, both intrinsically (per human evaluation) and extrinsically by comparing baseline models which are trained without with augmented Empirically...

10.48550/arxiv.2211.07514 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Reducing Model Churn: Stable Re-training of Conversational Agents

OPENALEX - Publications

Christopher Hidey Fei Liu Rahul Goel

Retraining modern deep learning systems can lead to variations in model performance even when trained using the same data and hyper-parameters by simply different random seeds. This phenomenon is known as churn or jitter. issue often exacerbated real world settings, where noise may be introduced collection process. In this work we tackle problem of stable retraining with a novel focus on structured prediction for conversational semantic parsing. We first quantify introducing metrics...

10.18653/v1/2022.sigdial-1.2 article EN cc-by 2022-01-01

GSN: Generalisable Segmentation in Neural Radiance Field

OPENALEX - Publications

Vinayak Gupta Rahul Goel Sirikonda Dhawal P. J. Narayanan

Traditional Radiance Field (RF) representations capture details of a specific scene and must be trained afresh on each scene. Semantic feature fields have been added to RFs facilitate several segmentation tasks. Generalised RF learn the principles view interpolation. A generalised can render new views an unknown untrained scene, given few views. We present way distil into GNT representation. Our GSN representation generates unseen scenes fly along with consistent, per-pixel semantic...

10.48550/arxiv.2402.04632 preprint EN arXiv (Cornell University) 2024-02-07

Real-Time Decompression and Rasterization of Massive Point Clouds

OPENALEX - Publications

Rahul Goel Markus Schütz P. J. Narayanan Bernhard Kerbl

Large-scale capturing of real-world scenes as 3D point clouds (e.g., using LIDAR scanning) generates billions points that are challenging to visualize. High storage requirements prevent the quick and easy inspection captured datasets on user-grade hardware. The fastest real-time rendering methods limited by available GPU memory render only around 1 billion interactively. We show we can achieve state-of-the-art in both while simultaneously supporting surpass capabilities other methods....

10.1145/3675373 article EN Proceedings of the ACM on Computer Graphics and Interactive Techniques 2024-08-09

Taming 3DGS: High-Quality Radiance Fields with Limited Resources

OPENALEX - Publications

Saswat Subhajyoti Mallick Rahul Goel Bernhard Kerbl Markus Steinberger Francisco Vicente Carrasco and 1 more

3D Gaussian Splatting (3DGS) has transformed novel-view synthesis with its fast, interpretable, and high-fidelity rendering. However, resource requirements limit usability. Especially on constrained devices, training performance degrades quickly often cannot complete due to excessive memory consumption of the model. The method converges an indefinite number Gaussians—many them redundant—making rendering unnecessarily slow preventing usage in downstream tasks that expect fixed-size inputs. To...

10.1145/3680528.3687694 article EN cc-by 2024-12-03

PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs

OPENALEX - Publications

Rahul Goel Waleed Ammar Aditya Gupta Siddharth Vashishtha Motoki Sano and 11 more

Research interest in task-oriented dialogs has increased as systems such Google Assistant, Alexa and Siri have become ubiquitous everyday life. However, the impact of academic research this area been limited by lack datasets that realistically capture wide array user pain points. To enable on some more challenging aspects parsing realistic conversations, we introduce PRESTO, a public dataset over 550K contextual multilingual conversations between humans virtual assistants. PRESTO contains...

10.48550/arxiv.2303.08954 preprint EN cc-by arXiv (Cornell University) 2023-01-01

DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue

OPENALEX - Publications

William A. Held Christopher Hidey Fei Liu Eric Y. Zhu Rahul Goel and 2 more

William Held, Christopher Hidey, Fei Liu, Eric Zhu, Rahul Goel, Diyi Yang, Rushin Shah. Proceedings of the 61st Annual Meeting Association for Computational Linguistics (Volume 1: Long Papers). 2023.

10.18653/v1/2023.acl-long.199 article EN cc-by 2023-01-01

StyleTRF: Stylizing Tensorial Radiance Fields✱

OPENALEX - Publications

Rahul Goel Sirikonda Dhawal Saurabh Saini P. J. Narayanan

Stylized view generation of scenes captured casually using a camera has received much attention recently. The geometry and appearance the scene are typically as neural point sets or radiance fields in previous work. An image stylization method is used to stylize by training its network jointly iteratively with structure capture network. state-of-the-art SNeRF [29] trains NeRF an alternating manner. These methods have high time require joint optimization. In this work, we present StyleTRF,...

10.1145/3571600.3571643 preprint EN 2022-12-08

Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues

OPENALEX - Publications

Shachi Paul Rahul Goel Dilek Hakkani‐Tür

Machine learning approaches for building task-oriented dialogue systems require large conversational datasets with labels to train on. We are interested in from human-human conversations, which may be available ample amounts existing customer care center logs or can collected crowd workers. Annotating these prohibitively expensive. Recently multiple annotated human-machine have been released, however their annotation schema varies across different collections, even well-defined categories...

10.48550/arxiv.1907.03020 preprint EN cc-by arXiv (Cornell University) 2019-01-01

Online Embedding Compression for Text Classification using Low Rank Matrix Factorization

OPENALEX - Publications

Anish Acharya Rahul Goel Angeliki Metallinou Inderjit S. Dhillon

Deep learning models have become state of the art for natural language processing (NLP) tasks, however deploying these in production system poses significant memory constraints. Existing compression methods are either lossy or introduce latency. We propose a method that leverages low rank matrix factorization during training,to compress word embedding layer which represents size bottleneck most NLP models. Our trained, compressed and then further re-trained on downstream task to recover...

10.48550/arxiv.1811.00641 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Reducing Model Jitter: Stable Re-training of Semantic Parsers in Production Environments

OPENALEX - Publications

Christopher Hidey Fei Liu Rahul Goel

Retraining modern deep learning systems can lead to variations in model performance even when trained using the same data and hyper-parameters by simply different random seeds. We call this phenomenon jitter. This issue is often exacerbated production settings, where models are retrained on noisy data. In work we tackle problem of stable retraining with a focus conversational semantic parsers. first quantify jitter introducing agreement metric showing variation dataset noise sizes. then...

10.48550/arxiv.2204.04735 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Improving Top-K Decoding for Non-Autoregressive Semantic Parsing via Intent Conditioning

OPENALEX - Publications

Geunseob Oh Rahul Goel Chris Hidey Shachi Paul Aditya Gupta and 2 more

Semantic parsing (SP) is a core component of modern virtual assistants like Google Assistant and Amazon Alexa. While sequence-to-sequence-based auto-regressive (AR) approaches are common for conversational semantic parsing, recent studies employ non-autoregressive (NAR) decoders reduce inference latency while maintaining competitive quality. However, major drawback NAR the difficulty generating top-k (i.e., k-best) outputs with such as beam search. To address this challenge, we propose novel...

10.48550/arxiv.2204.06748 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Pre-Trained Language Transformers are Universal Image Classifiers

OPENALEX - Publications

Rahul Goel Modar Sulaiman Kimia Noorbakhsh Mahdi Sharifi Rajesh Sharma and 2 more

Facial images disclose many hidden personal traits such as age, gender, race, health, emotion, and psychology. Understanding these will help to classify the people in different attributes. In this paper, we have presented a novel method for classifying using pretrained transformer model. We apply binary classification of facial criminal non-criminal classes. The GPT-2 is trained generate text then fine-tuned images. During finetuning process with images, most layers GT-2 are frozen during...

10.48550/arxiv.2201.10182 preprint EN cc-by arXiv (Cornell University) 2022-01-01

TableFormer: Robust Transformer Modeling for Table-Text Encoding

OPENALEX - Publications

Jingfeng Yang Aditya Gupta Shyam Upadhyay Luheng He Rahul Goel and 1 more

Understanding tables is an important aspect of natural language understanding. Existing models for table understanding require linearization the structure, where row or column order encoded as unwanted bias. Such spurious biases make model vulnerable to and perturbations. Additionally, prior work has not thoroughly modeled structures table-text alignments, hindering ability. In this work, we propose a robust structurally aware encoding architecture TableFormer, tabular structural are...

10.48550/arxiv.2203.00274 preprint EN cc-by arXiv (Cornell University) 2022-01-01

FusedRF: Fusing Multiple Radiance Fields

OPENALEX - Publications

Rahul Goel Dhawal Sirikonda Rajvi Shah P. J. Narayanan

Radiance Fields (RFs) have shown great potential to represent scenes from casually captured discrete views. Compositing parts or whole of multiple could greatly interest several XR applications. Prior works can generate new views such by tracing each scene in parallel. This increases the render times and memory requirements with number components. In this work, we provide a method create single, compact, fused RF representation for composited using RFs. The has same utilizations as single...

10.48550/arxiv.2306.04180 preprint EN public-domain arXiv (Cornell University) 2023-01-01

Parsing Coordination for Spoken Language Understanding

OPENALEX - Publications

Sanchit Agarwal Rahul Goel Tagyoung Chung Abhishek Sethi Arindam Mandal and 1 more

Typical spoken language understanding systems provide narrow semantic parses using a domain-specific ontology. The contain intents and slots that are directly consumed by downstream domain applications. In this work we discuss expanding such to handle compound entities introducing domain-agnostic shallow parser handles linguistic coordination. We show our model for parsing coordination learns domain-independent slot-independent features is able segment conjunct boundaries of many different...

10.48550/arxiv.1810.11497 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Contextual Topic Modeling For Dialog Systems

OPENALEX - Publications

Chandra Khatri Rahul Goel Behnam Hedayatnia Angeliki Metanillou A. G. Venkatesh and 2 more

Accurate prediction of conversation topics can be a valuable signal for creating coherent and engaging dialog systems. In this work, we focus on context-aware topic classification methods identifying in free-form human-chatbot dialogs. We extend previous work neural unsupervised keyword detection by incorporating conversational context act features. On annotated data, show that acts leads to relative gains accuracy 35% recall 11% interactions where frequently span multiple utterances....

10.48550/arxiv.1810.08135 preprint EN other-oa arXiv (Cornell University) 2018-01-01