NFDI4DS | UHH-SEMS - Publication Details

Natraj Raman

ORCID: 0009-0008-8866-1482

About

Contact & Profiles

A5065006343

Research Areas

Topic Modeling
Natural Language Processing Techniques
Human Pose and Action Recognition
Stock Market Forecasting Methods
Advanced Text Analysis Techniques
Multimodal Machine Learning Applications
Semantic Web and Ontologies
Complex Systems and Time Series Analysis
Handwritten Text Recognition Techniques
Credit Risk and Financial Regulations
Anomaly Detection Techniques and Applications
Complex Network Analysis Techniques
Gait Recognition and Analysis
Image Retrieval and Classification Techniques
Financial Markets and Investment Strategies
Bayesian Methods and Mixture Models
Sentiment Analysis and Opinion Mining
Video Surveillance and Tracking Methods
Data Mining Algorithms and Applications
Sustainable Finance and Green Bonds
Generative Adversarial Networks and Image Synthesis
Statistical Methods and Inference
Music and Audio Processing
Explainable Artificial Intelligence (XAI)
Video Analysis and Summarization

Harvard University
2025

Morgan Stanley (United Kingdom)
2022-2023

JPMorgan Chase & Co (United States)
2022

Georgia Institute of Technology
2022

IBM Research - Almaden
2021

California University of Pennsylvania
2021

University of Colorado System
2021

Hong Kong University of Science and Technology
2020

University of Hong Kong
2020

Carleton College
2020

Estimating Upper Extremity Fugl-Meyer Assessment Scores From Reaching Motions Using Wearable Sensors

OPENALEX - Publications

Yu Zhou Natraj Raman Tommaso Proietti James Arnold Prabhat Pathak and 6 more

The Fugl-Meyer Assessment (FMA) is a widely-used assessment for tracking motor function recovery post-stroke. Due to the limited access to rehabilitation, there exists a need for remote and automated solutions. Wearable sensors and data-driven methods have shown promise in enabling automatic upper extremity FMA (FMA-UE) estimation, but minimizing user input motion and aligning with current clinical activities will aid the adoption of sensor-based assessments. In this work, we present an FMA-UE estimator which can...

10.1109/jbhi.2025.3542037 article EN IEEE Journal of Biomedical and Health Informatics 2025-01-01

When FLUE Meets FLANG: Benchmarks and Large Pretrained Language Model for Financial Domain

OPENALEX - Publications

Raj C. Shah Kunal Chawla Dheeraj Eidnani Agam Shah Wendi Du and 5 more

Raj Shah, Kunal Chawla, Dheeraj Eidnani, Agam Shah, Wendi Du, Sudheer Chava, Natraj Raman, Charese Smiley, Jiaao Chen, Diyi Yang. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022.

10.18653/v1/2022.emnlp-main.148 article EN cc-by Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing 2022-01-01

Activity recognition using a supervised non-parametric hierarchical HMM

OPENALEX - Publications

Natraj Raman Stephen J. Maybank

10.1016/j.neucom.2016.03.024 article EN Neurocomputing 2016-03-29

Synthetic Data Applications in Finance

OPENALEX - Publications

Vamsi K. Potluru Daniel Borrajo Andrea Coletta Niccolò Dalmasso Yousef El-Laham and 15 more

Synthetic data has made tremendous strides in various commercial settings including finance, healthcare, and virtual reality. We present a broad overview of prototypical applications of synthetic data in the financial sector and in particular provide richer details for a few select ones. These cover a wide variety of data modalities including tabular, time-series, event-series, and unstructured data arising from both financial markets and retail applications. Since finance is a highly regulated industry, synthetic data is a potential approach for dealing with issues related to...

10.48550/arxiv.2401.00081 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Mapping ESG Trends by Distant Supervision of Neural Language Models

OPENALEX - Publications

Natraj Raman Grace Bang Armineh Nourbakhsh

The integration of Environmental, Social and Governance (ESG) considerations into business decisions and investment strategies has accelerated over the past few years. It is important to quantify the extent to which ESG-related conversations are carried out by companies so that their impact on operations can be objectively assessed. However, profiling ESG language is challenging due to its multi-faceted nature and the lack of supervised datasets. This research study aims to detect historical trends in ESG discussions...

10.3390/make2040025 article EN cc-by Machine Learning and Knowledge Extraction 2020-10-21

DocLLM: A Layout-Aware Generative Language Model for Multimodal Document Understanding

OPENALEX - Publications

Dongsheng Wang Natraj Raman Mathieu Sibue Zhiqiang Ma Petr Babkin and 4 more

10.18653/v1/2024.acl-long.463 article EN 2024-01-01

DocLLM: A layout-aware generative language model for multimodal document understanding

OPENALEX - Publications

Dongsheng Wang Natraj Raman Mathieu Sibue Zhiqiang Ma Petr Babkin and 4 more

Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual and spatial modalities. The visual cues offered by their complex layouts play a crucial role in comprehending these documents effectively. In this paper, we present DocLLM, a lightweight extension to traditional large language models (LLMs) for reasoning over visual documents, taking into account both textual semantics and spatial layout. Our model differs from existing multimodal LLMs...

10.48550/arxiv.2401.00908 preprint EN cc-by arXiv (Cornell University) 2024-01-01

A comparison of classification models for natural disaster and critical event detection from news

OPENALEX - Publications

Tim Nugent Fabio Petroni Natraj Raman Lucas Carstens Jochen L. Leidner

We present a contrastive study of document-level event classification for a range of seven different event types, namely floods, storms, fires, armed conflict, terrorism, infrastructure breakdown and labour unavailability, from English-language news. Our study compares supervised approaches, namely Support Vector Machine (SVM), Random Forest (RF), Convolutional Neural Network (CNN) and Hierarchical Attention Network (HAN). While past systems for Topic Detection and Tracking (TDT) and event extraction have proposed machine learning models, to...

10.1109/bigdata.2017.8258374 article EN 2017 IEEE International Conference on Big Data (Big Data) 2017-12-01

Action classification using a discriminative multilevel HDP-HMM

OPENALEX - Publications

Natraj Raman Stephen J. Maybank

10.1016/j.neucom.2014.12.009 article EN Neurocomputing 2014-12-16

An Extensible Event Extraction System With Cross-Media Event Resolution

OPENALEX - Publications

Fabio Petroni Natraj Raman Tim Nugent Armineh Nourbakhsh Žarko Panić and 2 more

The automatic extraction of breaking news events from natural language text is a valuable capability for decision support systems. Traditional systems tend to focus on extracting events from a single media source and often ignore cross-media references. Here, we describe a large-scale automated system for extracting natural disasters and critical events from both newswire text and social media. We outline a comprehensive architecture that can identify, categorize and summarize seven different event types - namely floods, storms, fires, armed conflict,...

10.1145/3219819.3219827 article EN 2018-07-19

Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

OPENALEX - Publications

Tianyu Cao Natraj Raman Danial Dervovic Chenhao Tan

As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study because financial reports are not only long but also use numbers and tables extensively. We propose a computational framework for characterizing...

10.48550/arxiv.2404.06162 preprint EN arXiv (Cornell University) 2024-04-09

Synthetic document generator for annotation-free layout recognition

OPENALEX - Publications

Natraj Raman Sameena Shah Manuela Veloso

10.1016/j.patcog.2022.108660 article EN Pattern Recognition 2022-03-24

BizGraphQA: A Dataset for Image-based Inference over Graph-structured Diagrams from Business Domains

OPENALEX - Publications

Petr Babkin William Watson Zhiqiang Ma Lucas Cecchi Natraj Raman and 2 more

Graph-structured diagrams, such as enterprise ownership charts or management hierarchies, are a challenging medium for deep learning models, as they not only require the capacity to model language and spatial relations but also the topology of links between entities and the varying semantics of what those links represent. Devising Question Answering models that automatically process and understand such diagrams has vast applications in many domains, and can move the state-of-the-art on multimodal document understanding to a new frontier. Curating...

10.1145/3539618.3591875 article EN Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval 2023-07-18

Non-parametric Hidden Conditional Random Fields for action classification

OPENALEX - Publications

Natraj Raman Stephen J. Maybank

Conditional Random Fields (CRF), a structured prediction method, combines probabilistic graphical models and discriminative classification techniques in order to predict class labels in sequence recognition problems. Its extension, the Hidden Conditional Random Fields (HCRF), uses hidden state variables to capture intermediate structures. The number of hidden states in an HCRF must be specified a priori. This number is often not known in advance. A non-parametric extension to the HCRF, with the number of hidden states automatically inferred from data, is proposed here. This is a significant advantage...

10.1109/ijcnn.2016.7727615 article EN 2016 International Joint Conference on Neural Networks (IJCNN) 2016-07-01

Structure and Semantics Preserving Document Representations

OPENALEX - Publications

Natraj Raman Sameena Shah Manuela Veloso

Retrieving relevant documents from a corpus is typically based on the semantic similarity between the document content and the query text. The inclusion of structural relationships between documents can benefit the retrieval mechanism by addressing semantic gaps. However, incorporating these relationships requires tractable mechanisms that balance structure with semantics and take advantage of the prevalent pre-train/fine-tune paradigm. We propose here a holistic approach to learning document representations by integrating intra-document content with inter-document...

10.1145/3477495.3532062 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2022-07-06

Municipal Bond Pricing: A Data Driven Method

OPENALEX - Publications

Natraj Raman Jochen L. Leidner

Price evaluations of municipal bonds have traditionally been performed by human experts based on their market knowledge and trading experience. Automated evaluation is an attractive alternative, providing the advantage of an objective estimation that is transparent, consistent, and scalable. In this paper, we present a statistical model to automatically estimate U.S. municipal bond yields based on trade transactions and study the agreement between human evaluations and machine generated estimates. The model uses piecewise polynomials constructed using basis...

10.3390/ijfs6030080 article EN cc-by International Journal of Financial Studies 2018-09-11

MultiGraph Attention Network for analyzing Company Relations

OPENALEX - Publications

Natraj Raman Grace Bang Azadeh Nematzadeh

When analyzing companies in financial markets, it is essential to identify those that share similar characteristics in order to assess their relative strengths and weaknesses. This challenging task requires representing the rich set of information associated with companies and the complex interrelations between them in a form amenable to pattern recognition. We present here a new deep representation learning method that encodes the company network graph into a low-dimensional embedding space, preserving its topological structure. Our solution...

10.1145/3373509.3373542 article EN 2019-10-23

ViziTex: Interactive Visual Sense-Making of Text Corpora

OPENALEX - Publications

Natraj Raman Sameena Shah Tucker Balch Manuela Veloso

Information visualization is critical to analytical reasoning and knowledge discovery. We present an interactive studio that integrates perceptive visualization techniques with powerful text analytics algorithms to assist humans in sense-making of large complex text corpora. The novel visual representations introduced here encode the features delivered by modern text mining models using advanced metaphors such as hypergraphs, nested topologies and tessellated planes. They enhance human-computer interaction experience for...

10.18653/v1/2021.dash-1.3 article EN cc-by 2021-01-01

Scalable Representation Learning for Multimodal Tabular Transactions

OPENALEX - Publications

Natraj Raman Sumitra Ganesh Manuela Veloso

Large language models (LLMs) are primarily designed to understand unstructured text. When directly applied to structured formats such as tabular data, they may struggle to discern inherent relationships and overlook critical patterns. While tabular representation learning methods can address some of these limitations, existing efforts still face challenges with sparse high-cardinality fields, precise numerical reasoning, and column-heavy tables. Furthermore, leveraging these learned representations for downstream...

10.48550/arxiv.2410.07851 preprint EN arXiv (Cornell University) 2024-10-10

Global Graph Counterfactual Explanation: A Subgraph Mapping Approach

OPENALEX - Publications

Yinhan He Wei Zheng Yaochen Zhu Jing Ma Saumitra Mishra and 3 more

Graph Neural Networks (GNNs) have been widely deployed in various real-world applications. However, most GNNs are black-box models that lack explanations. One strategy to explain GNNs is through counterfactual explanation, which aims to find minimum perturbations on input graphs that change the GNN predictions. Existing works on GNN counterfactual explanations primarily concentrate on the local-level perspective (i.e., generating counterfactuals for each individual graph), which suffers from information overload and lacks insights into...

10.48550/arxiv.2410.19978 preprint EN arXiv (Cornell University) 2024-10-25

Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation

OPENALEX - Publications

Ayan Sengupta Vaibhav Seth Ashok Kumar Pathak Natraj Raman Sriram Gopalakrishnan and 1 more

Large Language Models (LLMs) are highly resource-intensive to fine-tune due to their enormous size. While low-rank adaptation is a prominent parameter-efficient fine-tuning approach, it suffers from sensitivity to hyperparameter choices, leading to instability in model performance on downstream tasks. This paper highlights the importance of effective parameterization to reduce estimator variance and enhance the stability of final outputs. We propose MonteCLoRA, an efficient fine-tuning technique, employing Monte Carlo...

10.48550/arxiv.2411.04358 preprint EN arXiv (Cornell University) 2024-11-06
