Natraj Raman

ORCID: 0009-0008-8866-1482
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Human Pose and Action Recognition
  • Stock Market Forecasting Methods
  • Advanced Text Analysis Techniques
  • Multimodal Machine Learning Applications
  • Semantic Web and Ontologies
  • Complex Systems and Time Series Analysis
  • Handwritten Text Recognition Techniques
  • Credit Risk and Financial Regulations
  • Anomaly Detection Techniques and Applications
  • Complex Network Analysis Techniques
  • Gait Recognition and Analysis
  • Image Retrieval and Classification Techniques
  • Financial Markets and Investment Strategies
  • Bayesian Methods and Mixture Models
  • Sentiment Analysis and Opinion Mining
  • Video Surveillance and Tracking Methods
  • Data Mining Algorithms and Applications
  • Sustainable Finance and Green Bonds
  • Generative Adversarial Networks and Image Synthesis
  • Statistical Methods and Inference
  • Music and Audio Processing
  • Explainable Artificial Intelligence (XAI)
  • Video Analysis and Summarization

Harvard University
2025

Morgan Stanley (United Kingdom)
2022-2023

JPMorgan Chase & Co (United States)
2022

Georgia Institute of Technology
2022

IBM Research - Almaden
2021

California University of Pennsylvania
2021

University of Colorado System
2021

Hong Kong University of Science and Technology
2020

University of Hong Kong
2020

Carleton College
2020

The Fugl Meyer Assessment (FMA) is a widely-used assessment for tracking motor function recovery post-stroke. Due to the limited access rehabilitation, there exists need remote and automated solutions. Wearable sensors data-driven methods have shown promise enabling automatic upper extremity FMA (FMA-UE) estimation, but minimizing user input motion aligning with current clinical activities will aid adoption of sensor-based assessments. In this work, we present an FMA-UE estimator which can...

10.1109/jbhi.2025.3542037 article EN IEEE Journal of Biomedical and Health Informatics 2025-01-01

Raj Shah, Kunal Chawla, Dheeraj Eidnani, Agam Wendi Du, Sudheer Chava, Natraj Raman, Charese Smiley, Jiaao Chen, Diyi Yang. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022.

10.18653/v1/2022.emnlp-main.148 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2022-01-01

Synthetic data has made tremendous strides in various commercial settings including finance, healthcare, and virtual reality. We present a broad overview of prototypical applications synthetic the financial sector particular provide richer details for few select ones. These cover wide variety modalities tabular, time-series, event-series, unstructured arising from both markets retail applications. Since finance is highly regulated industry, potential approach dealing with issues related to...

10.48550/arxiv.2401.00081 preprint EN other-oa arXiv (Cornell University) 2024-01-01

The integration of Environmental, Social and Governance (ESG) considerations into business decisions investment strategies have accelerated over the past few years. It is important to quantify extent which ESG-related conversations are carried out by companies so that their impact on operations can be objectively assessed. However, profiling ESG language challenging due its multi-faceted nature lack supervised datasets. This research study aims detect historical trends in discussions...

10.3390/make2040025 article EN cc-by Machine Learning and Knowledge Extraction 2020-10-21

Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the intersection of textual spatial modalities. The visual cues offered by their complex layouts play a crucial role in comprehending these effectively. In this paper, we present DocLLM, lightweight extension to traditional large language models (LLMs) for reasoning over documents, taking into account both layout. Our model differs from existing multimodal LLMs...

10.48550/arxiv.2401.00908 preprint EN cc-by arXiv (Cornell University) 2024-01-01

We present a contrastive study of document-level event classification range seven different types, namely floods, storms, fires, armed conflict, terrorism, infrastructure breakdown and labour unavailability from English-language news. Our compares supervised approaches, Support Vector Machine (SVM), Random Forest (RF), Convolutional Neural Network (CNN) Hierarchical Attention (HAN). While past systems for Topic Detection Tracking (TDT) extraction have proposed machine learning models, to...

10.1109/bigdata.2017.8258374 article EN 2021 IEEE International Conference on Big Data (Big Data) 2017-12-01

The automatic extraction of breaking news events from natural language text is a valuable capability for decision support systems. Traditional systems tend to focus on extracting single media source and often ignore cross-media references. Here, we describe large-scale automated system disasters critical both newswire social media. We outline comprehensive architecture that can identify, categorize summarize seven different event types - namely floods, storms, fires, armed conflict,...

10.1145/3219819.3219827 article EN 2018-07-19

As large language models (LLMs) expand the power of natural processing to handle long inputs, rigorous and systematic analyses are necessary understand their abilities behavior. A salient application is summarization, due its ubiquity controversy (e.g., researchers have declared death summarization). In this paper, we use financial report summarization as a case study because reports not only but also numbers tables extensively. We propose computational framework for characterizing...

10.48550/arxiv.2404.06162 preprint EN arXiv (Cornell University) 2024-04-09

Graph-structured diagrams, such as enterprise ownership charts or management hierarchies, are a challenging medium for deep learning models they not only require the capacity to model language and spatial relations but also topology of links between entities varying semantics what those represent. Devising Question Answering that automatically process understand diagrams have vast applications many domains, can move state-of-the-art on multimodal document understanding new frontier. Curating...

10.1145/3539618.3591875 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2023-07-18

Conditional Random Fields (CRF), a structured prediction method, combines probabilistic graphical models and discriminative classification techniques in order to predict class labels sequence recognition problems. Its extension the Hidden (HCRF) uses hidden state variables capture intermediate structures. The number of states an HCRF must be specified priori. This is often not known advance. A non-parametric HCRF, with automatically inferred from data, proposed here. significant advantage...

10.1109/ijcnn.2016.7727615 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2016-07-01

Retrieving relevant documents from a corpus is typically based on the semantic similarity between document content and query text. The inclusion of structural relationship can benefit retrieval mechanism by addressing gaps. However, incorporating these relationships requires tractable mechanisms that balance structure with semantics take advantage prevalent pre-train/fine-tune paradigm. We propose here holistic approach to learning representations integrating intra-document inter-document...

10.1145/3477495.3532062 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2022-07-06

Price evaluations of municipal bonds have traditionally been performed by human experts based on their market knowledge and trading experience. Automated evaluation is an attractive alternative providing the advantage objective estimation that transparent, consistent, scalable. In this paper, we present a statistical model to automatically estimate U.S bond yields trade transactions study agreement between machine generated estimates. The uses piecewise polynomials constructed using basis...

10.3390/ijfs6030080 article EN cc-by International Journal of Financial Studies 2018-09-11

When analyzing companies in financial markets, it is essential to identify those that share similar characteristics order assess their relative strengths and weaknesses. This challenging task requires representing the rich set of information associated with complex interrelations between them a form amenable pattern recognition. We present here new deep representation learning method encodes network graph low-dimensional embedding space, preserving its topological structure. Our solution...

10.1145/3373509.3373542 article EN 2019-10-23

Information visualization is critical to analytical reasoning and knowledge discovery. We present an interactive studio that integrates perceptive techniques with powerful text analytics algorithms assist humans in sense-making of large complex corpora. The novel visual representations introduced here encode the features delivered by modern mining models using advanced metaphors such as hypergraphs, nested topologies tessellated planes. They enhance human-computer interaction experience for...

10.18653/v1/2021.dash-1.3 article EN cc-by 2021-01-01

Large language models (LLMs) are primarily designed to understand unstructured text. When directly applied structured formats such as tabular data, they may struggle discern inherent relationships and overlook critical patterns. While representation learning methods can address some of these limitations, existing efforts still face challenges with sparse high-cardinality fields, precise numerical reasoning, column-heavy tables. Furthermore, leveraging learned representations for downstream...

10.48550/arxiv.2410.07851 preprint EN arXiv (Cornell University) 2024-10-10

Graph Neural Networks (GNNs) have been widely deployed in various real-world applications. However, most GNNs are black-box models that lack explanations. One strategy to explain is through counterfactual explanation, which aims find minimum perturbations on input graphs change the GNN predictions. Existing works explanations primarily concentrate local-level perspective (i.e., generating counterfactuals for each individual graph), suffers from information overload and lacks insights into...

10.48550/arxiv.2410.19978 preprint EN arXiv (Cornell University) 2024-10-25

Large Language Models (LLMs) are highly resource-intensive to fine-tune due their enormous size. While low-rank adaptation is a prominent parameter-efficient fine-tuning approach, it suffers from sensitivity hyperparameter choices, leading instability in model performance on downstream tasks. This paper highlights the importance of effective parameterization reduce estimator variance and enhance stability final outputs. We propose MonteCLoRA, an efficient technique, employing Monte Carlo...

10.48550/arxiv.2411.04358 preprint EN arXiv (Cornell University) 2024-11-06
Coming Soon ...