NFDI4DS | UHH-SEMS - Publication Details

Souvick Ghosh

ORCID: 0000-0003-1610-9038

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5050277461

Research Areas

Topic Modeling
Advanced Text Analysis Techniques
Speech and dialogue systems
Expert finding and Q&A systems
Natural Language Processing Techniques
AI in Service Interactions
Sentiment Analysis and Opinion Mining
Misinformation and Its Impacts
Hate Speech and Cyberbullying Detection
Information Retrieval and Search Behavior
Public Relations and Crisis Communication
Spam and Phishing Detection
Authorship Attribution and Profiling
Social Media and Politics
Text and Document Classification Technologies
Mobile Crowdsensing and Crowdsourcing
Complex Network Analysis Techniques
Social Robot Interaction and HRI
Recommender Systems and Techniques
Open Education and E-Learning
Usability and User Interface Design
Text Readability and Simplification
Generative Adversarial Networks and Image Synthesis
Names, Identity, and Discrimination Research
Vehicular Ad Hoc Networks (VANETs)

San Jose State University
2021-2024

Human Computer Interaction (Switzerland)
2022

Rutgers, The State University of New Jersey
2017-2019

Rutgers Sexual and Reproductive Health and Rights
2019

Jadavpur University
2015-2018

Toward Multimodal Cyberbullying Detection

OPENALEX - Publications

Vivek K. Singh Souvick Ghosh Christin Jose

As human beings utilize computing technologies to mediate multiple aspects of their lives, cyberbullying has grown as an important societal challenge. Cyberbullying may lead deep psychiatric and emotional disorders for those affected. Hence, there is urgent need devise automated methods detection prevention. While recent efforts have defined sophisticated text processing detection, are yet few that leverage visual data automatically detect cyberbullying. Based on early analysis a public,...

10.1145/3027063.3053169 article EN 2017-05-01

Searching as Learning

OPENALEX - Publications

Souvick Ghosh R Manasa Chirag Shah

In this paper, we investigate the relationship between searching and learning, by conceptualizing information seeking as a learning process, an outcome of process. We present participants with four search tasks, each them designed to represent different cognitive levels learning. Through quantitative analysis participants» Web logs, examine how individual behavior is influenced task complexity tasks in hierarchical order. also explore perceived outcomes processes, actions, are related...

10.1145/3176349.3176386 article EN 2018-01-01

Towards automatic fake news classification

OPENALEX - Publications

Souvick Ghosh Chirag Shah

ABSTRACT The interaction of technology with humans has many adverse effects. rapid growth and outreach the social media Web have led to dissemination questionable untrusted content among a wider audience, which negatively influenced their lives judgment. Many research studies been conducted tackle detection spreading fake news, is misinformation that looks genuine. While first step such tasks would be classify claims associated based on credibility, next steps involve identifying hidden...

10.1002/pra2.2018.14505501125 article EN Proceedings of the Association for Information Science and Technology 2018-01-01

Unmasking Public Sentiment: A Sample Efficient Approach to Analyzing Twitter Opinion on US Aid to Ukraine

OPENALEX - Publications

Satanu Ghosh Souvick Ghosh Nikhil Dewitt D. F. McCoy

10.24251/hicss.2025.300 article EN Proceedings of the ... Annual Hawaii International Conference on System Sciences/Proceedings of the Annual Hawaii International Conference on System Sciences 2025-01-01

Empowering customer service with generative AI: enhancing agent performance while navigating challenges

OPENALEX - Publications

Carlos Costa Souvick Ghosh

Introduction. As large language models (LLMs), such as GPTs, become more intelligent, a key area of exploration is how these technologies can improve the customer experience. Contrary to common belief, many consumers, including Gen Z, prefer human-provided service, illustrating importance human-AI collaboration in space. Method. By leveraging author’s real-world knowledge enterprise management and service delivery, we reviewed numerous literature about AI, management, design synthesised...

10.47989/ir30iconf47566 article EN cc-by-nc Information Research an international electronic journal 2025-03-11

Part-of-speech Tagging of Code-Mixed Social Media Text

OPENALEX - Publications

Souvick Ghosh Satanu Ghosh Dipankar Das

A common step in the processing of any text is part-of-speech tagging input text.In this paper, we present an approach to tackle code-mixed from three different languages Bengali, Hindi, and Tamilapart English.Our system uses Conditional Random Field, a sequence learning method, which useful capture patterns sequences containing code switching tag each word with accurate information.We have used various pre-processing post-processing modules improve performance our system.The results were...

10.18653/v1/w16-5811 article EN cc-by 2016-01-01

Sentiment Identification in Code-Mixed Social Media Text

OPENALEX - Publications

Souvick Ghosh Satanu Ghosh Dipankar Das

Sentiment analysis is the Natural Language Processing (NLP) task dealing with detection and classification of sentiments in texts. While some tasks deal identifying presence sentiment text (Subjectivity analysis), other aim at determining polarity categorizing them as positive, negative neutral. Whenever there a text, it has source (people, group people or any entity) directed towards entity, object, event person. to determine subject, target valence sentiment. In our work, we try...

10.48550/arxiv.1707.01184 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Report on the future conversations workshop at CHIIR 2021

OPENALEX - Publications

Damiano Spina Johanne R. Trippas Paul Thomas Hideo Joho Katriina Byström and 15 more

The Future Conversations workshop at CHIIR'21 looked to the future of search, recommendation, and information interaction ask: where are opportunities for conversational interactions? What do we need get there? Furthermore, who stands benefit? was hands-on interactive. Rather than a series technical talks, solicited position statements on opportunities, problems, solutions in search all modalities (written, spoken, or multimodal). This paper -co-authored by organisers participants workshop-...

10.1145/3476415.3476421 article EN ACM SIGIR Forum 2021-06-01

Authorship Verification - An Approach based on Random Forest

OPENALEX - Publications

Promita Maitra Souvick Ghosh Dipankar Das

Authorship attribution, being an important problem in many areas in-cluding information retrieval, computational linguistics, law and journalism etc., has been identified as a subject of increasingly research interest the re-cent years. In case Author Identification task PAN at CLEF 2015, main focus was given on cross-genre cross-topic author verification tasks. We have used several word-based style-based features to identify dif-ferences between known unknown problems one set label ones...

10.48550/arxiv.1607.08885 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Complexity Metric for Code-Mixed Social Media Text

OPENALEX - Publications

Souvick Ghosh Satanu Ghosh Dipankar Das

An evaluation metric is an absolute necessity for measuring the performance of any system and complexity data. In this paper, we have discussed how to determine level code-mixed social media texts that are growing rapidly due multilingual interference. general, written in multiple languages often hard comprehend analyze. At same time, order meet demands analysis, it also necessary a particular document or text segment. Thus, present existing metrics determining code-mixing corpus, their...

10.13053/cys-21-4-2852 article EN Computación y Sistemas 2018-01-01

Identifying Citation Sentiment and its Influence while Indexing Scientific Papers

OPENALEX - Publications

Souvick Ghosh Chirag Shah

Sentiment analysis has proven to be a popular research area for analyzing social media texts, newspaper articles, and product reviews. However, sentiment of citation instances is relatively unexplored research. For scientific papers, it often assumed that the associated with inherently positive. This assumption due hedged nature in citations, which difficult identify classify. As result, most existing indexes focus only on frequency citation. In this paper, we highlight importance...

10.24251/hicss.2020.307 article EN Proceedings of the ... Annual Hawaii International Conference on System Sciences/Proceedings of the Annual Hawaii International Conference on System Sciences 2020-01-01

Exploring Online and Offline Search Behavior Based on the Varying Task Complexity

OPENALEX - Publications

R Manasa Souvick Ghosh Chirag Shah

In an information seeking episode, users often look for sources in online and offline environments depending on the task at hand. However, most times consider factors such as ease, time taken to complete task, number of be consulted essential while fulfilling task. our study, we explore role different cost variables -- explored based cognitive complexity levels, from Bloom»s taxonomy, by conducting a user study. We study search behaviors shown levels three variables. observed intriguing...

10.1145/3176349.3176890 article EN 2018-01-01

Toward Automatic Fake News Classification

OPENALEX - Publications

Souvick Ghosh Chirag Shah

The interaction of technology with humans have many adverse effects. rapid growth and outreach the social media Web led to dissemination questionable untrusted content among a wider audience, which has negatively influenced their lives judgment. Different election campaigns around world highlighted how ''fake news'' - misinformation that looks genuine can be targeted towards specific communities manipulate confuse them. Ever since, automatic fake news detection gained widespread attention...

10.24251/hicss.2019.273 article EN cc-by-nc-nd Proceedings of the ... Annual Hawaii International Conference on System Sciences/Proceedings of the Annual Hawaii International Conference on System Sciences 2019-01-01

Labeling of Query Words using Conditional Random Field

OPENALEX - Publications

Satanu Ghosh Souvick Ghosh Dipankar Das

This paper describes our approach on Query Word Labeling as an attempt in the shared task Mixed Script Information Retrieval at Forum for Evaluation (FIRE) 2015. The query is written Roman script and words were English or transliterated from Indian regional languages. A total of eight languages present addition to English. We also identified Named Entities special symbols part task. CRF based machine learning framework was used labeling individual with their corresponding language labels. a...

10.48550/arxiv.1607.08883 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Voices of the Stacks: A Multifaceted Inquiry into Academic Librarians' Tweets

OPENALEX - Publications

Souvick Ghosh James Thajudeen

ABSTRACT Twitter has emerged as an important forum for discussion among academic librarians. In this research, we take a mixed‐methods approach to study the thematic content and sentiment of tweets authored by librarians in United States, Canada, Kingdom. We found differences semantic themes present data from each country that point how engage on Twitter. While more work remains be done, cast new light members professional communities use social media. Our qualitative analysis identified 11...

10.1002/pra2.778 article EN Proceedings of the Association for Information Science and Technology 2023-10-01

Informing the Design of Conversational IR Systems

OPENALEX - Publications

Souvick Ghosh

Recent developments in conversational IR have raised questions about the nature of interactions which occur between user and system cognitive capabilities expected such systems. In our research, we investigate completeness existing theoretical frameworks explaining search data propose modifications to The linear transient speech makes it cognitively challenging for process a large amount information. We study evaluate users' preference modalities when using will help us understand how...

10.1145/3331184.3331422 article EN 2019-07-18

Exploring the Ideal Depth of Neural Network when Predicting Question Deletion on Community Question Answering

OPENALEX - Publications

Souvick Ghosh Satanu Ghosh

In recent years, Community Question Answering (CQA) has emerged as a popular platform for knowledge curation and archival. An interesting aspect of question answering is that it combines aspects from natural language processing, information retrieval, machine learning. this paper, we have explored how the depth neural network influences accuracy prediction deleted questions in question-answering forums. We used different shallow deep models analyzed relationships between number hidden...

10.1145/3368567.3368568 article EN 2019-12-12

Classifying Speech Acts using Multi-channel Deep Attention Network for Task-oriented Conversational Search Agents

OPENALEX - Publications

Souvick Ghosh Satanu Ghosh

Understanding human spoken dialogues in an information-seeking scenario is a significant challenge for IR researchers. Prior literature intelligent systems suggests that by identifying speech acts dialogues, we can identify the search intent and information needs of user. Therefore, this paper, have used to address problem natural language understanding conversational systems. First, collected human-system interaction data through Wizard-of-Oz study. Next, developed gold-standard dataset...

10.1145/3406522.3446057 article EN 2021-02-27

Information seeking in learning‐oriented search

OPENALEX - Publications

Souvick Ghosh Chirag Shah

ABSTRACT In this paper, we describe the different search behaviors exhibited by participants while performing learning‐oriented tasks. The tasks have been designed to represent cognitive levels of learning hierarchically. We investigate how searcher's behavior and perceived outcomes vary with increasing complexity. study, analyze log data participants, self‐reports, questionnaires interviews both descriptively statistically. Our results suggest that topic knowledge interest difficulty...

10.1002/pra2.2017.14505401115 article EN Proceedings of the Association for Information Science and Technology 2017-01-01

Beyond Bloom's Taxonomy: Integrating “searching as learning” and e‐learning research perspectives

OPENALEX - Publications

Rebecca Reynolds Eric M. Meyers Souvick Ghosh Alamir Novin

ABSTRACT Searching as learning work is growing in interest, however definitions of ‘learning’ this space have been somewhat narrow. Here we propose a panel sponsored by SIG InfoLearn that will feature presentations from three scholars whose falls the domain “searching learning,” followed synthesis presented fourth scholar along with one panelists, who draw key conceptual intersections among empirical research papers and then explicate linkages to existing sciences has potential further...

10.1002/pra2.2018.14505501093 article EN Proceedings of the Association for Information Science and Technology 2018-01-01

Determining sentiment in citation text and analyzing its impact on the proposed ranking index

OPENALEX - Publications

Souvick Ghosh Dipankar Das Tanmoy Chakraborty

Whenever human beings interact with each other, they exchange or express opinions, emotions, and sentiments. These opinions can be expressed in text, speech images. Analysis of these sentiments is one the popular research areas present day researchers. Sentiment analysis, also known as opinion mining tries to identify classify into two broad categories - positive negative. In recent years, scientific community has taken a lot interest analyzing sentiment textual data available various social...

10.48550/arxiv.1707.01425 preprint EN other-oa arXiv (Cornell University) 2017-01-01

"Don’t Downvote A\$\$\$\$\$\$s!!": An Exploration of Reddit’s Advice Communities

OPENALEX - Publications

Emily Cannon Bianca Crouse Souvick Ghosh Nicholas Rihn Kristen Chua

Advice forums are a crowdsourced way to reinforce cultural norms and moral behavior.Sites like Reddit contain massive amounts of natural language human interaction, with rules unique each individual subreddit community.To explore this data, we created dataset top 1000 posts from two such forums, r/AmItheAsshole r/relationships, extracted features including sentiment, similarity, word frequency, demographics using both algorithmic manual methods.Further, developed method extract demographic...

10.24251/hicss.2022.363 article EN Proceedings of the ... Annual Hawaii International Conference on System Sciences/Proceedings of the Annual Hawaii International Conference on System Sciences 2022-01-01

Coming Soon ...