- Hate Speech and Cyberbullying Detection
- Knowledge Management and Sharing
- Social Media and Politics
- Indian History and Philosophy
- Spam and Phishing Detection
- South Asian Studies and Conflicts
- Biomedical Text Mining and Ontologies
- Cybercrime and Law Enforcement Studies
- Bullying, Victimization, and Aggression
- Privacy, Security, and Data Protection
- Natural Language Processing Techniques
- Digital Marketing and Social Media
- Misinformation and Its Impacts
- Semantic Web and Ontologies
- Information Retrieval and Search Behavior
- Politics and Conflicts in Afghanistan, Pakistan, and Middle East
- Topic Modeling
- Complex Network Analysis Techniques
- Advanced Text Analysis Techniques
- Human Mobility and Location-Based Analysis
- Mobile Crowdsensing and Crowdsourcing
University of Maryland, College Park
2017-2022
A fundamental part of conducting cross-disciplinary web science research is having useful, high-quality datasets that provide value to studies across disciplines. In this paper, we introduce a large, hand-coded corpus online harassment data. team researchers collaboratively developed codebook using grounded theory and labeled 35,000 tweets. Our resulting dataset has roughly 15% positive examples 85% negative examples. This data useful for training machine learning models, identifying textual...
ABSTRACT During the COVID‐19 pandemic, some research that otherwise would have been conducted in person pivoted to online platforms. This poster paper describes lessons learned from an study of information behavior by individuals with long‐term needs, focusing on what was about how conduct such a online. Broadly, three themes are evident: (1) Trust mechanisms were weaker than be expected for in‐person study, resulting greater coordination difficulties; (2) What seemed fair reimbursement rate...
A user with a standing need for updates on current events uses structured exploration process finding and reviewing new documents, the comparing document information to her mental model. To avoid missing key changes topic, should see some documents each of subtopics available that day. This research includes system evaluation approach this use case.
This paper analyzes 246 fake news websites previously identified in three research projects.From this dataset, we extract a set of authors who have written for these sites 2016, which make publicly available.Applying novel shared authorship construct, analyze network sites.This analysis shows tight cluster sites, with trend article reposting, wherein copy content from each other but preserve author bylines.We also show the most central authors, while associated different share common...