Chang‐Tien Lu

ORCID: 0000-0003-3675-0199
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Anomaly Detection Techniques and Applications
  • Data Management and Algorithms
  • Data-Driven Disease Surveillance
  • Complex Network Analysis Techniques
  • Human Mobility and Location-Based Analysis
  • Traffic Prediction and Management Techniques
  • Geographic Information Systems Studies
  • Topic Modeling
  • Advanced Statistical Methods and Models
  • Advanced Database Systems and Queries
  • Data Mining Algorithms and Applications
  • Advanced Text Analysis Techniques
  • Advanced Graph Neural Networks
  • Network Security and Intrusion Detection
  • Natural Language Processing Techniques
  • Time Series Analysis and Forecasting
  • Spam and Phishing Detection
  • Domain Adaptation and Few-Shot Learning
  • Multimodal Machine Learning Applications
  • Misinformation and Its Impacts
  • Music and Audio Processing
  • Web Data Mining and Analysis
  • Algorithms and Data Compression
  • Sentiment Analysis and Opinion Mining
  • Data Stream Mining Techniques

Virginia Tech
2016-2025

Institute for Forecasting of the Slovak Academy of Sciences
2025

China South Industries Group (China)
2024

Stevens Institute of Technology
2024

Boston Children's Museum
2024

Boston Children's Hospital
2024

Kuwait University
2023

Microsoft (United States)
2023

University of Virginia
2013-2014

University of Houston - Victoria
2014

Due to the dramatic increase of fraud which results in loss billions dollars worldwide each year, several modern techniques detecting are continually developed and applied many business fields. Fraud detection involves monitoring behavior populations users order estimate, detect, or avoid undesirable behavior. Undesirable is a broad term including delinquency, fraud, intrusion, account defaulting. This paper presents survey current used credit card detection, telecommunication computer...

10.1109/icnsc.2004.1297040 article EN 2004-06-10

With the advance of sensor technologies, Multivariate Time Series classification (MTSC) problem, perhaps one most essential problems in time series data mining domain, has continuously received a significant amount attention recent decades. Traditional approaches based on Bag-of-Patterns or Shapelet have difficulty dealing with huge amounts feature candidates generated high-dimensional multivariate but promising performance even when training set is small. In contrast, deep learning methods...

10.1609/aaai.v34i04.6165 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

We describe the design, implementation, and evaluation of EMBERS, an automated, 24x7 continuous system for forecasting civil unrest across 10 countries Latin America using open source indicators such as tweets, news sources, blogs, economic indicators, other data sources. Unlike retrospective studies, EMBERS has been making forecasts into future since Nov 2012 which have (and continue to be) evaluated by independent T&E team (MITRE). Of note, successfully forecast June 2013 protests in...

10.1145/2623330.2623373 article EN 2014-08-22

10.1023/a:1023455925009 article EN GeoInformatica 2003-01-01

A quantitative analysis of tweets during the Ebola crisis reveals that lies, half-truths, and rumors can spread just like true news.

10.1109/mc.2014.361 article EN Computer 2014-12-01

Spatial event forecasting from social media is an important problem but encounters critical challenges, such as dynamic patterns of features (keywords) and geographic heterogeneity (e.g., spatial correlations, imbalanced samples, different populations in locations). Most existing approaches LASSO regression, query expansion, burst detection) are designed to address some these not all them. This paper proposes a novel multi-task learning framework which aims concurrently the challenges....

10.1145/2783258.2783377 article EN 2015-08-07

Identification of travelers' transportation modes is a fundamental step for various problems that arise in the domain such as travel demand analysis, transport planning, and traffic management. In this paper, we aim to identify purely based on their GPS trajectories. First, segmentation process developed partition user's trip into segments with only one mode. A majority studies have proposed mode inference models hand-crafted features, which might be vulnerable environmental conditions....

10.1109/tkde.2019.2896985 article EN IEEE Transactions on Knowledge and Data Engineering 2019-02-01

Social media is often viewed as a sensor into various societal events such disease outbreaks, protests, and elections. We describe the use of social crowdsourced to gain insight ongoing cyber-attacks. Our approach detects broad range cyber-attacks (e.g., distributed denial service (DDoS) attacks, data breaches, account hijacking) in weakly supervised manner using just small set seed event triggers requires no training or labeled samples. A new query expansion strategy based on convolution...

10.1145/3132847.3132866 preprint EN 2017-11-06

Amidst the COVID-19 pandemic, cyberbullying has become an even more serious threat. Our work aims to investigate viability of automatic multiclass detection model that is able classify whether a cyberbully targeting victim's age, ethnicity, gender, religion, or other quality. Previous literature not yet explored making fine-grained classifications o f s uch m agnitude, nd existing datasets suffer from quite severe class imbalances. To combat these challenges, we establish framework for...

10.1109/bigdata50022.2020.9378065 article EN 2021 IEEE International Conference on Big Data (Big Data) 2020-12-10

Statement of problemAI technology presents a variety benefits and challenges for educators.PurposeTo investigate whether ChatGPT Bard are valuable resources generating multiple-choice questions educators dental caries.Material methodsA book on caries was used. Sixteen paragraphs were extracted by an expert consultant based applicability potential developing questions. language models used to produce this input, 64 generated. Three specialists assessed the relevance, accuracy, complexity...

10.1016/j.heliyon.2024.e28198 article EN cc-by-nc-nd Heliyon 2024-03-19

Spatial databases, addressing the growing data management and analysis needs of spatial applications such as geographic information systems, have been an active area research for more than two decades. This has produced a taxonomy models space, types operators, query languages processing strategies, well indexes clustering techniques. However, is needed to improve support network field data, (e.g., cost models, bulk load). Another important need apply accomplishments newer applications,...

10.1109/69.755614 article EN IEEE Transactions on Knowledge and Data Engineering 1999-01-01

Identification of outliers can lead to the discovery unexpected, interesting, and useful knowledge. Existing methods are designed for detecting spatial in multidimensional geometric data sets, where a distance metric is available. In this paper, we focus on graph structured sets. We define statistical tests, analyze foundation underlying our approach, design several fast algorithms detect outliers, provide cost model outlier detection procedures. addition, experimental results from...

10.1145/502512.502567 article EN 2001-08-26

A spatial outlier is a spatially referenced object whose non-spatial attribute values are significantly different from the of its neighborhood. Identification outliers can lead to discovery unexpected, interesting, and useful patterns for further analysis. One drawback existing methods that normal objects tend be falsely detected as when their neighborhood contains true outliers. We propose suite detection algorithms overcome this disadvantage. formulate problem in general way design which...

10.1109/icdm.2003.1250986 article EN 2004-04-23

Spatial outliers are the spatial objects with distinct features from their surrounding neighbors. Detection of helps reveal valuable information large data sets. In many real applications, can not be simply abstracted as isolated points. They have different boundary, size, volume, and location. These properties affect impact a object on its neighbors should taken into consideration. this paper, we propose two outlier detection methods which integrate to outlierness measurement. Experimental...

10.1137/1.9781611972764.71 article EN 2006-04-20

This study explores factors significantly impact the acceptance of Wireless Internet via Mobile Technology (WIMT) in China. The results indicate that WIMT is related with of: perceived usefulness, ease use, social influences, wireless trust environment, and facilitating conditions. It provides diagnostic insight into how different influence user intention to accept China, thus help business build solid strategy prompt m-commerce there.

10.58729/1941-6687.1008 article EN Communications of the IIMA 2014-05-29

Crowd sourcing is based on a simple but powerful concept: Virtually anyone has the potential to plug in valuable information. The concept revolves around large groups of people or community handling tasks that have traditionally been associated with specialist small group experts. With advent smart devices, many mobile applications are already tapping into crowd report issues and traffic problems, more can be done. While most these work well for average user, it neglects information needs...

10.1145/2093973.2094064 article EN 2011-11-01

Infectious disease epidemics such as influenza and Ebola pose a serious threat to global public health. It is crucial characterize the evolution of ongoing epidemic efficiently accurately. Computational epidemiology can model progress underlying contact network, but suffers from lack real-time fine-grained surveillance data. Social media, on other hand, provides timely detailed surveillance, insensible network model. This paper proposes novel semi-supervised deep learning framework that...

10.1109/icdm.2015.39 article EN 2015-11-01

Traffic prediction is critical for the success of intelligent transportation systems (ITS). However, most spatio-temporal models suffer from high mathematical complexity and low tune-up flexibility. This article presents a novel random effects (STRE) model that has reduced computational due to dimension reduction, with additional flexibility provided by basis function capable taking traffic patterns into account. Bellevue, WA, was selected as test site its widespread deployment loop...

10.1080/15472450.2015.1072050 article EN Journal of Intelligent Transportation Systems 2015-07-25

EMBERS is an anticipatory intelligence system forecasting population-level events in multiple countries of Latin America. A deployed from 2012, has been generating alerts 24x7 by ingesting a broad range data sources including news, blogs, tweets, machine coded events,currency rates, and food prices. In this paper, we describe our experiences operating continuously for nearly 4 years, with specific attention to the discoveries it enabled, correct as well missed forecasts, lessons learnt...

10.1145/2939672.2939709 article EN 2016-08-08

Event forecasting in Twitter is an important and challenging problem. Most existing approaches focus on temporal events (such as elections sports) do not consider spatial features their underlying correlations. In this paper, we propose a generative model for spatiotemporal event Twitter. Our characterizes the development of future by jointly modeling structural contexts burstiness. An effective inference algorithm developed to train parameters. Utilizing trained model, alignment likelihood...

10.1137/1.9781611974010.108 article EN 2015-06-30
Coming Soon ...