Yinghui Wu

ORCID: 0000-0003-3991-5155
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Graph Theory and Algorithms
  • Advanced Graph Neural Networks
  • Data Management and Algorithms
  • Advanced Database Systems and Queries
  • Data Quality and Management
  • Complex Network Analysis Techniques
  • Scientific Computing and Data Management
  • Semantic Web and Ontologies
  • Data Mining Algorithms and Applications
  • Anomaly Detection Techniques and Applications
  • Distributed systems and fault tolerance
  • Caching and Content Delivery
  • Seismic Imaging and Inversion Techniques
  • Cloud Computing and Resource Management
  • Complexity and Algorithms in Graphs
  • Energy Load and Power Forecasting
  • Solar Radiation and Photovoltaics
  • Network Security and Intrusion Detection
  • Machine Learning in Materials Science
  • Hydraulic Fracturing and Reservoir Analysis
  • Drilling and Well Engineering
  • Peer-to-Peer Network Technologies
  • Power System Optimization and Stability
  • Distributed and Parallel Computing Systems
  • Smart Grid Security and Resilience

Shanghai Jiao Tong University
2022-2025

Beijing Electronic Science and Technology Institute
2025

Wuhan Children's Hospital
2023-2024

Huazhong University of Science and Technology
2023-2024

Monash University
2024

Materials Science & Engineering
2024

Case Western Reserve University
1969-2024

Jinan Maternity And Care Hospital
2024

Suzhou Research Institute
2024

Kuang Tien General Hospital
2024

Graph pattern matching is typically defined in terms of subgraph isomorphism, which makes it an np-complete problem. Moreover, requires bijective functions, are often too restrictive to characterize patterns emerging applications. We propose a class graph patterns, edge denotes the connectivity data within predefined number hops. In addition, we define based on notion bounded simulation, extension simulation. show that with this revision, can be performed cubic-time, by providing such...

10.14778/1920841.1920878 article EN Proceedings of the VLDB Endowment 2010-09-01

It is common to find graphs with millions of nodes and billions edges in, e.g., social networks. Queries on such are often prohibitively expensive. These motivate us propose query preserving graph compression, compress relative a class Λ queries users' choice. We compute small Gr from G that (a) for any Q Ε Q, Q(G) = Q'(Gr), where Q' can be efficiently computed Q; (b) algorithm computing directly applied evaluating as is. That is, while we cannot lower the complexity queries, reduce data...

10.1145/2213836.2213855 article EN 2012-05-20

It is increasingly common to find graphs in which edges bear different types, indicating a variety of relationships. For such we propose class reachability queries and graph patterns, an edge specified with regular expression certain form, expressing the connectivity data via various types. In addition, define pattern matching based on revised notion simulation. On emerging applications as social networks, show that these are capable finding more sensible information than their traditional...

10.1109/icde.2011.5767858 article EN 2011-04-01

It is increasingly common to find real-life data represented as networks of labeled, heterogeneous entities. To query these networks, one often needs identify the matches a given graph in (typically large) network modeled target graph. Due noise and lack fixed schema graph, can substantially differ from its both structure node labels, thus bringing challenges querying tasks. In this paper, we propose NeMa (Network Match), neighborhood-based subgraph matching technique for networks. (1)...

10.14778/2535569.2448952 article EN Proceedings of the VLDB Endowment 2013-01-01

-With the advent of phasor measurement units (PMUs), high resolution synchronized measurements enables real time system monitoring and control. PMUs transmit data to local controllers in substations concentrators for wide control application center. They provide real-time critical power applications such as remedial action schemes, oscillation detection, state estimation. The quality from is smart grid applications. Several methods are developed detect anomalies series data, tailored PMU...

10.1109/tsg.2018.2816027 article EN publisher-specific-oa IEEE Transactions on Smart Grid 2018-03-15

Graph pattern matching has been widely used in e.g., social data analysis. A number of algorithms have developed that, given a graph Q and G , compute the set M(Q,G) matches . However, these often return an excessive matches, are expensive on large real-life graphs. Moreover, practice many queries to find specific node, rather than entire This paper studies top- k matching. (1) We revise defined terms simulation, by supporting designated output node u o Given it is those nodes that match...

10.14778/2536258.2536263 article EN Proceedings of the VLDB Endowment 2013-08-01

Graph pattern matching has become a routine process in emerging applications such as social networks. In practice data graph is typically large, and frequently updated with small changes. It often prohibitively expensive to recompute matches from scratch via batch algorithms when the updated. With this comes need for incremental that compute changes response updates, minimize unnecessary recomputation. This paper investigates defined terms of simulation, bounded simulation subgraph...

10.1145/1989323.1989420 article EN 2011-06-12

Graph pattern matching is commonly used in a variety of emerging applications such as social network analysis. These highlight the need for studying following two issues. First, graph traditionally defined terms subgraph isomorphism or simulation. notions, however, often impose too strong topological constraint on graphs to identify meaningful matches. Second, practice typically large, and frequently updated with small changes. It prohibitively expensive recompute matches starting from...

10.1145/2489791 article EN ACM Transactions on Database Systems 2013-08-01

We propose graph-pattern association rules (GPARs) for social media marketing. Extending item-sets, GPARs help us discover regularities between entities in graphs, and identify potential customers by exploring influence. study the problem of discovering top- k diversified GPARs. While this is NP-hard, we develop a parallel algorithm with accuracy bound. also identifying it provide scalable that guarantees polynomial speedup over sequential algorithms increase processors. Using real-life...

10.14778/2824032.2824048 article EN Proceedings of the VLDB Endowment 2015-08-01

We propose a class of functional dependencies for graphs, referred to as GFDs. GFDs capture both attribute-value and topological structures entities, subsume conditional (CFDs) special case. show that the satisfiability implication problems are coNP-complete NP-complete, respectively, no worse than their CFD counterparts. also validation problem is coNP-complete. Despite intractability, we develop parallel scalable algorithms catching violations in large-scale graphs. Using real-life...

10.1145/2882903.2915232 article EN Proceedings of the 2022 International Conference on Management of Data 2016-06-16

Querying heterogeneous and large-scale knowledge graphs is expensive. This paper studies a graph summarization framework to facilitate search. (1) We introduce class of reduced summaries. Characterized by approximate pattern matching, these summaries are capable summarizing entities in terms their neighborhood similarity up certain hop, using small informative patterns. (2) study diversified problem. Given graph, it discover top-k that maximize bi-criteria function, characterized both...

10.1109/tkde.2018.2807442 article EN publisher-specific-oa IEEE Transactions on Knowledge and Data Engineering 2018-02-22

Abstract The downhole monitoring of strain using Fiber Optics (FO) can reveal unique information about the propagation and geometry hydraulic fractures between nearby wells during stimulation production. This work aims at creating a catalogue commonly observed strain-rate signals captured in not yet stimulated observation well equipped with either permanently or temporarily installed FO cable. is result an informal collaboration experience users from academia, service providers, consulting...

10.2118/212370-ms article EN SPE Hydraulic Fracturing Technology Conference and Exhibition 2023-01-23

In a variety of emerging applications one needs to decide whether graph G matches another p , i.e. has topological structure similar that . The traditional notions homomorphism and isomorphism often fall short capturing the structural similarity in these applications. This paper studies revisions notions, providing full treatment from complexity algorithms. (1) We propose p-homomorphism (p -hom) 1-1 -hom, which extend subgraph isomorphism, respectively, by mapping edges paths another,...

10.14778/1920841.1920986 article EN Proceedings of the VLDB Endowment 2010-09-01

This paper presents GRAPE, a parallel system for graph computations. GRAPE differs from prior systems in its ability to parallelize existing sequential algorithms as whole. Underlying are simple programming model and principled approach, based on partial evaluation incremental computation. We show that can be "plugged into" with minor changes, get parallelized. As long the correct, their parallelization guarantees terminate correct answers under monotonic condition. Moreover, we MapReduce,...

10.1145/3035918.3035942 article EN 2017-05-09

Querying complex graph databases such as knowledge graphs is a challenging task for non-professional users. Due to their schemas and variational information descriptions, it becomes very hard users formulate query that can be properly processed by the existing systems. We argue user-friendly engine, must support various kinds of transformations synonym, abbreviation, ontology. Furthermore, derived results ranked in principled manner. In this paper, we introduce novel framework enabling...

10.14778/2732286.2732293 article EN Proceedings of the VLDB Endowment 2014-03-01

Protection systems are one of the most critical components in transmission system and becoming more digital with ongoing automation. These prone to vulnerabilities/attacks, exploitation these vulnerabilities may cause major impacts on electric grid performance. Multiple alarms reported control center could be a result faults (expected operations) or failures protection (anomalies/ unexpected operation). Situational awareness gained through sensors such as phasor measurement unit (PMU) data...

10.1109/tia.2019.2928500 article EN publisher-specific-oa IEEE Transactions on Industry Applications 2019-07-15

Summary This paper presents a case study of fault reactivation and induced seismicity during multistage hydraulic fracturing in Sichuan Basin, China. The field microseismicity data delineate activated near the toe horizontal well. spatio-temporal characteristics indicate that seismic activity on first three stages is directly related to fluid injection, while after Stage 3, possibly due relaxation fault. fault-related events have larger magnitudes different frequency-magnitude compared...

10.2118/199883-pa article EN SPE Journal 2019-12-31

With ongoing automation and digitization of the electric power system, several Phasor Measurement Units (PMUs) have been deployed for monitoring control. PMU data can multiple anomalies, many researchers in past concentrated on training machine/deep learning algorithms offline anomaly detection over (i.e., not real-time). These algorithms, when trained a sample rather than population dataset, fail to consider dynamic behavior grid real-time, resulting low accuracy. Considering (e.g., change...

10.1109/tsg.2021.3054375 article EN IEEE Transactions on Smart Grid 2021-01-26

Introduction With the rapid aging population, mental health of older adults is paid more and attention. Anxiety a common illness in adults. Therefore, study aimed to explore current situation anxiety its factors among elderly China. Methods Based on data from 2018 Chinese Longitudinal Healthy Longevity Survey (CLHLS), total 10,982 respondents aged 60 above were selected. Generalized Disorder (GAD-7) scale was used assess anxiety. Univariate multivariate analysis analyze influencing Random...

10.3389/fpsyt.2023.1038049 article EN cc-by Frontiers in Psychiatry 2023-02-02

Given a graph query Q posed on knowledge G, top-k querying is to find k matches in G with the highest ranking score according function. Fast search graphs challenging as both traversal and similarity are expensive. Conventional typically based threshold algorithm (TA), which can no long fit demand new setting. This work proposes STAR, framework. It has two components: (a) fast for star queries, (b) an assembling general queries. The uses building block iteratively sweeps match lists...

10.1109/icde.2016.7498307 article EN 2016-05-01

There has recently been a lot of ongoing research in the areas fairness, bias and explainability machine learning (ML) models due to self-evident or regulatory requirements various ML applications. We make following observation: All these approaches require robust understanding relationship between data used train them. In this work, we introduce provenance tracking problem: fundamental idea is automatically track which columns dataset have derive features/labels an model. discuss challenges...

10.1145/3394486.3403205 preprint EN 2020-08-20

In recent years, a large number of photovoltaic (PV) systems have been added to the electrical grid as well installed off-grid systems. The trend suggests that deployment PV will continue rise in future. Thus, accurate forecasting performance is critical for reliability Due complex non-linear variability power output systems, non-trivial task. This affects stability and planning system network, can reduce uncertainty caused during operation. this work, we leverage spatial temporal coherence...

10.1609/aaai.v35i17.17799 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Generating explanations for graph neural networks (GNNs) has been studied to understand their behavior in analytical tasks such as classification. Existing approaches aim the overall results of GNNs rather than providing specific class labels interest, and may return explanation structures that are hard access, nor directly queryable.We propose GVEX, a novel paradigm generates Graph Views EXplanation. (1) We design two-tier structure called views. An view consists set patterns induced...

10.1145/3639295 article EN Proceedings of the ACM on Management of Data 2024-03-12
Coming Soon ...