- Graph Theory and Algorithms
- Advanced Graph Neural Networks
- Data Quality and Management
- Data Mining Algorithms and Applications
- Data Management and Algorithms
- Ion Channels and Receptors
- Complexity and Algorithms in Graphs
- Cloud Computing and Resource Management
- Soil and Unsaturated Flow
- Rough Sets and Fuzzy Logic
- Complex Network Analysis Techniques
- Soil erosion and sediment transport
- Privacy-Preserving Technologies in Data
- Irrigation Practices and Water Management
- Wave and Wind Energy Systems
- Ocean Waves and Remote Sensing
- Cardiovascular, Neuropeptides, and Oxidative Stress Research
- Innovative Microfluidic and Catalytic Techniques Innovation
- Parallel Computing and Optimization Techniques
- Recommender Systems and Techniques
- Probabilistic and Robust Engineering Design
- Soil Moisture and Remote Sensing
- Merger and Competition Analysis
- Pluripotent Stem Cells Research
- Stock Market Forecasting Methods
Chinese Academy of Sciences
2006-2024
Beihang University
2013-2024
Taiyuan University of Science and Technology
2023
Institute of Geographic Sciences and Natural Resources Research
2023
University of Chinese Academy of Sciences
2023
University of Science and Technology of China
2019-2022
Institute of Software
2022
Guangzhou Institutes of Biomedicine and Health
2019-2021
Alibaba Group (United States)
2019-2021
Guangzhou Medical University
2021
This paper presents GRAPE, a parallel system for graph computations. GRAPE differs from prior systems in its ability to parallelize existing sequential algorithms as whole. Underlying are simple programming model and principled approach, based on partial evaluation incremental computation. We show that can be "plugged into" with minor changes, get parallelized. As long the correct, their parallelization guarantees terminate correct answers under monotonic condition. Moreover, we MapReduce,...
The incremental problem for a class Q of graph queries aims to compute, given query in 'Q, G, output Q(G) and updates Δ G as input, changes O such that Q(G ⊕ G) = O. It is called bounded if its cost can be expressed polynomial function the sizes Q, reduce computations on possibly big small No matter how desirable, however, our first results are negative: common traversal, connectivity, keyword search pattern matching, their problems unbounded.
Keys for graphs aim to uniquely identify entities represented by vertices in a graph. We propose class of keys that are recursively defined terms graph patterns, and interpreted with subgraph isomorphism. Extending conventional relations XML, these find applications object identification, knowledge fusion social network reconciliation. As an application, we study the entity matching problem that, given G set Σ keys, is all pairs (vertices) identified Σ. show intractable, cannot be...
This paper studies incremental graph partitioning. Given a (vertex-cut or edge-cut) partition C(G) of G and updates ΔG to G, it is compute changes ΔO C(G), yielding the updated such that (a) new load-balanced, (b) its cut size minimum, (c) are also minimum. We show this tri-criteria optimization problem NP-complete, even when has constant size. Worse yet, unbounded, i.e., there exists no algorithm computes with cost determined only by ΔO. approach proposing incrementalize widely-used...
GraphScope is a system and set of language extensions that enable new programming interface for large-scale distributed graph computing. It generalizes previous processing frameworks (e.g. , Pregel, GraphX) databases ( e.g ., Janus-Graph, Neptune) in two important ways: by exposing unified to wide variety computations such as traversal, pattern matching, iterative algorithms neural networks within high-level language; supporting the seamless integration highly optimized engine general...
This paper proposes to deduce certain fixes graphs G based on data quality rules Σ and ground truth Γ ( i.e., validated attribute values entity matches). We fix errors detected by in such that the are assured correct as long Σand correct. two paradigms. (a) interact with users "incrementally" online. Whenever pick a small set V 0 of nodes , we all pertaining accumulate process. (b) Based accumulated Γ, repair entire graph offline; while this may not guaranteed certain. develop techniques for...
This paper tackles two challenges to discovery of graph rules. Existing methods often (a) return an excessive number rules, and (b) do not scale with large graphs given the intractability problem. We propose application-driven strategy cut back rules data that are irrelevant users' interests, by training a machine learning (ML) model identify pertaining application. Moreover, we introduce sampling method reduce big G set H small sample graphs. Given expected support recall bounds, is able...
Bryopsis hypnoides Lamouroux is a siphonous green alga, and its extruded protoplasm can aggregate spontaneously in seawater develop into mature individuals. The chloroplast of B. the biggest organelle cell shows strong autonomy. To better understand this organelle, we sequenced analyzed genome alga.A total 111 functional genes, including 69 potential protein-coding 5 ribosomal RNA 37 tRNA genes were identified. size (153,429 bp), arrangement, inverted-repeat (IR)-lacking structure DNA...
This paper proposes a class of graph association rules, denoted by GARs, to specify regularities between entities in graphs. A GAR is combination pattern and dependency; it may take as predicates ML (machine learning) classifiers for link prediction. We show that GARs help us catch incomplete information schemaless graphs, predict links social identify potential customers digital marketing, extend functional dependencies (GFDs) capture both missing inconsistencies. formalize deduction with...
This paper studies how to catch duplicates, mismatches and conflicts in the same process. We adopt a class of entity enhancing rules that embed machine learning predicates, unify resolution conflict resolution, are collectively defined across multiple relations. detect discrepancies as violations such rules. establish complexity discrepancy detection incremental problems with rules; they both NP-complete W[1]-hard. To cope intractability scale large datasets, we develop parallel algorithms...
The incremental problem for a class \( {\mathcal {Q}} \) of graph queries aims to compute, given query Q \in , G answers ( ) in and updates ΔG as input, changes ΔO output such that ⊕ = )⊕ . It is called bounded if its cost can be expressed polynomial function the sizes which reduces computations on possibly big small No matter how desirable, however, our first results are negative: For common traversal, connectivity, keyword search, pattern matching, maximum cardinality their problems...
B = linear term coefficient vector bi element i of C quadratic matrix cij jth in ith row D variance X function E mean value f, Y, H generic functions h lining ring thickness index j column k the variable number set r input variables t time tc gas temperature X, Xi xmax maximum Yj output response Yj;max extremum c heat transfer 0 constant min minimum casing radial deformation expansion thermal conductivity normal distribution
This paper proposes a class of temporal association rules, denoted by TACOs, for event prediction. As opposed to previous graph TACOs monitor updates graphs, and can be used capture interests in recommendation catch frauds response behavior changes, among other things. are defined on graphs terms change patterns (temporal) conditions, may carry machine learning (ML) predicates We settle the complexity reasoning about including their satisfiability, implication prediction problems. develop...
Saline soils are of great concern globally. Selecting the Yellow River Delta as a model site, influence reclamation on soil salinity and saline quality was investigated. Soil index (SQI) applied to statistically analyze 210 profile samples collected at seven depth layers in 30 sampling sites among native three croplands (peanut, cotton, wheat) May 2020. After reclamation, salt content (SSC) reduced from 4.52 g/kg 1.44 after with degree reducing severe slight. The nitrate nitrogen (NO3−-N)...
The ever-growing amount of dynamic graph data demands efficient techniques incremental processing. However, algorithms are challenging to develop. Existing approaches usually require users manually design nontrivial operators, or choose different memoization strategies for certain specific types computation, limiting the usability and generality. In light these challenges, we propose Ingress, an automated system <u>in</u>cremental <u>g</u>raph...
We propose an extension of graph patterns, referred to as conditional patterns and denoted CGPs. In a CGP,one can specify simple condition on each edge such that the exists if only is satisfied. show CGPs allow us catch missing links, increase expressivity functional dependencies, provide succinct representation patterns. settle complexity their consistency, matching, incremental matching containment problems, in linear time,NP-complete,NP-complete p2-complete, respectively. These tell...
This paper proposes an approach for entity resolution (ER) and conflict (CR) in large-scale graphs. It is based on a class of Graph Cleaning Rules (GCRs), which support the primitives relational data cleaning rules, may embed machine learning classifiers as predicates. As opposed to previous graph GCRs are defined with dual pattern accommodate irregular structures schemaless graphs, adopt patterns star form reduce complexity. We show that satisfiability, implication validation problems all...
Transient receptor potential ankyrin 1 (TRPA1), the non-selective cation channel, was found that can mediate generation of multiple sclerosis, while mechanism is still controversial. Lysophosphatidylcholine (LPC) a critical trigger sclerosis which results from syndrome neuronal inflammation and demyelination. In this work, we suggested TRPA1 LPC-induced oxidative stress cytotoxicity in OLN-93 oligodendrocyte. The expression detected by using quantitative real-time PCR (qRT-PCR)...
Incremental algorithms are important to dynamic graph analyses, but hard write and analyze. Few incremental in place, even fewer offer performance guarantees.
Dravet syndrome is a neurological disorder characterized by treatment-resistant polymorphic seizures, primarily caused loss-of-function in the SCN1A gene. To develop an vitro model of this disease, previously study we generated induced pluripotent stem cell line from 10-year-old boy carrying NM_001165963.1:c.5768A to G (Q1923R) mutation SCN1A. Using TALEN-mediated genome editing, have now isogenic control which disease-causing found epilepsy patient iPSCs was corrected, order eliminate...
The seabed topography near small islands is usually in rugged state, and the water depth can be changed from tens of meters to a few short distance. Far field waves advancing towards shore will experience complex evolution on shoal. In this environment, hydrodynamic responses floating structure largely affected by non-uniform wave pattern changeable seabed, might quite different general platform with uniform depth. paper, coupling model structures reef was established; incident force,...
The saline soils in the Yellow River Delta are primarily affected by seawater intrusion resulting from intersection of land and sea, which exhibit variations salinity. quality these is also influenced anthropogenic reclamation, including fertilization irrigation. This study quantitatively evaluated distinct soil properties characteristics during different reclamation years (i.e., 0a, 1a, 5a, 10a, 20a) using principal component analysis (PCA) Soil Quality Index (SQI). findings indicated that...