- Blockchain Technology Applications and Security
- Advanced Data Storage Technologies
- Cloud Computing and Resource Management
- Distributed Systems and Fault Tolerance
- Scientific Computing and Data Management
- Caching and Content Delivery
- Sentiment Analysis and Opinion Mining
- Advanced Text Analysis Techniques
- Cloud Data Security Solutions
- Cryptography and Data Security
- Research Data Management Practices
- Spam and Phishing Detection
- Topic Modeling
- Machine Learning and ELM
- Data Quality and Management
- Complex Systems and Time Series Analysis
- Face and Expression Recognition
- Web Data Mining and Analysis
- Security and Verification in Computing
- Distributed and Parallel Computing Systems
- Neural Networks and Applications
National University of Singapore
2018-2022
Agency for Science, Technology and Research
2016
The success of Bitcoin and other cryptocurrencies has brought enormous interest to blockchains. A blockchain system implements a tamper-evident ledger for recording transactions that modify some global states. The system captures the entire evolution history of the states. The management of that history, also known as data provenance or lineage, has been studied extensively in database systems. However, querying existing blockchains can only be done by replaying all transactions. This approach is applicable to large-scale, offline...
Existing data storage systems offer a wide range of functionalities to accommodate an equally diverse range of applications. However, new classes of applications have emerged, e.g., blockchain and collaborative analytics, featuring versioning, fork semantics, tamper-evidence or any combination thereof. They present opportunities for storage systems to efficiently support such applications by embedding the above requirements into the storage. In this paper, we present ForkBase, a storage engine designed for forkable applications. By integrating core application properties...
Smart contracts have enabled blockchain systems to evolve from simple cryptocurrency platforms to general transactional systems. A new architecture called execute-order-validate has been proposed in Hyperledger Fabric to support parallel transactions. However, this architecture might render many transactions invalid when serializing them. This problem is further exacerbated as the block formation rate is inherently limited due to other factors besides data processing, such as cryptography and consensus. Inspired by...
Blockchain has come a long way: a system that was initially proposed specifically for cryptocurrencies is now being adapted and adopted as a general-purpose transactional system. As blockchain evolves into another data management system, the natural question is how it compares against distributed database systems. Existing works on this comparison focus on high-level properties, such as security and throughput. They stop short of showing how the underlying design choices contribute to the overall differences. Our work...
With the emergence of hybrid blockchain database systems, we aim to provide an in-depth analysis of the performance and trade-offs among a few representative systems. To achieve this goal, we implement Veritas and BlockchainDB from scratch. For Veritas, we provide two flavors to target crash fault-tolerant (CFT) and Byzantine fault-tolerant (BFT) application scenarios. Specifically, we implement Veritas with Apache Kafka to target CFT scenarios, and Veritas with Tendermint to target BFT scenarios. We compare these three systems with the existing open-source implementation of BigchainDB. BigchainDB uses Tendermint...
We present LedgerView, a system that adds access control views to permissioned blockchains. The approach is motivated by an AT&T application of tracking refurbished devices. A blockchain is a decentralized tamper-resistant ledger managed by a group of peers. It is used in many applications for storing and sharing sensitive information, e.g., monetary transactions, health records, personal documents, etc. But in a blockchain, all the peers see all the stored information, while in some applications, access to information should be limited, that is,...
Existing data storage systems offer a wide range of functionalities to accommodate an equally diverse range of applications. However, new classes of applications have emerged, e.g., blockchain and collaborative analytics, featuring versioning, fork semantics, tamper-evidence or any combination thereof. They present opportunities for storage systems to efficiently support such applications by embedding the above requirements into the storage. In this paper, we present ForkBase, a storage engine specifically designed to provide efficient support for forkable applications. By integrating...
The success of Bitcoin and other cryptocurrencies has brought enormous interest to blockchains. A blockchain system implements a tamper-evident ledger for recording transactions that modify some global states. The system captures the entire evolution history of the states. The management of that history, also known as data provenance or lineage, has been studied extensively in database systems. However, querying existing blockchains can only be done by replaying all transactions. This approach is feasible for large-scale, offline...
Sentiment analysis is one of the most popular natural language processing techniques. It aims to identify the sentiment polarity (positive, negative, neutral or mixed) within a given text. Proper lexicon knowledge is very important for lexicon-based methods, since they hinge on using the polarity of each lexical item to determine a text's polarity. However, it is quite common that some lexical items appear positive in the text of one domain but negative in another. In this paper, we propose an innovative building algorithm extract through...
Today's storage systems expose abstractions which are either so low-level (e.g., key-value store, raw-block store) that they require developers to reinvent the wheel, or so high-level (e.g., relational databases, Git) that they lack the generality to support many classes of applications. In this work, we propose and implement a general distributed data storage system, called UStore, which has rich semantics. UStore delivers three key properties, namely immutability, sharing and security, which unify and add value to today's applications, and also...
Data collaboration activities typically require systematic or protocol-based coordination to be scalable. Git, an effective enabler for collaborative coding, has been attested for its success in countless projects around the world. Hence, applying the Git philosophy to general data collaboration beyond coding is motivating. We call it Git for data. However, the original Git design handles data at the file granule, which is considered too coarse-grained for many database applications. We argue that Git for data should be co-designed with database systems. To this end, we...
Blockchain has come a long way: a system that was initially proposed specifically for cryptocurrencies is now being adapted and adopted as a general-purpose transactional system. As blockchain evolves into another data management system, the natural question is how it compares against distributed database systems. Existing works on this comparison focus on high-level properties, such as security and throughput. They stop short of showing how the underlying design choices contribute to the overall differences. Our work...
Smart contracts have enabled blockchain systems to evolve from simple cryptocurrency platforms, such as Bitcoin, to general transactional systems, such as Ethereum. Catering for emerging business requirements, a new architecture called execute-order-validate has been proposed in Hyperledger Fabric to support parallel transactions and improve the blockchain's throughput. However, this new architecture might render many transactions invalid when serializing them. This problem is further exacerbated as the block formation rate is inherently...