- Advanced Text Analysis Techniques
- Natural Language Processing Techniques
- Topic Modeling
- Sentiment Analysis and Opinion Mining
- Biomedical Text Mining and Ontologies
- Text and Document Classification Technologies
- Genetics, Bioinformatics, and Biomedical Research
- Traffic Prediction and Management Techniques
- Genomics and Phylogenetic Studies
- Advanced Steganography and Watermarking Techniques
- Data Mining Algorithms and Applications
- Machine Learning and Algorithms
- Evaluation Methods in Various Fields
- Hate Speech and Cyberbullying Detection
- Semantic Web and Ontologies
- Chaos-based Image/Signal Encryption
- Transportation Planning and Optimization
- Human Mobility and Location-Based Analysis
- Social Media and Politics
- Machine Learning in Bioinformatics
- Advanced Control Systems Optimization
- Machine Learning and Data Classification
- Data Quality and Management
- Digital Filter Design and Implementation
- Advanced Computational Techniques and Applications
Beijing Institute of Technology
2010-2024
Northwestern Polytechnical University
2024
Harbin Institute of Technology
2024
Beijing Institute of Genomics
2012-2013
Chinese Academy of Sciences
2012-2013
Zhejiang Gongshang University
2010
The web is being loaded daily with a huge volume of data, mainly unstructured textual which increases the need for information extraction and NLP systems significantly. Named‐entity recognition task key step towards efficiently understanding text data saving time effort. Being widely used language globally, English taking over most research conducted in this field, especially biomedical domain. Unlike other languages, Arabic suffers from lack resources. This work presents BERT‐based model to...
The car‐sharing system is a popular rental model for cars in shared use. It has become particularly attractive due to its flexibility; that is, the car can be rented and returned anywhere within one of authorized parking slots. main objective this research work predict usage stations investigate factors help improve prediction. Thus, new strategies designed make more on road fewer stations. To achieve that, various machine learning models, namely vector autoregression (VAR), support...
Rice is the most important staple food for a large part of world's human population and also key model organism biological studies crops as well other related plants. Here we present RiceWiki (http://ricewiki.big.ac.cn), wiki-based, publicly editable open-content platform community curation rice genes. Most existing databases are based on expert curation; with exponentially exploding volume knowledge relevant data, however, becomes increasingly laborious time-consuming to keep up-to-date,...
Politics is one of the hottest and most commonly mentioned viewed topics on social media networks nowadays. Microblogging platforms like Twitter Weibo are widely used by many politicians who have a huge number followers supporters those platforms. It essential to study supporters’ network political leaders because it can help in decision making when predicting their futures. This focuses three famous Pakistan, namely, Imran Khan (IK), Maryam Nawaz Sharif (MNS), Bilawal Bhutto Zardari (BBZ)....
Context . Social media platforms such as Facebook and Twitter carry a big load of people’s opinions about politics leaders, which makes them good source information for researchers to exploit different tasks that include election predictions. Objective Identify, categorize, present comprehensive overview the approaches, techniques, tools used in predictions on Twitter. Method Conducted systematic mapping study (SMS) provided empirical evidence work published between January 2010 2021....
Sentiment classification is an important data mining task. Previous researches tried various machine learning techniques while didn't make fully use of the difference among features. This paper proposes a novel method for improving sentiment by exploring different contribution The consists two parts. First, we highlight sentimental features increasing their weight. Second, bagging to construct multiple classifiers on feature spaces and combine them into aggregating classifier. Extensive...
Abstract Summary: Community curation—harnessing community intelligence in knowledge curation, bears great promise dealing with the flood of biological knowledge. To exploit full potential scientific for multiple wikis (bio-wikis) have been built to date. However, none them achieved a substantial impact on curation. One major limitations bio-wikis is insufficient participation, which intrinsically because lack explicit authorship and thus no credit increase curation bio-wikis, here we develop...
Named Entity Disambiguation (NED) refers to the task of resolving multiple named entity mentions in an input-text sequence their correct references a knowledge graph. We tackle NED problem by leveraging two novel objectives for pre-training framework, and propose model. Especially, proposed model consists of: (i) concept-enhanced pre-training, aiming at identifying valid lexical semantic relations with concept constraints derived from external resource Probase; (ii) masked language model,...
Based on the analysis of existing opinion mining techniques, accuracy and complexity, this paper proposes a system which mines useful information from camera reviews by utilizing Semantic Role Labeling (SRL) polarity computing method. Feature lexicon sentiment are constructed to mine features emotional items. In end, comparison between positive negative presented visually. Our experimental results show is feasible effective.
Harnessing community intelligence in knowledge curation bears significant promise dealing with communication and education the flood of scientific knowledge. As is accumulated at ever-faster rates, nomenclature, a particular kind knowledge, concurrently generated all kinds fields. Since nomenclature system terms used to name things discipline, accurate translation different languages critical importance, not only for communications collaborations English-speaking people, but also...
With the rapid growth of online news services, users can actively respond to by making comments. Users often express subjective emotions in comments such as sadness, surprise and anger. Such help understand preferences perspectives individual users, therefore may facilitate publishers provide with more relevant services. This paper tackles task predicting for news. To best our knowledge, this is first research work addressing task. In particular, proposes a novel Meta classification approach...
The rapidly growing data in many areas, as well the biomedical domain, require assistance of information extraction systems to acquire much needed knowledge about specific entities such proteins, drugs, or diseases practically within a short time. Annotated corpora serve purpose facilitating process building NLP systems. While colossal work has been done this area for English language, other languages like Arabic seem lack these resources, especially healthcare area. Therefore, work, we...
In this paper, we propose a novel method that performs Cross Language Text Categorization (CLTC) from the perspective of Information Retrieval. We present an input document in target language form query source language. Then retrieve training documents and find K most relevant results. At last, use class labels results to predict document. The only external resource required by our is bilingual dictionary. Experimental show gives promising performance, which better than translation-based method.
This paper proposes a novel approach for multi-document summarization based on subtopic segmentation. It firstly detects the subtopics in topic, and then finds central sentence each subtopic. The sentences are scored their importance document Two anti-redundancy strategies used to extract form summarization. Since our is intrinsically incremental, it effective when new documents added set. Experimental results indicate that proposed efficient.
Car-sharing systems require accurate demand prediction to ensure efficient resource allocation and scheduling decisions. However, developing precise predictive models for vehicle remains a challenging problem due the complex spatio-temporal relationships. This paper introduces USTIN, Unified Spatio-Temporal Inference Prediction Network, novel neural network architecture prediction. The model consists of three key components: temporal feature unit, spatial unit. unit utilizes historical data...
<title>Abstract</title> In recent years, nonlinear science, particularly chaotic systems, has garnered significant research interest. Cross-coupled maplattices (CCML) represent a typical spatiotemporal structure, commonly utilized in data security fields such as hashfunctions. However, traditional CCMLs have limited parameter range, and their effects need enhancement,restricting application cryptography. This paper proposes an improved signal-enhanced cross-coupled map lattice(SE-CCML),...
Given a set of labels, multi-label text classification (MLTC) aims to assign multiple relevant labels for text. Recently, deep learning models get inspiring results in MLTC. Training high-quality MLTC model typically demands large-scale labeled data. And comparing with annotations single-label data samples, samples are more time-consuming and expensive. Active can enable achieve optimal prediction performance using fewer samples. Although active has been considered models, there few studies...