- Topic Modeling
- Natural Language Processing Techniques
- Genetic Mapping and Diversity in Plants and Animals
- Academic integrity and plagiarism
- Authorship Attribution and Profiling
- Advanced Text Analysis Techniques
- Web Data Mining and Analysis
- Spam and Phishing Detection
- Imbalanced Data Classification Techniques
- Artificial Intelligence in Law
- Text and Document Classification Technologies
- Software Engineering Research
- Hate Speech and Cyberbullying Detection
- Software Engineering Techniques and Practices
- Data Quality and Management
- Advanced Software Engineering Methodologies
- Digital Humanities and Scholarship
- Handwritten Text Recognition Techniques
- Semantic Web and Ontologies
- Advanced Vision and Imaging
- Advanced Algorithms and Applications
- Caching and Content Delivery
- Recommender Systems and Techniques
- Image and Video Stabilization
- Big Data Technologies and Applications
Foshan University (2020-2023)
Rice Research Institute (2023)
Guangdong Academy of Agricultural Sciences (2023)
South China Agricultural University (2022)
Ministry of Agriculture and Rural Affairs (2022)
Heilongjiang Institute of Technology (2008-2019)
Harbin Engineering University (2014-2017)
Attribute extraction refers to finding the attributes for instances of a given semantic class, which is essential to enhancing the schema of a knowledge graph. To facilitate attribute extraction from query logs, this article proposes a pattern-driven graph ranking approach that jointly employs pattern and context distribution information. First, simple patterns are applied to the text to automatically acquire seed attributes. Then, a graph-based weight propagation is designed to rank the patterns with a graph ranking algorithm. Experimental results show that, on a Chinese query log collected...
Detailed comparison is one important sub-task of external plagiarism detection. A seed heuristic between two documents is often used in this task. The vector space model (VSM) and the Jaccard coefficient are commonly used. VSM can produce high recall performance; the Jaccard coefficient can produce high precision performance. In this paper, we propose a hybrid similarity measure on the basis of a fitting function of the optimal dividing line between plagiarism and non-plagiarism, where the measure integrates VSM and the Jaccard coefficient into a unified one. Because our method can make full use of the advantage of VSM and the Jaccard coefficient, it can extract more reasonable...
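The two seed heuristics named in this abstract can be sketched as follows. This is a minimal illustration, not the paper's hybrid measure: the whitespace tokenizer, the raw term-frequency weighting, the `is_seed` rule and both thresholds are assumptions made for the example, and the fitted dividing-line function described in the abstract is not reproduced here.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenization (an assumption; real systems preprocess more)."""
    return text.lower().split()


def cosine_similarity(a_tokens, b_tokens):
    """VSM score: cosine between raw term-frequency vectors."""
    a, b = Counter(a_tokens), Counter(b_tokens)
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def jaccard_coefficient(a_tokens, b_tokens):
    """Jaccard coefficient: shared distinct terms over all distinct terms."""
    a, b = set(a_tokens), set(b_tokens)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def is_seed(a_text, b_text, cos_threshold=0.5, jac_threshold=0.3):
    """Hypothetical combination rule: both scores must pass a fixed threshold.

    The thresholds are illustrative values, not taken from the paper.
    """
    a, b = tokenize(a_text), tokenize(b_text)
    return (cosine_similarity(a, b) >= cos_threshold
            and jaccard_coefficient(a, b) >= jac_threshold)
```

Cosine similarity rewards repeated shared terms and so tends to accept more pairs, while the Jaccard coefficient only counts distinct terms and is stricter, which matches the recall and precision contrast drawn in the abstract.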
Providing effective methods of identification of high-obfuscation plagiarism seeds presents a significant research problem in the field of plagiarism detection. The conventional methods of plagiarism detection are based on a single type of features to capture plagiarism seeds. But for high-obfuscation plagiarism detection, these features are not sufficient for identifying high-obfuscation plagiarism seeds effectively because of the varied methods used in plagiarism. This paper presents a multi-features fusion method for high-obfuscation plagiarism seeds identification. The method exploits a Logistic Regression model to integrate lexicon features, syntax features, semantics features and structure features which...
The indica rice variety XYXZ carries elite traits including appearance and eating quality. Here, we report the de novo assembly of XYXZ using Illumina paired-end whole-genome shotgun sequencing and Nanopore sequencing. We annotated 39,722 protein-coding genes in the 395.04 Mb assembly. In comparison to other cultivars, XYXZ showed a larger gene size of transcripts and introns, and more exons per gene. Hundreds of ultra-long genes were also detected. A total of 4362 complete LTRs were annotated; among them, many were located next to or...
This paper addresses the issue of text matching for plagiarism detection. The task aims at identifying the matching segments in a pair of a suspicious document and its source document. So far, heuristic-based methods are mainly utilized to resolve this problem. But the heuristics rely on experts' experiences and fail to integrate more features to detect high obfuscation plagiarism matches. In this paper, a statistical machine learning approach, named the Ranking-based Text Matching Approach for Plagiarism Detection, is proposed to deal with the issues. The...
The mechanized seed production of hybrid rice (Oryza sativa L.) represents significant progress in modern agriculture. However, the mechanized technologies and crop management strategies are still immature. The present study was conducted with three field experiments to explore the effects of different planting densities, the flight height of an agricultural unmanned aerial vehicle (AUAV) for assisting pollination, fertilization techniques, and the row-ratio of the restorer line and the sterile line on the yield of hybrid rice seed production. In experiment 1, densities...
The task of real-time microblog filtering is to decide if the subsequently posted tweets are relevant to a given query representing special information needs. The filters based on the retrieval model or the text classification model are the main solutions for this task. To best exploit the strengths of the two models, a hybrid filtering model using the retrieval model as prior knowledge to rectify the classification hyperplane is proposed. The hybrid model incorporates the language model and the logistic regression model. Evaluated on the Text REtrieval Conference (TREC) 2012 microblog filtering track dataset, the experimental results show that the proposed...
The identification of high-obfuscation plagiarism seeds is one of the most difficult problems to be solved in plagiarism detection. A single feature type cannot identify high-obfuscation plagiarism seeds effectively because of the varied methods used in plagiarism. In this paper, a multi-features fusion method based on a Logistic Regression model for high-obfuscation plagiarism seeds identification is proposed. The method combines lexicon features, syntax features, semantics features and structure features extracted from suspicious text fragment pairs. Experiments show that the method is feasible and effective.
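The fusion idea in this abstract can be illustrated with a small classifier over one score per feature family. This is a sketch under stated assumptions: the regression model is taken to be standard logistic regression, each fragment pair is reduced to four hypothetical scores in the order (lexicon, syntax, semantics, structure), and the training rows are made-up toy values, not data from the paper.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic_regression(rows, labels, lr=0.5, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_features = len(rows[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(rows, labels):
            error = sigmoid(bias + sum(w * v for w, v in zip(weights, x))) - y
            for i, v in enumerate(x):
                grad_w[i] += error * v
            grad_b += error
        weights = [w - lr * g / len(rows) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(rows)
    return weights, bias


def predict_proba(weights, bias, x):
    """Probability that a fragment pair is a plagiarism seed."""
    return sigmoid(bias + sum(w * v for w, v in zip(weights, x)))


# Toy, made-up feature rows: (lexicon, syntax, semantics, structure).
# High-obfuscation seeds are given low lexical overlap but higher other scores.
TOY_ROWS = [
    [0.2, 0.6, 0.8, 0.7], [0.3, 0.5, 0.9, 0.6], [0.1, 0.7, 0.7, 0.8],  # seeds
    [0.1, 0.2, 0.3, 0.1], [0.0, 0.1, 0.2, 0.2], [0.2, 0.3, 0.1, 0.1],  # non-seeds
]
TOY_LABELS = [1, 1, 1, 0, 0, 0]
```

Calling `train_logistic_regression(TOY_ROWS, TOY_LABELS)` and then `predict_proba` on a new row gives a single fused score, so a pair with weak lexical overlap can still be flagged when its syntax, semantics and structure scores are high.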
In recent years, use cases have been widely applied in software requirement engineering, and have proven particularly valuable as part of the requirements activities of the software process. Use cases play more and more important roles in some modern software processes and methods. Early aspects are defined as crosscutting concerns in the early life cycle phases including the requirements analysis, domain analysis and architecture design phases. A use case modeling approach which supports early aspects acquisition is proposed. It accepted the increment and iteration development ideas of the Unified Process....
Microblog retrieval has received much attention in recent years. In microblog retrieval, the content linked by URLs is one of the most important pieces of information in a microblog. We present a Hyperlink-extended retrieval model for microblog retrieval that combines the content of microblogs and the content of embedded hyperlinks' webpages using a probabilistic ranking function based on a language model. The Hyperlink-extended model incorporates users' information requirements and the author's expression needs. Using the standard TREC 2011 and 2012 microblog collection, various aspects of our model are evaluated. Results show...
This paper addresses the issue of source retrieval in plagiarism detection. The task of source retrieval is retrieving all plagiarized sources of a suspicious document from a source document corpus whilst minimizing retrieval costs. The classification-based methods achieved the best performance in the current research on source retrieval. This paper points out that it is more important to cast the problem as a ranking problem and employ learning to rank methods to perform source retrieval. Specifically, it employs RankBoost and Ranking SVM to obtain the candidate source documents. Experimental results on the dataset of PAN@CLEF 2013 Source Retrieval...
The problem of text plagiarism has increased because of the digital resources available on the World Wide Web. Source Retrieval and Text Alignment are two core tasks of plagiarism detection. A source retrieval and text alignment system based on a relevance ranking model is described in this paper. Not only the source retrieval task but also the text alignment task are all regarded as a process of information retrieval, and the relevance ranking model is used to search the plagiarism sources and obtain the candidate plagiarism seeds. For source retrieval, BM25 is used, while for text alignment, the Vector Space Model is exploited. Furthermore, a plagiarism detection system named HawkEyes is developed...
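The BM25 ranking mentioned in this abstract can be sketched as below. This is an illustrative Okapi BM25 scorer, not the HawkEyes implementation: the function name `bm25_scores`, the parameter values `k1=1.2` and `b=0.75`, and the smoothed IDF variant (which keeps term weights non-negative) are assumptions chosen for the example.

```python
import math
from collections import Counter


def bm25_scores(query_tokens, docs_tokens, k1=1.2, b=0.75):
    """Score every tokenized document against the query with Okapi BM25."""
    if not docs_tokens:
        return []
    n_docs = len(docs_tokens)
    avg_len = sum(len(d) for d in docs_tokens) / n_docs
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for d in docs_tokens for term in set(d))
    scores = []
    for doc in docs_tokens:
        tf = Counter(doc)
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            df = doc_freq[term]
            # Smoothed IDF; the +1 inside the log keeps it non-negative.
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores
```

In a source retrieval setting, the query tokens would come from a chunk of the suspicious document and the highest-scoring documents would be kept as candidate sources; how queries are formed is not specified in the abstract.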