NFDI4DS | UHH-SEMS - Publication Details

Haoliang Qi

ORCID: 0000-0003-1321-5820

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5042722970

Research Areas

Topic Modeling
Natural Language Processing Techniques
Web Data Mining and Analysis
Text and Document Classification Technologies
Academic integrity and plagiarism
Spam and Phishing Detection
Information Retrieval and Search Behavior
Network Security and Intrusion Detection
Advanced Text Analysis Techniques
Authorship Attribution and Profiling
Imbalanced Data Classification Techniques
Semantic Web and Ontologies
Recommender Systems and Techniques
Handwritten Text Recognition Techniques
Insect-Plant Interactions and Control
Rough Sets and Fuzzy Logic
Complex Network Analysis Techniques
Advanced Computational Techniques and Applications
Data Quality and Management
Artificial Intelligence in Law
Insect Resistance and Genetics
Insect and Pesticide Research
Data Management and Algorithms
Advanced Algorithms and Applications
Online Learning and Analytics

Foshan University
2019-2023

Heilongjiang Institute of Technology
2010-2019

State Key Laboratory of Digital Publishing Technology
2018-2019

Chinese Academy of Agricultural Sciences
2016-2018

Institute of Plant Protection
2016-2018

Harbin Institute of Technology
2004-2017

Shandong Provincial Hospital
2017

Shandong University
2017

Shandong Agricultural University
2016

Harbin Engineering University
2015

Resistance selection of indoxacarb in Helicoverpa armigera (Hübner) (Lepidoptera: Noctuidae): cross‐resistance, biochemical mechanisms and associated fitness costs

OPENALEX - Publications

Li Cui Qinqin Wang Haoliang Qi Qiyuan Wang Huizhu Yuan and 1 more

The cotton bollworm Helicoverpa armigera is a worldwide insect pest with the ability to develop resistance many insecticides. Indoxacarb, sodium channel blocker, an important insecticide that used control H. armigera. Cross-resistance, metabolic mechanisms and life history traits were established for indoxacarb-selected (IND-SEL) population of armigera.After 11 generations selection, susceptibility indoxacarb was decreased by 4.43-fold estimated realized heritability (h2 ) only 0.072....

10.1002/ps.5056 article EN Pest Management Science 2018-04-30

Linear discriminant model for information retrieval

OPENALEX - Publications

Jianfeng Gao Haoliang Qi Xinsong Xia Jian‐Yun Nie

This paper presents a new discriminative model for information retrieval (IR), referred to as linear discriminant (LDM), which provides flexible framework incorporate arbitrary features. LDM is different from most existing models in that it takes into account variety of linguistic features are derived the component HMM widely used language modeling approaches IR. Therefore, means melding and generative We present two algorithms parameter learning LDM. One optimize average precision (AP)...

10.1145/1076034.1076085 article EN 2005-08-15

Cycloxaprid: A novel cis-nitromethylene neonicotinoid insecticide to control imidacloprid-resistant cotton aphid (Aphis gossypii)

OPENALEX - Publications

Li Cui Haoliang Qi Daibin Yang Huizhu Yuan Changhui Rui

10.1016/j.pestbp.2016.02.005 article EN Pesticide Biochemistry and Physiology 2016-03-26

aCat: Automatically Choosing Anchor Tokens in Prompt for Natural Language Understanding

OPENALEX - Publications

Zhanhong Ye Leilei Kong Haoliang Qi

<title>Abstract</title> P-tuning has demonstrated that anchor tokens are beneficial for improving the performance of downstream tasks. However, manual selection manually may result in subjective or suboptimal results. In this paper, we present aCat to automatically select tokens. Following framework soft-hard prompt paradigm, achieves automatic template construction. Experiments conducted on natural language understanding benchmarks demonstrate effectiveness our proposed method. On seven...

10.21203/rs.3.rs-5213761/v1 preprint EN Research Square (Research Square) 2025-04-15

Combination of VSM and Jaccard coefficient for external plagiarism detection

OPENALEX - Publications

Shuai Wang Haoliang Qi Leilei Kong Cuixia Nu

Detailed comparison is one important sub-task of external plagiarism detection. Seed heuristic between two documents often used in this task. Vector space model (VSM) and Jaccard coefficient are commonly VSM can produce high recall performance; precision performance. In paper, we propose a hybrid similarity measure on the basis fitting function optimal dividing line none-plagiarism where integrates into unified one, our method make full use advantage coefficient, it extract more reasonable...

10.1109/icmlc.2013.6890902 article EN International Conference on Machine Learning and Cybernetics 2013-07-01

Weed control effect of unmanned aerial vehicle (UAV) application in wheat field

OPENALEX - Publications

Yin Chen Haoliang Qi Guangze Li Yubin Lan

Wheat is a major food source throughout the world. Â However, biological factors like pests and weeds can lead to lower crop yield.Â Most protection nowadays involves pesticide herbicides application.Â This commonly conducted with knapsack in China, which inefficient high labor intensive.Â Unmanned aerial vehicle (UAV) are an spraying technology recently-developed.Â Using UAV application more flexible standardized, efficiency 60 times than sprayer.Â weed management using still challenge.Â...

10.33440/j.ijpaa.20190202.45 article EN International Journal of Precision Agricultural Aviation 2018-01-01

Extending BLEU Evaluation Method with Linguistic Weight

OPENALEX - Publications

Muyun Yang Junguo Zhu Jufeng Li Lixin Wang Haoliang Qi and 2 more

BLEU is one of the most popular metrics for automatic evaluation machine translation quality. Focusing on its ignorance different effects various units upon quality, this paper extends proper weights to words and n-grams in framework BLEU. The linear regression method adopted capture human perception quality via word types n-gram length. Compared with other linguistic-rich based learning, proposed approach simple largely preserves BLEUpsilas advantage language independence. Experimental...

10.1109/icycs.2008.362 article EN 2008-11-01

Detecting High Obfuscation Plagiarism: Exploring Multi-Features Fusion via Machine Learning

OPENALEX - Publications

Leilei Kong Zhimao Lu Haoliang Qi Zhongyuan Han

Providing effective methods of identification high-obfuscation plagiarism seeds presents a significant research problem in the field detection. The conventional detection are based on single type features to capture seeds. But for detection, these not sufficient identifying effectively because varied used plagiarism. This paper multi-features fusion method highobfuscation identification. exploits Logical Regression model integrate lexicon features, syntax semantics and structure which...

10.14257/ijunesst.2014.7.4.35 article EN International Journal of u- and e- Service Science and Technology 2014-08-31

Prediction of Users Retweet Times In Social Network

OPENALEX - Publications

Haihao Yu Xu Bai Chengzhe Huang Haoliang Qi

In view of the fact that propagation path topology cannot effectively deal with complex social network consists hundreds millions users.More researchers choose to use machine learning methods complete retweet prediction.Those classification method judge whether a message will be retweeted or not.This paper argues prediction should regression analysis problem, not just problem.Through collecting user characteristics on Twitter and selecting some features which have an important impact...

10.14257/ijmue.2015.10.5.29 article EN International Journal of Multimedia and Ubiquitous Engineering 2015-05-31

The Improved Logistic Regression Models for Spam Filtering

OPENALEX - Publications

Yong Han Muyun Yang Haoliang Qi Xiaoning He Sheng Li

The logistic regression model has achieved success in spam filtering. But it is disadvantaged by the equal adjustment of feature weights appeared both messages and ham ones during training period. This paper presents an improved which reduces impact features appearing ones. Byte level n-grams are employed to extract from messages, TONE (train on or near error) adopted, proved effective state-of-the-art filtering system. official runs CEAS (Conference email anti-spam) spam-filter Challenge...

10.1109/ialp.2009.74 article EN International Conference on Asian Language Processing 2009-12-01

Subcategorization acquisition and evaluation for Chinese verbs

OPENALEX - Publications

Xiwu Han Tiejun Zhao Haoliang Qi Hao Yu

This paper describes the technology and an experiment of subcategorization acquisition for Chinese verbs.The SCF hypotheses are generated by means linguistic heuristic information filtered via statistical methods.Evaluation on 20 multi-pattern verbs shows that our achieved similar precision recall with former researches.Besides, simple application acquired lexicon to a PCFG parser indicates great potentialities in fields NLP.

10.3115/1220355.1220459 article EN 2004-01-01

Predicting query potential for personalization, classification or regression?

OPENALEX - Publications

Chen Chen Muyun Yang Sheng Li Tiejun Zhao Haoliang Qi

The goal of predicting query potential for personalization is to determine which queries can benefit from personalization. In this paper, we investigate kind strategy better task: classification or regression. We quantify the benefits personalizing search results using two implicit click-based measures: Click entropy and Potential@N. Meanwhile, are characterized by features history features. Then build C-SVM model epsilon-SVM regression respectively according these measures. experimental...

10.1145/1835449.1835585 article EN 2010-07-19

Re-examination on lam% in spam filtering

OPENALEX - Publications

Haoliang Qi Muyun Yang Xiaoning He Sheng Li

Logistic average misclassification percentage (lam%) is a key measure for the spam filtering performance. This paper demonstrates that filter can achieve perfect 0.00% in lam%, minimal value theory, by simply setting biased threshold during classifier modeling. At same time, overall classification performance reaches only low accuracy. The result suggests role of lam% evaluation should be re-examined.

10.1145/1835449.1835601 article EN 2010-07-19

A Ranking-Based Text Matching Approach for Plagiarism Detection

OPENALEX - Publications

Leilei Kong Zhongyuan Han Haoliang Qi Zhimao Lu

This paper addresses the issue of text matching for plagiarism detection. task aims at identifying segments in a pair suspicious document and its source document. All time, heuristic-based methods are mainly utilized to resolve this problem. But heuristics rely on experts' experiences fail integrate more features detect high obfuscation matches. In paper, statistical machine learning approach, named Ranking-based Text Matching Approach Plagiarism Detection, is proposed deal with issues The...

10.1587/transfun.e101.a.799 article EN IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences 2018-04-30

Source Retrieval Model Focused on Aggregation for plagiarism detection

OPENALEX - Publications

Leilei Kong Zhongyuan Han Haoliang Qi Muyun Yang

10.1016/j.ins.2019.07.015 article EN Information Sciences 2019-07-04

A Hybrid Model for Microblog Real‐Time Filtering

OPENALEX - Publications

Zhongyuan Han Muyun Yang Leilei Kong Haoliang Qi Li Sheng

The task of real-time microblog filtering is to decide if the subsequently posted tweets are relevant a given query representing special information needs. filters based on retrieval model or text classification main solutions for this task. To best exploit strengths two models, hybrid using as prior knowledge rectify hyperplane proposed. incorporates language and logistic regression model. Evaluated Text RetriEval Conference (TREC) 2012 track dataset, experimental results show that proposed...

10.1049/cje.2016.05.007 article EN Chinese Journal of Electronics 2016-05-01

The Chinese-English Bilingual Sentence Alignment Based on Length

OPENALEX - Publications

Ding Hua-fu Lili Quan Haoliang Qi

Bilingual sentence pairs are key resource for statistical machine translation. Currently, most of the alignment corpus is between English and French or German. And there little specialized dataset Chinese. So our aim to create large-scale, high-precision English-Chinese aligned sentences. Length based method used align bilingual paragraphs which were extracted from CNKI (China National Knowledge Infrastructure). one largest academic website, contains huge Chinese-English paragraph. Our...

10.1109/ialp.2011.70 article EN International Conference on Asian Language Processing 2011-11-01

High obfuscation plagiarism detection using multi-feature fusion based on Logical Regression model

OPENALEX - Publications

Leilei Kong Zhimao Lu Haoliang Qi Zhongyuan Han

The identification of high-obfuscation plagiarism seeds is one the most difficult problems to be solved in detection. Single feature type cannot identify effectively because varied methods used plagiarism. In this paper, a multi-features fusion method based on Logical Regression model for was proposed. This combine lexicon features, syntax semantics features and structure extracted from suspicious text fragments pairs. Experiments show that feasible effective.

10.1109/iccsnt.2015.7490768 article EN 2015-12-01

Coming Soon ...