Feature-based detection of automated language models: tackling GPT-2, GPT-3 and Grover

Feature (linguistics)

DOI: 10.7717/peerj-cs.443
Publication Date: 2021-04-06T07:54:11Z

AUTHORS (2)

Leon Fröhling

Arkaitz Zubiaga

ABSTRACT

The recent improvements of language models have drawn much attention to potential cases of use and abuse of automatically generated text. Great effort is put into the development of methods to detect machine generations among human-written text in order to avoid scenarios in which the large-scale generation of text with minimal cost undermines the trust in human interaction and factual information online. While most current approaches rely on the availability of expensive language models, we propose a simple feature-based classifier for the detection problem, using carefully crafted features that attempt to model intrinsic differences between human and machine text. Our research contributes to the field by producing a detection method that achieves performance competitive with far more expensive methods, offering an accessible “first line-of-defense” against the abuse of language models. Furthermore, our experiments show that different sampling methods lead to different types of flaws in generated text.
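To make the idea of a feature-based detector concrete, here is a minimal sketch in Python. It is not the authors' method: the three features (type-token ratio, mean sentence length, share of repeated bigrams) and the nearest-centroid rule are assumptions chosen only for illustration, and the names `extract_features`, `centroid` and `classify` are hypothetical.

```python
import re
from collections import Counter


def extract_features(text):
    """Return three illustrative stylometric features for a text.

    These features are assumptions for this sketch, not the feature
    set used in the paper.
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return [0.0, 0.0, 0.0]
    bigrams = list(zip(tokens, tokens[1:]))
    counts = Counter(bigrams)
    # Number of bigram occurrences that belong to a bigram seen more than once.
    repeated = sum(c for c in counts.values() if c > 1)
    return [
        len(set(tokens)) / len(tokens),        # type-token ratio
        len(tokens) / max(len(sentences), 1),  # mean sentence length in tokens
        repeated / max(len(bigrams), 1),       # share of repeated bigrams
    ]


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def classify(text, human_centroid, machine_centroid):
    """Label a text by the nearer class centroid (squared Euclidean distance)."""
    features = extract_features(text)

    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    if squared_distance(features, machine_centroid) < squared_distance(features, human_centroid):
        return "machine"
    return "human"
```

In use, one would compute `centroid` over the feature vectors of labelled human and machine texts and pass both centroids to `classify`. The features here are left unscaled, so mean sentence length dominates the distance; a realistic setup would standardize the features and use a trained classifier.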

REFERENCES (50)

CITATIONS (61)
