Advancing LLM detection in the ALTA 2024 Shared Task: Techniques and Analysis

doi:10.48550/arXiv.2412.19076

Advancing LLM detection in the ALTA 2024 Shared Task: Techniques and Analysis

Galat, Dima

The recent proliferation of AI-generated content has prompted significant interest in developing reliable detection methods. This study explores techniques for identifying AI-generated text through sentence-level evaluation within hybrid articles. Our findings indicate that ChatGPT-3.5 Turbo exhibits distinct, repetitive probability patterns that enable consistent in-domain detection. Empirical tests show that minor textual modifications, such as rewording, have minimal impact on detection accuracy. These results provide valuable insights for advancing AI detection methodologies, offering a pathway toward robust solutions to address the complexities of synthetic text identification.

Publication:

arXiv e-prints

Pub Date:

December 2024

DOI:

10.48550/arXiv.2412.19076

arXiv:

arXiv:2412.19076

Bibcode:

2024arXiv241219076G

Keywords:

Computer Science - Computation and Language

E-Print:

ALTA 2024

ADS

Advancing LLM detection in the ALTA 2024 Shared Task: Techniques and Analysis

Abstract