Author: Anton Shapkin (aa.shapkin.aa@gmail.com)
Scientific adviser: Akim Tsvigun
This project aims to develop a tool for detecting AI-generated text and code. With the advent of ChatGPT and continued advances in AI, distinguishing human-written from machine-generated text is becoming increasingly difficult. A generative model can return any sequence of tokens, and therefore any text that can be expressed in its vocabulary. Even OpenAI is actively working on this problem, but achieving high accuracy in the general case remains difficult.
The goal is to develop and implement a classification model that identifies AI-generated texts, in order to improve the accuracy of information.
This dataset consists of approximately 10,000 essays written by humans and various generative models. It includes a prompt (instruction) for each essay. The goal is to identify whether a given text X, corresponding to prompt P, was generated by an AI or written by a human.
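The task formulation above can be sketched as a simple record type. This is only an illustration: the class name `EssaySample`, the `source` field, and the toy strings are assumptions and do not reflect the dataset's actual schema. The label convention (1 for AI-generated, 0 for human-written) follows the one used for the bot output in the plan.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class EssaySample:
    prompt: str  # P: the instruction the essay responds to
    text: str    # X: the essay itself
    label: int   # assumed convention: 1 = AI-generated, 0 = human-written
    source: str  # hypothetical field: "human" or the generating model's name


def label_counts(samples: Iterable[EssaySample]) -> Counter:
    """Count samples per label, e.g. to check class balance."""
    return Counter(s.label for s in samples)


# Toy examples, invented for illustration only.
samples = [
    EssaySample("Describe your hometown.", "toy human essay", 0, "human"),
    EssaySample("Describe your hometown.", "toy generated essay", 1, "some-model"),
]
counts = label_counts(samples)
```

Keeping the prompt next to the text lets a detector condition on P as well as X, which matches the way the task is stated.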
In the GHOSTBUSTER paper (https://arxiv.org/pdf/2305.15047.pdf), the researchers introduced three new datasets: a writing dataset (based on the subreddit r/WritingPrompts), a news dataset (based on Reuters), and a student essay dataset (based on IvyPanda). Note that the legality of the writing dataset may now be in question.
Dataset #3: from the DetectGPT paper (https://arxiv.org/pdf/2301.11305.pdf)
In this study, the researchers did not provide their evaluation dataset but described the methodology for its generation.
A review of the relevant literature reveals two weaknesses. First, there is no standardized benchmark: each method is evaluated on its own dataset. Second, existing datasets contain only English text, whereas current models are also proficient in other languages and in code. Our plan therefore includes:
- Benchmark Preparation (Approximate Deadline: March 22)
  - Combine existing datasets
  - Generate new data for different languages: Russian, Chinese, German, French
  - Generate new data for code: Python, Java, Kotlin, C++
- Evaluate Baselines and Existing Methods (Approximate Deadline: April 22)
- Implement Our Approach (Optional)
- Create UI (Telegram Bot)
  - The bot will return a label for a given text: 1 if the text is AI-generated and 0 if it is written by a human
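
The evaluation and labeling steps of the plan could look roughly like the sketch below. It assumes each detector outputs a score where higher means "more likely AI-generated", and that AUROC is the comparison metric; the plan itself does not fix a metric, and the 0.5 threshold is an arbitrary placeholder. The function names `auroc` and `to_label` are hypothetical.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of (AI, human) pairs ranked correctly.

    Ties between an AI-generated and a human-written sample count as half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]  # AI-generated
    neg = [s for y, s in zip(labels, scores) if y == 0]  # human-written
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def to_label(score: float, threshold: float = 0.5) -> int:
    """Map a detector score to the bot's output: 1 = AI-generated, 0 = human."""
    return 1 if score >= threshold else 0


# Toy labels and scores, invented for illustration only.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
result = auroc(labels, scores)
predicted = [to_label(s) for s in scores]
```

The pairwise form is quadratic in the number of samples, which is acceptable for a benchmark of roughly 10,000 texts but would be replaced by a rank-based computation for larger sets. AUROC is threshold-free, so it compares detectors fairly, while the bot still needs a concrete threshold chosen on held-out data.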