Edward Tian was fast asleep when his bot broke a website.
The 22-year-old senior at Princeton spent his winter break in his local coffee shop creating GPTZero, an app that he claimed would be able to "quickly and efficiently" tell if an essay was written by a human or by OpenAI's ChatGPT. When he uploaded it to the app creating and hosting platform Streamlit, he didn't expect it to get that much attention.
"I was expecting, at most, a few dozen people trying out the app," Tian told The Daily Beast. "Suddenly, it was crazy in usership with over 2000 people signing up for the beta in a few hours."
GPTZero eventually saw such a massive influx of users that it even crashed the platform that was hosting it. "I'm awestruck that it blew up and went so viral," he added.
When OpenAI released ChatGPT on Nov. 30, 2022, it unleashed a digital Pandora's Box on the world.
Everyone from high school teachers to college professors to journalists feared the powerful AI chatbot had ushered in a new era of bot-generated essays and articles that some have dubbed "AIgiarism." Some educators have already begun reporting instances of their students using ChatGPT to create essays out of whole cloth and finish writing assignments.
While OpenAI has said that it eventually plans on implementing "watermarks" to verify whether or not something was created by ChatGPT, there's still no official method of doing so, which can create a giant bot-sized headache across sectors like education and journalism.
Tian, who's pursuing a double major in computer science and journalism, was bothered by ethical dilemmas posed by chatbots as well as what he described as the "black box" nature of large language models like ChatGPT. The opaque nature of the models results in people fundamentally misunderstanding and, therefore, misusing them.
So, even though he is on the cusp of graduating, he decided to spend his winter break building a tool that could help people find out whether or not a piece of writing was likely written by a bot.
"Humans deserve to know when the writing isn't human," Tian said. "There's so much hype around ChatGPT and AI generation lately, that humans deserve to know the truth."
GPTZero uses two metrics to assess whether or not a text has been penned by a bot: perplexity and burstiness. Texts placed into the app are assigned a number for each metric. The lower the numbers, the higher the likelihood that the text was created by a bot.
Perplexity is a measurement of randomness in a sentence. If a sentence is constructed or uses words in a way that surprises the app, then it will score higher in perplexity. Tian said that he used the free and open source GPT-2 to help train his app for this metric.
Burstiness captures how much that randomness varies across all the sentences in a text. For example, human writing tends to have sentences that vary in complexity. Some are simple. Some can give James Joyce a run for his money. Bots, on the other hand, tend to generate sentences that are relatively low in complexity throughout the entire text.
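For readers who want to see the two metrics in concrete terms, here is a minimal Python sketch. It is not GPTZero's code. Tian said he used GPT-2 for the perplexity metric; this sketch swaps in a toy word-frequency model so that it runs without any downloads. The names `UnigramModel`, `tokenize`, `split_sentences` and `burstiness` are hypothetical, and treating burstiness as the standard deviation of per-sentence perplexity is an assumption made for illustration, not a formula confirmed by Tian.

```python
import math
import re
from collections import Counter
from statistics import pstdev


def tokenize(text: str) -> list[str]:
    """Lowercase the text and pull out simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def split_sentences(text: str) -> list[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


class UnigramModel:
    """Toy stand-in for a real language model such as GPT-2 (assumption)."""

    def __init__(self, reference_text: str) -> None:
        self.counts = Counter(tokenize(reference_text))
        self.total = sum(self.counts.values())
        self.vocab_size = len(self.counts) + 1  # +1 slot for unseen words

    def prob(self, word: str) -> float:
        # Add-one smoothing so unseen words keep a small nonzero probability.
        return (self.counts[word] + 1) / (self.total + self.vocab_size)

    def perplexity(self, sentence: str) -> float:
        """exp of the average negative log-probability per token.

        Surprising (low-probability) words push the score up.
        """
        tokens = tokenize(sentence)
        if not tokens:
            return 0.0
        avg_nll = -sum(math.log(self.prob(t)) for t in tokens) / len(tokens)
        return math.exp(avg_nll)


def burstiness(model: UnigramModel, text: str) -> float:
    """Spread of per-sentence perplexity (assumed definition for this sketch)."""
    scores = [model.perplexity(s) for s in split_sentences(text)]
    return pstdev(scores) if len(scores) > 1 else 0.0


if __name__ == "__main__":
    # Made-up reference text, only to give the toy model some word counts.
    model = UnigramModel("the cat sat on the mat the cat ran")
    sample = "The cat sat. A zebra quietly juggled flaming umbrellas."
    for sentence in split_sentences(sample):
        print(f"{model.perplexity(sentence):8.2f}  {sentence}")
    print(f"burstiness: {burstiness(model, sample):.2f}")
```

In this sketch, a sentence built from words the model has seen often scores low in perplexity, and a text whose sentences all score about the same has burstiness near zero. By the article's description, low values on both would point toward a bot. A real detector would need a far stronger language model and calibrated thresholds.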
"There are beautiful qualities of human written prose that computers can and should never co-opt," Tian explained. As a journalism student, he was inspired by a class he took with American writer John McPhee, who taught him about those beautiful qualities of human writing.
Tian would go on to use an essay by McPhee in The New Yorker as part of his demo for GPTZero.
Despite building the tool, Tian isn't anti-AI. He believes that there's a time and a place for AI tools if they are used ethically and with consent. Hell, he's even used AI programs like CoPilot to "support much of my coding."
"I'm not opposed to using AI for writing when it makes sense," he said.
With the hype and fears surrounding ChatGPT, a tool like Tian's could prove to be incredibly useful across sectors, from educators who want to see if a student plagiarized an essay to job recruiters who want to check if a cover letter was actually written by an applicant. As such, it could also be incredibly lucrative to the right investors, some of whom have already reached out to Tian.
"Just in the past day, a bunch of VCs have slid in my Twitter DMs," Tian said, including the likes of A16Z, Menlo Ventures, and Red Swan. But he's not done with GPTZero quite yet. He wants to further refine and develop the app, and he even has plans to expand its transparency with "explainers and detection methodologies."
And, at the end of the day, he's a senior in college. He has finals looming, with homework and human-generated essays to worry about. Right now, that's a much bigger concern than a digital Pandora's Box or VC investors.
"I'm going to take all the calls, but for now," he said with a laugh, "I'm just a college student focused on graduating from school."