Software Analyzing Texts?

Can software accurately analyze the writing style of an author to determine if he or she wrote a specific work? Maybe…
Open source app can detect text's authors
http://www.theregister.co.uk/2013/02/22/author_detection_uni_adelaide/
A group of Adelaide researchers has released an open-source tool that helps identify document authorship by comparing texts.

While their own test cases – and therefore the headlines – concentrated on identifying the authors of historical documents, it seems to The Register that any number of modern uses of such a tool might arise.

The two test cases the researchers drew on in developing their software, on Github here, were a series of US essays called The Federalist Papers, and the Letter to the Hebrews in the New Testament.

The Federalist Paper essays were written in the lead-up to the drafting of the US Constitution, by Alexander Hamilton, James Madison and John Jay. Of the 85 essays, the authorship of 12 is disputed and one has generally been attributed to Jay.

Professor Derek Abbott of the University of Adelaide explains the results: "We've shown that one of the disputed texts, Essay 62, is indeed written by James Madison with a high degree of certainty.

"But the other 12 essays cannot be allocated to any of the three authors with a similarly strong likelihood. We believe they are probably the result of a certain degree of collaboration between the authors, which would also explain why there hasn't been scholarly consensus to date."
I love research such as this. One of the problems we will face with online courses (and even traditional courses) is that students might be more tempted to submit the works of others as their own. You can buy almost anything online, including term papers and reports. What if software could flag works as questionable? That would be pretty valuable.

The challenge might be amassing sufficient amounts of verified text to establish a pattern, but we could always ask students to write a few short samples. Their online forum posts would also give us some sense of how a student writes when a grade isn't at stake.
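For the curious, here is a rough sketch of what comparing a student's verified samples against a submitted paper might look like. To be clear, this is not the Adelaide group's method, which I have not tried to reproduce. It is a much simpler, common stylometric idea: count how often a writer uses ordinary "function words" and measure how similar two such profiles are. The word list and the function names (`style_profile`, `style_similarity`) are my own assumptions, chosen only for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny list of function words.
# Real stylometric studies use far larger lists and much more text.
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in",
                  "that", "is", "it", "but", "upon", "while"]


def style_profile(text):
    """Return the relative frequency of each function word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = len(words) or 1  # avoid dividing by zero on empty text
    return [counts[w] / total for w in FUNCTION_WORDS]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length frequency vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # no function words found in at least one text
    return dot / (norm_a * norm_b)


def style_similarity(known_text, questioned_text):
    """Score from 0.0 to 1.0; higher means more similar function-word use."""
    return cosine_similarity(style_profile(known_text),
                             style_profile(questioned_text))
```

A low score would only be a reason to take a closer look, not proof of anything. Short samples, quoted passages, and a change of topic can all shift these numbers.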

The research and the software are both freely available.

Free software means there is a high likelihood that other scholars will test the software and the conclusions of the researchers. The more testing of the software, the more likely it will be improved. It might be reasonable to expect private industry and public agencies to also test the software.
In the research paper, published in full at PLOSOne, the group notes that author attribution is a question that's stretching beyond academia in the modern era.

http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0054998
"Due to an increase in the amount of data in various forms including emails, blogs, messages on the internet and SMS, the problem of author attribution has received more attention. In addition to its traditional application for shedding light on the authorship of disputed texts in the classical literature, new applications have arisen such as plagiarism detection, web searching, spam email detection, and finding the authors of disputed or anonymous documents in forensics against cyber crime," the researchers write.

They note that further research would be needed to test their methodology against modern texts – but with the software offered for free, The Register can easily imagine the software getting a workout by any number of interested parties.
Free software? I know I'm curious enough to experiment with some public domain texts. After all, maybe Chaucer didn't write all those poems!
