Skip to main content

You Can Write, But You Can't Hide: Big Data Knows Your Writing Quirks

posted onAugust 30, 2012
by l33tdawg

As I wrote recently, data scientists have been able to decode unstructured data to accurately predict where violence will occur in Afghanistan. Now, they can also mine unstructured data to determine the identity of a document’s writer. All of us, it seems, have a “write-print” as unique as our fingerprint.

According to forensic linguists, the experts who investigate a text’s originator, if they have an individual’s known writings, they can detect with up to 95% accuracy that person’s authorship of any other document. Forensic experts have been called as witnesses in the high profile lawsuit by Paul Ceglia, who has sued Mark Zuckerberg, claiming he owns half of Facebook. They’ve also been expert witnesses in murder trials.

While the field of forensic linguistics predates the advent of big data, the sheer volume of data being generated on the Internet is opening new business opportunities for automating the analysis. A company pursuing these opportunities claims it can pinpoint a document’s author and determine everything from the gender, age, and education of a writer to the veracity of the document’s content.

Source

Tags

Privacy Science

You May Also Like

Recent News

Friday, November 29th

Tuesday, November 19th

Friday, November 8th

Friday, November 1st

Tuesday, July 9th

Wednesday, July 3rd

Friday, June 28th

Thursday, June 27th

Thursday, June 13th

Wednesday, June 12th

Tuesday, June 11th

Simplenews subscription

Stay informed - subscribe to our newsletter.
The subscriber's email address.
Keeping Knowledge Free for Over a Decade

Copyright © 2018 Hack In The Box. All rights reserved.

36th Floor, Menara Maxis, Kuala Lumpur City Centre 50088 Kuala Lumpur Malaysia
Tel: +603-2615-7299 Fax: +603-2615-0088