The Crowd Sourcerer: Luis von Ahn
“I build systems that combine humans and computers to solve large-scale problems that neither can solve alone,” Luis von Ahn writes on the Carnegie Mellon University website. “I call this Human Computation, but others sometimes call it crowdsourcing.” Whatever it’s called, von Ahn’s work can be frustrating: he was the brains behind the CAPTCHA system that prevents spambots from filling in web forms, although it frequently stops humans from doing so too by asking them to identify illegible phrases. Despite its flaws, it was bought by Google. Its successor, reCAPTCHA, takes advantage of the fact that 200 million people were, in von Ahn’s words, “wasting 500,000 hours a day typing annoying CAPTCHAs”.
Von Ahn thought those wasted hours could be put to better use, and now reCAPTCHA asks people to fill in two words: one regular puzzle and another that Google’s OCR has failed to identify during its attempt to digitise millions of books. Why? Because humans are better at converting images to text than the fallible OCR software. As a result, reCAPTCHAs placed on sites such as Google, Facebook and Twitter are used to digitise more than two million books a year.
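For the curious, the two-word trick can be sketched in a few lines of Python. This is only an illustration of the idea described above, not reCAPTCHA's actual code: the names `UnknownWord` and `check_challenge`, and the rule that three matching answers settle a word, are assumptions made for the example.

```python
from collections import Counter

# Hypothetical threshold: how many matching human answers are required
# before a transcription is trusted. Not a figure from the article.
VOTES_NEEDED = 3


class UnknownWord:
    """A scanned word that the OCR software could not read."""

    def __init__(self, image_id):
        self.image_id = image_id
        self.votes = Counter()  # typed answer -> number of people who gave it

    def resolved_text(self):
        """Return the agreed transcription, or None if still undecided."""
        if not self.votes:
            return None
        text, count = self.votes.most_common(1)[0]
        return text if count >= VOTES_NEEDED else None


def check_challenge(control_answer, unknown, typed_control, typed_unknown):
    """Pass or fail the user on the control word alone.

    The answer to the unknown word is recorded as a vote only when the
    control word was typed correctly, since that suggests a human typed it.
    """
    passed = typed_control.strip().lower() == control_answer.lower()
    if passed:
        unknown.votes[typed_unknown.strip().lower()] += 1
    return passed


if __name__ == "__main__":
    word = UnknownWord("scan-0001")
    for _ in range(3):
        check_challenge("morning", word, "morning", "upon")
    print(word.resolved_text())
```

The point of the design is that the regular puzzle does the gatekeeping, while the unreadable word is settled only by agreement between several people who have already proved themselves human.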
His latest idea, Duolingo, uses crowdsourcing to translate the web. Von Ahn claims that computer translators don’t work, while paying professional translators is prohibitively expensive. He had to overcome concerns that there wouldn’t be enough bilingual people to achieve his aim, and that it would be impossible to motivate them. His solution? Turning the scheme into an educational program where participants learn a language while they translate web content.
Von Ahn claims the results rival those of professional translators, while “students” are shown increasingly complicated sentences to improve their skills. With a million users, von Ahn says Duolingo could translate Wikipedia from English to Spanish in 80 hours, for free – $15 million less than using human linguists.