The Questionable Quest for Linguistic Purity

This post makes another detour from my original topic. Roughly a month ago, after I wrote my last post about romanticism and the place of translation technology in university curricula, I was in for another surprise – which eventually connects to the same train of thought.

Within a week, I saw two articles that lamented about the deterioration or even the eventual death of the English language. I’m not a native speaker of English, nor am I a specialist of its history, and I don’t know how often such articles are published – but I definitely wasn’t used to seeing such pieces about English coming from high-profile news sources. That said, I am always ready to admit that this is due to my ignorance about the history and the current goings-on of English.

Continue reading

Translation, Technology, Training – or Romanticism and Fear

Horror stories – when I taught translation technology to university students, I had always started the semester with those. At least three of them. You could ask, weren’t my students frightened enough already? I happen to have the same answer as Aragorn gave Frodo & co., when they first met in the Prancing Pony: Not nearly frightened enough.

The stories I told my students were about translation jobs that simply couldn’t be done without help from technology. Translations of thousand-page books to be upgraded to a new edition in six weeks, editing, proofreading, printing included. Millions of words of automotive manuals to be translated into twenty-plus languages in three weeks. Tens of thousands of words of highly specialized tender documents to be translated over a long week-end, reviewing included, with no compromise possible about the hour of delivery. And the list goes on.

The purpose of these stories was to put technology in perspective for students who had never translated a single word for money before. And when I was to introduce a particular feature of technology, I’d always tried to remember to point to a practical problem where it helped.

Continue reading

Ambiguity

If you ask computers, they will probably name ambiguity as public enemy number one. Ambiguity occurs when a word or expression can mean two or more different things, and you can’t find out which from the word alone – for that, you need the surroundings, or the context, and often also a lot of background  information. Here’s an example: is a guide a book or a person? Although computational linguistics has methods to deal with some of this, resolving ambiguity remains mainly the privilege of the human mind.

Yehoshua Bar-Hillel himself invokes the concept of ambiguity in language to prove that high-quality automatic translation is not feasible, at least not if you stay with the generative approach. But I have already spent several posts on this, so now it’s time for something completely different.

Today I plan to confuse my readers by pointing out that human beings, whether they like it or not, are part of translation technology – or any technology, for that matter. Yet in the previous post, I argued that humans, for better or worse, are prone to refuse to be part of the machine.

Continue reading

Translation Technology: Replacement or Enhancement?

At one point in The Imitation Game (again), Commander Denniston enters Alan Turing’s workshop, shuts down Christopher the code-breaking machine, then orders Turing off the premises. The machine is not quite complete. Turing, terrified, protects it with his own body, and insists that the machine will work. He and his work is saved by fellow code-breakers who stand up for him. Then Denniston gives him one more month to make Christopher work.

Aspiring teams of machine translation research weren’t so lucky after, in 1964, the US government thought to set up a committee to look into their progress. The committee, pompously named the Automatic Language Processing Advisory Committee, or ALPAC in short, was active for two years, engaged in discussions, heard testimonies – and, in 1966, came up with a report that many thought was the nemesis of machine translation research.

Continue reading

“It is with our good will”

I thought I’d reboot my blog Dreamers and Doers – so it could use a captatio benevolentiae at this point. In this blog, I collect my thoughts on the history of translation technology – any technology that employs machinery for people to understand each other. At times, I take a peek at the science behind well-known technologies and services (like machine translation). I aspire to write my notes for the non-researcher: the professional translator, the student of engineering or linguistics – these posts are practically for anyone interested in language and technology. In this, I’m trying to contribute, meager as my contribution might be, to public communication about science and engineering, more specifically, language technology.

Continue reading

Corpus Cosmology

The generative theory of language (see the previous post for details) is mathematically sound and intellectually appealing. What’s more, it’s well suited for computer processing: for many generative grammars, it’s relatively easy to write a computer program that analyzes or produces texts that match that particular grammar.

At the time generative linguistics was introduced (1957), its computer applications were politically motivated, too: intelligence services hoped for an automatic translation facility that would quickly help them read Russian scientific papers, for example – as it turns out from the ALPAC report that brought about the temporary demise of machine translation and the ascent of machine-assisted human translation.

Continue reading

Language in Numbers

Everything can be expressed in numbers, said Alan Turing and Kurt Gödel when they were thinking up methods to come to mathematical statements and proofs – about mathematical statements and proofs. Turing sought a systematic approach to tackle Hilbert’s Entscheidungsproblem, while Gödel used his eponymous numbering system to prove his own incompleteness theorems. Which happened to constitute a negative proof to the same Eintscheidungsproblem. You can go back one post if you are interested in the particulars.

In that previous post, I have already attributed the ascent of the universal automata – computers – to this approach. Now I will show how this same approach enabled the mathematical treatment of language, giving birth to the processing of human languages on computers.

Continue reading

Universal Understanding, Universal Machine

Roughly six weeks ago, I went to see The Imitation Game – I caught one of the last English-language screenings in my city. Opinions might vary about this movie, but Alan Turing’s attitude, as shown in the film, reflected the mindset of a true programmer. True programmers, when they face a specific problem, tend to go one abstraction level up, and create a solution not just for the problem at hand, but for an entire class of similar problems. In fact, this is the very attitude that gave us language technology.

Continue reading

And they shall be at liberty to keep festivals and make rejoicings

… says the decree that Ptolemy V issued in 196 BC, at the time of his accession to the Egyptian throne. He – or the people who erected the stele with this text – probably didn’t know what joy they had actually given to later generations: first, to Jean-François Champollion; second, to historians who could finally understand ancient Egyptian scripts and unravel Egyptian history; third, to language technicians who found yet another historic item that they could use as legacy and name their products after.

Maybe the last part is a bit too sarcastic because the Rosetta stone and its likes hold real value for all these people – I mean, beyond the symbolic significance. In fact, the Rosetta stone is not the most important or the best-preserved specimen of its kind (see another example here) – but it had been discovered first, which made it the primary vehicle of deciphering the Egyptian scripts. Continue reading