Japanese Machine Translation

NMT versus SMT results in Japanese

The Pangeanic neural translation project

The last few months have been extraordinarily busy at Pangeanic, with a focus on the application neural networks for machine translation (neural machine translation) with tests into 7 languages (Japanese, Russian, Portuguese, French, Italian, German, Spanish), the completion of a national R&D project (Cor technology as a platform for translation companies offering an integrated way of analyzing and managing website translation and document analysis), the integration of CAT-agnostic translation memory system ActivaTM into Cor and our neural engines, and the award by the European Union’s CEF (Connecting Europe Facility) of the largest digital infrastructure project to build secure connectors to commercial MT vendors and the EU’s own machine translation service (MT@EC) for public administrations across Europe. Leading machine translation developers such as KantanMT, Prompsit, Tilde and our PangeaMT join forces with consulting company Everis to build IADAATPA, a system that will intelligently work on domain adaptation and the selection of […]

Deep learning – The day language technologies became a Christmas present

It is said the third Monday every January is the saddest day in the year. It does not take deep learning to feel so. A long vacation period has ended. No sight of another one until several months away. Overspent, overstuffed, with no more presents to exchange, with winter settling in the Northern hemisphere and missing the drinks and chocolates that made our sugar levels go sky high, many start booking holidays in the sun. Let’s turn the clocks back to Christmas and we will remember the last few weeks as the Christmas when language technologies made it to the top of the list. Millions of people, literally, have opened boxes whose content was an electronic assistant with a rapidly improving ability to use human language. There are two main products: Amazon’s Echo, featuring the Alexa digital assistant, which sold more than 5m units. In essence, Echo is a desktop […]

Machine translation: Can it be used to translate travel industry content?

by Manuel Herranz There have been strong opinions for and against machine Translation over the last few years. Whilst the general public has become a keen user of free online services, professional translators have poured bitter criticisms against the technology. Understandably, because the language industry is a small industry compared with other sectors where automation took place years ago (automotive industry, printing, telecommunications, to name a few). The Internet and in general any industry based on electronic communications has added to the increase in demand for multilingual websites, which means more translation for eCommerce sites and website translations. There are many supporters of machine translation technology because of the many advantages and problems it has solved where a translator could not be at hand and human translation was not an option. See the video celebrating Google Translate’s 10 years. But it has also gained something of a bad press, particularly […]

NTT DoCoMo prepares Japanese machine translation through Android

Japan is unique in many ways and this is reflected and expressed in its culture and its challenging language.  Japanese is controversially an Altaic language spoken by around 127 million people. Its intrinsic characteristics make it a challenge for machine translation and other forms of translation automation, although Pangeanic, in collaboration with Toshiba, has reported several advances in hybrid MT (as published in the Asian Association of Machine Translation in 2011) and presented in Japan Translation Festival (see presentation here). Making calls to other countries a challenge for Japanese speakers: locals often don’t have much choice but to learn someone else’s language or hope there’s a Japanese speaker on the other end of the line. All going well, NTT DoCoMo’s planned Hanashite Hon’yaku automatic translation service, international calls will be as comfortable as phoning a store in Nagano. As long as a subscriber has at least an Android 2.2 phone […]

Toshiba and Pangeanic Steps to Machine Translation Hybridation – Article in AAMT

Pangeanic’s R&D team and Toshiba’s Knowledge Media Laboratory have published a joint article describing an initial hybrid pilot setting the basis for future work in the development of hybrid machine translation technologies from English into and out of Japanese. This article has been published in the December issue of the Asia-Pacific Association for Machine Translation (number 50). A copy of the article is available for download from PangeaMT’s site. The article was co-written mainly by Ms Elia Yuste, Mr Manuel Herranz as the initiator of the project and Mr Alexandre Helle as leader of the nipponization module at Pangeanic with Toshiba’s input coming from Hirokazu Suzuki from the Corporate Research & Development Center of Toshiba Corporation. The article describes progress made using the statistical machine translation open-source platform Moses and Toshiba’s rule-based system to obtain better outputs. Future work points to Pangeanic’s syntax-based approach integrating English and Japanese within a […]