Localization

I used to be a Translator, Now I Run Machine Translation (LocWorld London 2013)

by Manuel Herranz It is only when looking back in time that one realizes how much work has been done, how far we are from where we used to be … what we call progress. That is what happened during our presentation at Localization World presentation of PangeaMT machine translation technologies. Our presentation summarized how the mix of Pangea Machine Translation Technologies have enabled translation practitioners to empower themselves and be active in machine translation rather than just be passive users or passive post-editors. The platform is mostly based on open source developments to allow flexibility and customization, but it also includes proprietary cleaning filters, translation engine creation and retraining, dataset management and a very powerful set of statistics so users can see improvements every step of the way. Pangea is the history of a solution designed for translators, for applied language professionals. It is machine translation as a productivity enhancer […]

How to build, run and own your machine translation ecosystem – Pangeanic at LocWorld London 2013

Pangeanic will exhibit its Pangea machine translation technology at LocWorld London 2013 inside ELIA‘s booth.  Over the 3-day event, you will have a chance to meet our representatives and see for yourself how PangeaMT works and how easy it is to create translation engines, manage them and update engines, clean and segregate training material and of course obtain translation in portable and open formats (from TMX and XLIFF to xml-compliant docx, odt, html and ttx). Manuel Herranz is also a guest speaker at the pre-conference day, where he will speak as an experienced implementor of machine translation (MT) technologies at LSP and for large organizations with big publishing needs. Pangeanic was a founding member of TAUS, the industry think-tank and its spin-off the Data Association. Advancements in machine translation led Pangeanic to become the first language service provider to successfully apply Moses as recorded by the EU research program Euromatrixplus. The release of […]

Machine Translation in Short

by Manuel Herranz It is evident that certain documents require a human translator in order to interpret the subtleties of a language. Nevertheless, no matter how skilled a human translator may be, machine translation (also known as automatic translation or MT for short) exceeds the efficiency of a human translator. Machine translation is generally used for subject-specific cases and this is where results and productivity rates are spectacularly higher. It allows individuals and companies to tailor their work according to the topic. Consequently, this enriches the output and quality of machine translation by cutting down on the number of choices for each word(s) to be translated. This form of translation is extremely helpful in areas where formal language is used or phrases are repeated without much variation, such as administrative documents, which do not require the use of colloquial language and expression. The potential of machine translation has been increasingly […]

End of Multimodal Pattern Recognition Project Hails New Features of Machine Translation

by Manuel Herranz The closure of the MIPRCV project (Multimodal Interactive Pattern Recognition) at the beginning of November showcased real-life industry applications from Spanish Research & Development, with examples from bank La Caixa, translation company Pangeanic for language and machine translation, Telefonica for image retrieval, etc. All systems relied on the concept of using existing information (be it bank information like receipts, invoices, and orders, translated bilingual files, classification of web images with and without text, etc)  and processing it on the dual concept of off-line training to produce good enough models and on user interaction that generates a system feedback whence the system learns and improves automatically. This is the also the basis, for example, of Pangeanic’s DIY SMT system applied to machine translation. Research and industry presentations ranged from prototypes to applied technology already in use in industry. One star application is the semantic information (and syntactic sometimes […]

Facebook to add automated translation services for posts

With over 750 million accounts, Facebook users span nearly every country in the world -and it has been ranked as the 7th most populated country in the world.  It was recently valued at 65,000 million US$. Not bad for a company that lets people chat and share pictures at its basic level, and that connects at its highest. It sucess has brought shadows to other online companies such as Yahoo!, which had to fire its MD Carol Bartz by phone, as the company struggles to keep up with other online giants. Facebook faces a common “problem” many large countries do:  multilinguism (if you consider that a problem) or rather the fact that it holds communities which do not interact with each other as there is a language barrier (that is a problem in the real world and in the digital world). However, according to an Inside Facebook post on 2nd […]

For Europe, no (new) CAT tool is good enough

by Manuel Herranz And why should it be? Decisions coming from Brussels tend to be misunderstood, shallowly analyzed or directly criticized whichever way the wind blows. Let us remember 2010’s first ever report on the Size of the Translation Industry in Europe, which was a very comprehensive view of the current status, country by country, and facts and figures into several areas, even if revenues could only take into account certain activities. It also contained words and forecasts from personalities in the industry. Liking reports is like choosing a favorite color – everyone has one liking. Nevertheless, it set detailed information where there was none. However, the decision not to award the contract to any CAT tool in the market points to a very clear state-of-affairs in the language industry: despite massive innovations in computing (from open cloud to internal or managed clouds: Eucalyptus (built on Amazon EC2), OpenNebula, the […]

Multilingual Web in Pisa & Translator MT Awareness at EU

by Manuel Herranz April was a busy month within the Northern-hemisphere, conference-rich season. Gala took place in Lisbon and, although Pangeanic – PangeaMT could not participate, there were plenty of other specialist venues to choose from. MultilingualWeb took place in Pisa during 4th and 5th April. The venue was true to its premise to discuss standards and best practices for the Multilingual Web and gathered a good, specialist and multi-disciplined crowd. The mixture was well planned: the sessions were standard by a keynote address by Oreste Signore and Kimmo Rossi, as well as a report from Ralf Steinberger about multimedia news reports service from the JRC (the organisation has made all presentations and videos available, so the blog entry was worth the waiting). You can check all presentations here. Not being an expert on the creation or development of the web, but being a keen open-standards supporter, I found it very enlightening […]

Lisa closes operations

by Manuel Herranz There was a time when there was practically one, powerful, language industry organization. It was known as LISA. There was a time when membership was de rigeur for almost any serious LSP. LISA advanced a lot of the work in translation standards, pioneering many of them, as well as metrics. However, defections from LISA, lack of interest and the birth of many other initiatives, sometimes with a more clear focus, have undermined its reign. To the point that it has become insolvent and it has closed. The last  official words from LISA come from its Managing Director, Michael Anobile in the last event he has attended, in Boston. Right after the event, a letter was sent to members and to the industry in general: “It is with sincere regret I inform you that the Localization Industry Standards Association (LISA) is insolvent and will close its operations effective 28 February […]

Breaking News – a few days later: “Aha, SDL has bought Language Weaver! So what?”

by Elia Yuste & Manuel Herranz SDL announced the purchase of Language Weaver (LW) on Thursday 15th July 2010. General media as well as GILT industry experts have spread the news in just no time in the form of newsletter updates, opinion articles in professional networks, blog entries or tweets. The news coverage has been phenomenal. Everyone has been asking about the why’s, the how’s and the thereafter’s of this financial move by SDL. We coincide with some leading industry analysts in that perhaps Language Weaver’s future will not be as rosy as it may seem. Perhaps it will follow the same destiny as Idiom once did, from market establishment to product support discontinuation following its acquisition by big buying father SDL. As an independent LSP offering its own customized MT technology, Pangeanic is in a privileged position to offer dedicated client-focused solutions, based on innovative MT and, if required, post-editing […]

Eyes Wide Shut? Still willing to send out your data to G-Translate and Language Weaver through SDL Studio 2009?

by Elia Yuste & Manuel Herranz At Pangeanic most of our internal globalization production workflows are now driven by open-standard software, be it our own PangeaMT engines or a translation environment, like SwordFish, to use it say for MT post-editing. Both our in-house and outsourced linguists are familiar with this. Still, we keep using translation-memory based software, and in this respect, traditional CAT tools like Trados or SDL 205, 2007 are still perfectly adequate. We have not felt urged by the client side to upgrade to the new Studio 2009. Nevertheless, we have tried and tested some of its features. Exciting as though it may seem that such significant industry-share software product has incorporated Machine Translation plug-ins, which reflects SDL´s broadening mind towards this kind of technology, many of their users may not be conscious about something that we consider as pretty outrageous. We first noticed that when calling up […]