I used to be a Translator, Now I Run Machine Translation (LocWorld London 2013)

by Manuel Herranz It is only when looking back in time that one realizes how much work has been done, how far we are from where we used to be … what we call progress. That is what happened during our presentation at Localization World presentation of Pangea machine translation technologies. Our presentation summarized how the mix of Pangea Technologies have enabled translation practitioners to empower themselves (see the launch of our DIY SMT in Barcelona, 2011) and be active in machine translation rather than just be passive users or passive post-editors. The platform is mostly based on open source developments to allow flexibility and customization, but it also includes  propietary cleaning filters, translation engine creation and retraining, dataset management and a very powerful set of statistics so users can see improvements every step of the way. Pangea is the history of a solution designed for translators, for applied language professionals. It […]

Expert

Pangeanic in EU EXPERT Project Evaluating Hybrid Machine Translation

by Manuel Herranz As recently published in our news section, Pangeanic is taking part in the EU-funded EXPERT Project. EXPERT aims to train young researchers to promote the research, development and use of hybrid language translation technologies. In practice, EXPERT aims at improving translation practices and enhancing the productivity of relevant actors in the translation market. In this respect, EXPERT’s findings will set an agenda for new skills and jobs by promoting new job profiles based on empirical data from translation professionals (language service providers) and academia. The assumption of the project is that true potential of MT remains to be exploited as a result of non-user-friendly interfaces, lack of awareness of translator’s feedback, etc. However, Pangeanic already created and released a web-based tool that is able to organize material for Machine Translation by domain, maintain it and perform some cleaning routines, a key factor in our participation in the project. […]

Translation Technologies at LocWorld (Part 2: Practitioners)

I will describe the rest of the very interesting Preconference Day and presentations by the organizers (TAUS) as well as other 3 companies which are either machine translation developers or practitioners of automated translation solutions. Presentations brought different perspectives to the machine translation landscape, with efforts and advancements by several companies, including Pangeanic. Maxim Khalilov from TAUS summarized the good work being done by the organization within the MosesCore project. His presentation was an invitation to visit and find out more about tools, data and resources. Amongst the tools, he mentioned other alternatives to Moses (Thot, for instance), as well as a collection of TAUS features on quality evaluation. Important for new entrants or those with an interest in MT is the collection of data at TAUS: Europarl (1,8 million sentences), JRC-Acquis (270 paragraphs), Hansards, UN, OPUS, LDC Linguistic Data Consortium, ELRA, as well as TAUS’ own repositories, and many […]

Translation Technologies at LocWorld (Part 1: Moses)

I attended Localization World London both as a guest speaker on what I call an upsurge in machine translation, almost a “transition frenzy” towards post-editing “future stability” within the EU-sponsored MosesCore project, organized by TAUS, and as an exhibitor of Pangea’s DIY SMT machine translation technology. The session formed part of the Pre-Conference Day and it was a lively session with plenty of Q&A from attendees, reflecting the high interest MT has stirred among translation users and practitioners nowadays. Prof. Hieu Hoang provided a general introduction to what an SMT system is, what translation and language models are. The distinction between a translation model and, probabilities of phrases to figure out how the output sentence is grammatically correct, proper re-ordered, etc. Prof. Hoang related the story of how he originally updated Pharaoh to replace Moses and now only maintains it, as it has become a community-based project in which he […]

How to build, run and own your machine translation ecosystem – Pangeanic at LocWorld London 2013

Pangeanic will exhibit its Pangea machine translation technology at LocWorld London 2013 inside ELIA‘s booth.  Over the 3-day event, you will have a chance to meet our representatives and see for yourself how PangeaMT works and how easy it is to create translation engines, manage them and update engines, clean and segregate training material and of course obtain translation in portable and open formats (from TMX and XLIFF to xml-compliant docx, odt, html and ttx). Manuel Herranz is also a guest speaker at the pre-conference day, where he will speak as an experienced implementor of machine translation (MT) technologies at LSP and for large organizations with big publishing needs. Pangeanic was a founding member of TAUS, the industry think-tank and its spin-off the Data Association. Advancements in machine translation led Pangeanic to become the first language service provider to successfully apply Moses as recorded by the EU research program Euromatrixplus. The release of […]

pangeanic

Machine Translation in Short

It is evident that certain documents require a human translator in order to interpret the subtleties of a language. Nevertheless, no matter how skilled a human translator may be, machine translation (also known as automatic translation or MT for short) exceeds the efficiency of a human translator. Machine translation is generally used for subject-specific cases and this is where results and productivity rates are spectacularly higher. It allows individuals and companies to tailor their work according to the topic. Consequently, this enriches the output and quality of machine translation by cutting down on the number of choices for each word(s) to be translated. This form of translation is extremely helpful in areas where formal language is used or phrases are repeated without much variation, such as administrative documents, which do not require the use of colloquial language and expression. The potential of machine translation has been increasingly explored. In 2009, […]

Multilingual web is more than translation (1/2)

by Manuel Herranz It is beyond doubt that the web has become a multilingual. The work, experiences and cross-pollination with other disciplines, from machine translation to localization and semantics, were shared at EU-sponsored Multilingual Web event which took place in Rome during 12-13th March 2013. Whilst technologies such as machine translation are already well-integrated for fast web page translation, it was reassuring to see that even large web actors, such as Google consider there is plenty of work to do in making the web truly multilingual. The release of ITS 2 and the new features and possibilities that html5 opens made the venue a meeting point for professionals, practitioners and academics dealing with the semantic web, translation, applied machine translation and CMS tool providers. Google’s experiences were shared by Mark Davis and Vladimir Weinstein and pinpointed translation and localization issues which are often overseen. We already assume that a page can […]

Machine Translation Helps Patent Offices Worldwide

by Manuel Herranz A unified patent court will come into effect at the beginning of 2014 in the wake of a European Parliament decision which will break a patent translation deadlock. This agreement still depends on the system being ratified by thirteen European Union states including Germany, France and the UK. The European Patent Office (EPO) also officially welcomed the adoption of this resolution by the European Parliament in Strasbourg of two draft regulations on the creation of the unitary patent, hailing it as a historic achievement. “The European Union is to be congratulated on this decision, which clears the way for the completion of the European patent system with a unitary patent and a Unified Patent Court, which we have been waiting for in Europe for 40 years,” said EPO President Benoît Battistelli. Members of the European Parliament voted in favour of a new single European patent system which is […]

Pangeanic Christmas Party… All for Translation Automation!!

Let’s change our machine translation and translation automation focus for once and share the happiness of Christmas period with everyone. All Pangeanic staff work very hard in all types of translation projects and translation consultancy so… it was time to celebrate! Next time you think languages, think Pangeanic Machine Translation Engines from PangeaMT follow us on –>   @Pangeanic   @manuelhrrnz

Help translation project to protect global internet freedom

The following statement has been written by Ellery Biddle in the Advocacy section of Global Voices. We urge all our readers to share it and understand the serious issue behind Internet governance at stake. If Internet can be controlled in the way it is proposed and its openness constrained, our global rights as netcitizens will be compromised. Below follows Ellery’s call, as it appears in the blog. “Over the next seven days, Global Voices Lingua volunteers will be translating a public online petition that supports the protection of human rights online and urges government members of the International Telecommunication Union (ITU) to preserve Internet openness at the upcoming conference of the ITU. Open for sign-on by any individual or civil society organization, the Protect Global Internet Freedom statement reads as follows: On December 3rd, the world’s governments will meet to update a key treaty of a UN agency called the International […]