EU

Pangea Machine Translation at Machine Translation Summit XIV

Pangeanic’s machine translation technologies from division PangeaMT and its involvement within the EXPERT program will be presented at the poster session of the European MT Machine Translation Summit XIV which will take place in Nice, 2-6th September. Pangeanic is attending as a member of the EU-funded Marie Curie consortium EXPERT (EXPloiting Empirical appRoaches to Translation). The aim of this EU-funded action is to train young researchers, namely Early Stage Researchers (ESRs) and Experienced Researchers (ERs), promoting research, development and the use of hybrid LT (language translation) technologies. Like other partners in the consortium, Pangeanic is to contribute to different project stages. At the end of the project, the technological division PangeaMT will oversee the implementation and evaluation of the new hybrid computer-aided translation technology proposed in EXPERT. The company will also collaborate by (co-)hosting and supervising a number of post-graduate and post-doctoral researchers. Some topics of interest for the translation industry are […]

Expert

Pangeanic in EU EXPERT Project Evaluating Hybrid Machine Translation

by Manuel Herranz As recently published in our news section, Pangeanic will take part as a full member in the EU-funded EXPERT Project. EXPERT aims to train young researchers to promote the research, development and use of hybrid language translation technologies. In practice, EXPERT aims at improving translation practices and enhancing the productivity of relevant actors in the translation market. In this respect, EXPERT’s findings will set an agenda for new skills and jobs by promoting new job profiles based on empirical data from translation professionals (language service providers) and academia. The assumption of the project is that true potential of MT remains to be exploited as a result of non-user-friendly interfaces, lack of awareness of translator’s feedback, etc. However, Pangeanic already created and released a web-based tool that is able to organize material for Machine Translation by domain, maintain it and perform some cleaning routines, a key factor in our participation […]

Translation Technologies at LocWorld (Part 2: Practitioners)

by Manuel Herranz I will describe the rest of the very interesting Pre-conference Day and presentations by the organizers (TAUS) as well as other 3 companies which are either machine translation developers or practitioners of automated translation solutions. Presentations brought different perspectives to the machine translation landscape, with efforts and advancements by several companies, including Pangeanic. Maxim Khalilov from TAUS summarized the good work being done by the organization within the MosesCore project. His presentation was an invitation to visit and find out more about tools, data and resources. Amongst the tools, he mentioned other alternatives to Moses (Thot, for instance), as well as a collection of TAUS features on quality evaluation. Important for new entrants or those with an interest in MT is the collection of data at TAUS: Europarl (1,8 million sentences), JRC-Acquis (270 paragraphs), Hansards, UN, OPUS, LDC Linguistic Data Consortium, ELRA, as well as TAUS’ own repositories, […]

Translation Technologies at LocWorld (Part 1: Moses)

by Manuel Herranz I attended Localization World London both as a guest speaker on what I call an upsurge in machine translation, almost a “transition frenzy” towards post-editing “future stability” within the EU-sponsored MosesCore project, organized by TAUS, and as an exhibitor of PangeaMT’s DIY SMT machine translation technologies. The session formed part of the Pre-Conference Day and it was a lively session with plenty of Q&A from attendees, reflecting the high interest MT has stirred among translation users and practitioners nowadays. Prof. Hieu Hoang provided a general introduction to what an SMT system is as translation technology, as well as what translation and language models are. The distinction between a translation model and, probabilities of phrases to figure out how the output sentence is grammatically correct, proper re-ordered, etc. Prof. Hoang related the story of how he originally updated Pharaoh to replace Moses and now only maintains it, as […]

How to build, run and own your machine translation ecosystem – Pangeanic at LocWorld London 2013

Pangeanic will exhibit its Pangea machine translation technology at LocWorld London 2013 inside ELIA‘s booth.  Over the 3-day event, you will have a chance to meet our representatives and see for yourself how PangeaMT works and how easy it is to create translation engines, manage them and update engines, clean and segregate training material and of course obtain translation in portable and open formats (from TMX and XLIFF to xml-compliant docx, odt, html and ttx). Manuel Herranz is also a guest speaker at the pre-conference day, where he will speak as an experienced implementor of machine translation (MT) technologies at LSP and for large organizations with big publishing needs. Pangeanic was a founding member of TAUS, the industry think-tank and its spin-off the Data Association. Advancements in machine translation led Pangeanic to become the first language service provider to successfully apply Moses as recorded by the EU research program Euromatrixplus. The release of […]

Machine Translation in Short

by Manuel Herranz It is evident that certain documents require a human translator in order to interpret the subtleties of a language. Nevertheless, no matter how skilled a human translator may be, machine translation (also known as automatic translation or MT for short) exceeds the efficiency of a human translator. Machine translation is generally used for subject-specific cases and this is where results and productivity rates are spectacularly higher. It allows individuals and companies to tailor their work according to the topic. Consequently, this enriches the output and quality of machine translation by cutting down on the number of choices for each word(s) to be translated. This form of translation is extremely helpful in areas where formal language is used or phrases are repeated without much variation, such as administrative documents, which do not require the use of colloquial language and expression. The potential of machine translation has been increasingly […]

Multilingual web is more than translation (1/2)

by Manuel Herranz It is beyond doubt that the web has become a multilingual. The work, experiences and cross-pollination with other disciplines, from machine translation to localization and semantics, were shared at EU-sponsored Multilingual Web event which took place in Rome during 12-13th March 2013. Whilst technologies such as machine translation are already well-integrated for fast web page translation, it was reassuring to see that even large web actors, such as Google consider there is plenty of work to do in making the web truly multilingual. The release of ITS 2 and the new features and possibilities that html5 opens made the venue a meeting point for professionals, practitioners and academics dealing with the semantic web, translation, applied machine translation and CMS tool providers. Google’s experiences were shared by Mark Davis and Vladimir Weinstein and pinpointed translation and localization issues which are often overseen. We already assume that a page can […]

Machine Translation Helps Patent Offices Worldwide

by Manuel Herranz A unified patent court will come into effect at the beginning of 2014 in the wake of a European Parliament decision which will break a patent translation deadlock. This agreement still depends on the system being ratified by thirteen European Union states including Germany, France and the UK. The European Patent Office (EPO) also officially welcomed the adoption of this resolution by the European Parliament in Strasbourg of two draft regulations on the creation of the unitary patent, hailing it as a historic achievement. “The European Union is to be congratulated on this decision, which clears the way for the completion of the European patent system with a unitary patent and a Unified Patent Court, which we have been waiting for in Europe for 40 years,” said EPO President Benoît Battistelli. Members of the European Parliament voted in favour of a new single European patent system which is […]

I want you to speak English or get out

EU reduces translation budget – Machine Translation and Post-editing, one future

by Manuel Herranz On 21st November 2012, lawmakers approved a report by Stanimir Ilchev, a Bulgarian Liberal MEP, that will bring change to the procedural rules recording plenary debates. This decision could be a Godsend for machine translation and language technology developers as the EU plans to increase translation productivity (or times) by 25% – this being a target in current R&D Language Technology Funding Calls. Starting from the next plenary, on 10th December, the European Parliament is not going to be required to translate the session into all the 23 official languages of the EU. Over the years, this requirement has proved quite costly and can take up to four months. However, a bias towards the English language has been pointed to in many circles and instances. For example, Jean Quatremer, a renowned French political journalist from the French daily Libération, complained about the official press statements containing the […]

Language Technology Industry Forum will incorporate

by Manuel Herranz Representatives from Europe’s leading language technology companies met in Brussels on 8th-9th October and agreed to form a legal entity representing their interests. The main outcome of the Workshop, attended by over 50 representatives, was the decision to set up LT-Innovate as a legal entity to serve as representative body for LT vendors at European level. This decision comes in the wake of the very successful LT-Innovate Summit that brought together more than 160 LT stakeholders in June 2012. Strengthening and raising the profile, as well as serving as a lobby to speak as one voice with industry and government is seen as increasingly necessary by many language technology companies, many of them SMEs.  In a world and EU brought increasingly together by economic forces and Internet,  the language technology industry must be an enabler and not a bottleneck in communication, thus opening new markets and business […]