EXPERT Project paper published by Translating and the Computer

Written by M.Ángeles García | 11/30/15
The paper "EXPERT Project:  Advancing the State of the Art in Hybrid Translation Technologies" has been published in the Proceedings of the Translating and the Computer 37 Conference. Pangeanic is a full member of the EXPERT Consortium, and it is in charge of the last Work Package, Pangeanic's role in the Project consortium is as industry expert in statistical machine translation (SMT) and hybrid machine translation as a developer of the PangeaMT system with particular reference to bilingual resource building and leverage for automated engine creation and updating as well as machine translation. One of the aims of the project is that users will be involved in the systematic evaluation and error analysis to be performed with the new machine translation technologies developed within the project. To achieve these objectives, Pangeanic is a member of the consortium as a machine translation technology developer (initially also with Wordfast) and it interfaces with other translation companies like Translated, Hermes Traducciones and Celer.  The University of Malaga (UMA) as an expert organization in human translation plays a key role in the development of an extensive user requirement analysis, as well as for the evaluation of the prototypes developed in EXPERT. [caption id="attachment_3693" align="alignright" width="300"] Proceedings of the Translating and the Computer 37 Conference[/caption] This paper provides an overview of the EXPloiting Empirical appRoaches to Translation (EXPERT) project, an FP7 Marie Curie Initial Training Network, which is preparing the next generation of world-class researchers in the field of hybrid machine translation. Currently, the EXPERT project employs 15 Marie Curie fellows who are working on 15 individual, but interconnected, projects and is organizing local and consortium wide training activities. The project has been running now for three years and has already produced high-quality research. This paper presents the most important research achievements of the project.

Information about Training Facilities for the Post-Doc Fellows and the Visiting PhD Students at Pangeanic

Pangeanic has a team of 8 translation consultants and development experts with long experience in the R&D of multilingual, automated, translation technologies, language development and its deployment in production environments, as well as 2 senior software developers focusing on scalable, web-service based machine translation architectures, user interfaces and deployment experiences. The Post-Doctorate fellows from the EXPERT project have access to Pangeanic’s hardware and software resources for the development of language building tools, the translation system and platform as well as the optimisation of the hybridisation performance. During the project, Pangeanic  provides unlimited access to its entire computing infrastructure to serve the project with its massive amounts of data repositories as well as its statistical language models, automated PangeaMT platform which features automated re-training features, etc. PhD students visiting Pangeanic for a secondment will gather an insightful view about a real-world multilingual content production environment that is also well-aware of and keen to foster and deploy advanced translation technologies. These are some relevant areas in which Pangeanic will be happy to train visiting PhD students as well as its hired post-doctorate fellow  
Local Training Offered at Pangeanic ESR ER
Introductory courses on Pangeanic (as LSP) and Translation Best Practices within an ISO-informed Globalisation / Internationalisation / Localisation / Translation (GILT) approach X X
Hands-on training on any commercial tools used by Pangeanic that were not previously known by them (focus on assets and deficiencies) X X
In-Depth training on PangeaMT (evolution, technology – components, ongoing dev and envisaged work, platform skins, commercialization and market needs…) X X
MT-related skills training for the computational linguist vs. the linguist and translator (including how they can also work together): data preparation, pre-editing, post-editing, output error analysis, etc. X X