Improving quality of customized SMT systems

At the MT Summit in Ottawa (August 28, 2009), Microsoft’s Chris Wendt  presented the findings from a recent pilot project using translation memories from more than ten TDA members to train the Microsoft statistical machine translation engine. Main tests were performed in two languages: Chinese and German, with customization for Sybase iAnywhere. Additional tests also were run on Polish and Japanese languages with customization for Adobe and Dell. BLEU scores went up significantly with increases between 22% and 74% compared to engines trained purely on Microsoft or general available data. This is a link to this seminal presentation Improving the quality of a customized SMT system using shared training data
 

Related Posts

What Are the Best Machine Learning Algorithms for NLP?

What Are the Best Machine Learning Algorithms for NLP?

Nowadays, natural language processing (NLP) is one of the most relevant areas within artificial intelligence. In this context, machine learning algorithms play a fundamental role in the...

Read more

NLP Techniques: The Most Powerful Natural Language Processing Methods

In today’s digital world, companies have access to large volumes of data, data that must be understood in order to deliver a better customer experience, to increase their competitive edge, to streamline internal processes, and more. 

Read more

Pangeanic in the Top 100 LSPs and the Importance of Adapting to the Times

As every year, Slator, a leading provider of news, research, consulting and market intelligence for the translation, localization, interpreting and language technology industry, has conducted its ranking of language service providers (LSPs) around...

Read more