At the MT Summit in Ottawa (August 28, 2009), Microsoft’s Chris Wendt presented the findings from a recent pilot project using translation memories from more than ten TDA members to train the Microsoft statistical machine translation engine.
Main tests were performed in two languages: Chinese and German, with customization for Sybase iAnywhere. Additional tests also were run on Polish and Japanese languages with customization for Adobe and Dell.
BLEU scores went up significantly with increases between 22% and 74% compared to engines trained purely on Microsoft or general available data.
This is a link to this seminal presentation
Improving the quality of a customized SMT system using shared training data
Related Articles
Following our participation in TAUS Tokyo Summit in April, Mr M Herranz, our CEO will visit B.I. Japan in Tokyo and B.I. China in Shanghai. Apart...
Following our participation in TAUS Tokyo Summit in April, Mr M Herranz, our CEO will visit B.I. Japan in Tokyo and B.I. China in Shanghai. Apart...
PangeaMT will be introduced in Japan as part of TAUS Tokyo Summit from 14th-16th April, 2010. The Tokyo summit will be the first of its kind in the...

