Pangeanic is currently developing an R+D project called “COR: FAST AND EFFICIENT TECHNICAL TRANSLATION MANAGEMENT”, having successfully overcome...
Pangeanic has reached its first milestone in its project called Hybrid Neural Machine Translation Platform.
This project, with the backing of the CDTI and the EU in its project Operative Growth of Intelligence (case no. IDI-20170964 ), aims to create a neural machine translation program through the development of hybridization techniques, using AI.Why neural machine translation?
Neural machine translation systems are currently a hot topic in the scientific community. In the last years, the number of publications is growing on this topic. These systems have great advantages; the context taken into account when translating is at sentence level (in classic statistical systems a maximum of 7 words were taken into account) and all the components of the system are trained at the same time in order to achieve better translation quality. Also, the stored model for translation occupies less memory and weighs less than the classical statistical systems. Mega-corporations such as Google (Wu et al., 2016) and Microsoft (Hassan et al., 2018) are interested in neural translation and claim that they are beginning to render neural machine translation results similar to human translation. [caption id="attachment_6929" align="aligncenter" width="625"]
Pangeanic’s stance on its study
Part of the first milestones for Pangeanic’s Hybrid Neural Machine Translation Platform include:- The redesigning of pre-processes and post-processes in order for them to operate correctly in neural systems. Previously designed in statistical systems that could work correctly in neural systems.
- Selecting the appropriate toolkit for the project.
- Architectural design of the project; a standard model was chosen due to its bidirectional sequence-to-sequence recurrent neural network.