digital economy

cropped-pangeamt_def_2010_sin_fondo.jpg

Cloud Traffic and Data Will Increase Translation Services

Cisco estimates that global cloud traffic will grow 45% annually until 2016, with translation services growing at around 15% to 20% per year. According to Ian  Henderson, CTO of Rubic, a translation and location company, this means that many new machine translators must enter the industry each year to handle the content. On the other hand, Raymond Kurzweil, one of the brightest minds in the world, director of technology at Google and a futurist known for his predictions about artificial intelligence, predicts that machines will match human intelligence and perform several feats that seem to us science fiction nowadays, including human-quality translation, by year 2029. Current happenings also suggest a strong role for non-human translation, with machine translation (MT) advancing rapidly. Three simultaneous-translation devices have been announced since June 2012, including one by Microsoft that renders live audio translations from the spoken word, respecting the tones and inflections of the […]

Human Translation or Machine Translation – What’s Best for Me?

For some people, using a translation software program to translate a piece of text from one language to the next is enough. They believe the current state-of-the-art is good enough to provide Human Translation or Machine Translation. It would be naive to believe this always works. We have proven at Pangeanic that this works in applied contexts, when we are dealing with a particular domain, enough clean data and when certain conditions apply. Please refer to many of our presentations since 2009 on the use of applied machine translation to speed translation of documentation in particular. But as we all know, it takes a lot more than just software. The application of unrestricted, universal machine translation will take some time. In fact, it would not be fair to talk about “machine translation” in general but language combinations (English/Spanish/French/Portuguese/Scandinavian) in which it is undoubtedly successful -whilst in other languages certain nuances make […]

Multilingual web is more than translation (1/2)

by Manuel Herranz It is beyond doubt that the web has become a multilingual. The work, experiences and cross-pollination with other disciplines, from machine translation to localization and semantics, were shared at EU-sponsored Multilingual Web event which took place in Rome during 12-13th March 2013. Whilst technologies such as machine translation are already well-integrated for fast web page translation, it was reassuring to see that even large web actors, such as Google consider there is plenty of work to do in making the web truly multilingual. The release of ITS 2 and the new features and possibilities that html5 opens made the venue a meeting point for professionals, practitioners and academics dealing with the semantic web, translation, applied machine translation and CMS tool providers. Google’s experiences were shared by Mark Davis and Vladimir Weinstein and pinpointed translation and localization issues which are often overseen. We already assume that a page can […]

Moses is not the new Messiah

by Manuel Herranz If you run a translation company or translation department or have some sort of connection with the translation industry, you have noticed without a doubt that MT (or automatic translation) is the flavor of the year in 2010… and will be for many years to come. It has and will change the way do things in this industry.  Several factors have been an unstoppable increase in the globalization of services and support, smaller budgets from buyers, an increase in international trading of services and the need for more content and more multilingual content in more languages. As of May 2009, there were 487 billion gigabytes of data which were increasing 50% a year (Oracle) or doubling every 11 hours (IBM). There are both exogenous and endogenous factors for things to reach maturity level now and not earlier or later. Among the latter factors we may include the fact that the bases did already […]

Breaking News – a few days later: “Aha, SDL has bought Language Weaver! So what?”

by Elia Yuste & Manuel Herranz SDL announced the purchase of Language Weaver (LW) on Thursday 15th July 2010. General media as well as GILT industry experts have spread the news in just no time in the form of newsletter updates, opinion articles in professional networks, blog entries or tweets. The news coverage has been phenomenal. Everyone has been asking about the why’s, the how’s and the thereafter’s of this financial move by SDL. We coincide with some leading industry analysts in that perhaps Language Weaver’s future will not be as rosy as it may seem. Perhaps it will follow the same destiny as Idiom once did, from market establishment to product support discontinuation following its acquisition by big buying father SDL. As an independent LSP offering its own customized MT technology, Pangeanic is in a privileged position to offer dedicated client-focused solutions, based on innovative MT and, if required, post-editing […]

IBM + Lionbridge MT agreement – What does it mean for the industry?

by Manuel Herranz The news of the month has undoubtedly been the announcement by Lionbridge to partner with IBM to develop (and probably offer) machine translation solutions. Possibly, the intention is to offer the advantages of MT to Lionbridge existing clients and maybe to control the technology. After all, whoever controls the technology, has a good chance of gaining (or consolidating) market dominance. The move must be welcome by all true believers of MT as the (new?) force of change in the translation industry. However, even though Lionbridge is the biggest language company jumping in the MT-DIY boat, it is not the first one to combine the offer of MT+PE as a substitute to the (increasingly old-fashioned?) TM or T+E+P models. With translator production reaching 850-1000+ words per hour and mounting production and  price pressures, 20th-century technologies seem too cumbersome and antiquated for the demands of multilingual digital content. Who […]

BBC debate demonstrates power of real time machine translation

MT is in the news. On 4th March, The Economist published a review of what the web might feel like without linguistic barriers. “Cyber-multilinguism” is increasingly a reflection of the world we live in. A few days later (9th March) The New York Times run a comparison of machine translated texts from French, Spanish, Russian, German and Arabic into English and the quality obtained from several engines. The texts dealt with works of literature, which may well have been in the training material of the engines, but it also dealt with current affairs and news clips. Google presented a cell phone link to MT. Last week, the BBC conducted a new experiment to test how people and the media can make use and really broaden the horizons by using MT, even if only a in a very generalist way. MT users seemed happy with 80%-90% accuracy. A couple of findings I thought were […]

Comment to SDL’s “Sharing Data between Companies – is it the Holy Grail?”

by Manuel Herranz Eye openers about data sharing (or data mixing) abound nowadays. The kick start for TM leveraging, automation and faster solutions has come from outside our beloved language industry in the shape of – algorithms that create language (SMT) and their application/business by players inside and outside the industry (from Google Translate to new MT entrants and offsprings) – a credit crunch and a financial crisis that is leading companies to rethink the unthinkable A few times (exceptions) language professionals have joined to actually innovate and come up with something really new, mostly crowdsourcing, in translation, in frameworks, in workflows. Never mind, it is seldom the norm that busy people have the time to innovate. It takes a shot from outside a particular industry to shake the foundations or to force to change things. (Let’s assume from a positivist point of view that change is for the better). […]

Machine Translation for Urdu – English

If Urdu/English machine translation combination is accurate up to 85%, what is stopping the development into other, more common pairs? In our opinion, it is not just “the service”, online or otherwise, but the “application”. New startups like Tradukka may mimic the way Google Translator works, better or worse, and claim to be faster. As it was discussed and learnt during the recent TAUS Summit in Portland, what we need is ways of integrating machine-translation into existing workflows, from TMS to community/crowdsourcing projects. Whilst these efforts are admirable, language output also requires integration with current CAT systems in order to leverage as much legacy as possible. And that is what we do a PangeaMT - integrate statistical machine-translation in an open source TMX format to leverage both. Quoting the article from The Nation “ISLAMABAD (APP) – ‘English is no more a barrier to learn and use computer’ as National Language Authority (NLA) has […]

Website Globalization Sponsorship

Pangeanic will sponsor the Website Globalization Conference that will take place in Barcelona in September 2007 where Sony Europe, Real Madrid, Amadeus, Unilever and HP will showcase their strategies. The globalization of every business goes hand-in-hand with the need of professional linguistic support. With companies increasingly earning their revenue from the world’s rather than from national markets, the choice of web workflows, technologies, globalization partners and support is paramount. IQPC’s panel of experts and practitioners will share their experience “from the trenches”: how to structure an organization to effectively manage Web globalization, how to leverage technology, and how to design, implement and maintain globalization processes to maximize ROI and achieve consistent global branding. Key Facts and Figures on Web Site Globalization: Just 30% of all Web users speak English as a native language—this number continues to fall. Source: Miniwatts Marketing Group, 2006 More than 13% of Web users speak Chinese—this number is […]