6 min read
09/12/2025
Pangeanic Featured in the Cervantes Institute’s 2025 Global Spanish Report
We are proud to share that Pangeanic has been formally recognized in the prestigious Instituto Cervantes 2025 annual report, Spanish: A Language to the World. This influential publication—widely regarded as the global benchmark for understanding the reach, impact, and evolution of the Spanish language—highlights our pioneering contributions to NLP, secure machine translation, privacy-by-design AI, and multilingual communication technologies. For our team, this acknowledgment is both an honor and a reaffirmation of two decades of commitment to building trustworthy, language-centric AI for society.
The Instituto Cervantes is Spain’s official public institution dedicated to promoting the Spanish language and Hispanic cultures worldwide. With more than 90 centers across over 40 countries, it is the world’s leading authority on Spanish language education, linguistic research, and cultural diplomacy. The institution produces the annual report Spanish: A Language to the World, considered the global benchmark for understanding the demographic, cultural, and technological evolution of Spanish, and collaborates with governments, universities, and international organizations to strengthen the global presence and prestige of the language.
This distinction follows another major institutional milestone: in summer 2024, Spain’s Ministry of Science and Innovation awarded Pangeanic the Innovative SME Seal, certifying the company as a national AI Lab and placing its R&D activity among the most advanced in the Spanish innovation ecosystem.
A Brief Summary of the Cervantes Institute Report
Spanish: A Language to the World 2025 offers a comprehensive analysis of global Spanish-speaker demographics, AI’s impact on language technologies, terminology infrastructures, and the evolution of Spanish in digital ecosystems. It estimates that the global community of Spanish speakers now surpasses 635 million people, including 520 million native speakers, making Spanish the third-largest mother tongue globally.
The report dedicates particular attention to artificial intelligence and the Spanish language, documenting how companies like Pangeanic are shaping language resources, privacy-first AI, and multilingual tools used by public administrations and global organizations.
Why Pangeanic Was Highlighted: A Track Record of Pioneering AI for Society
Pangeanic’s mention in the Instituto Cervantes global report is not incidental—it reflects a long-standing pattern of delivering AI that solves real societal, cultural, and institutional challenges. Over the past decade, the company has consistently transformed advanced NLP research into practical systems that strengthen public institutions, expand access to information, and preserve linguistic diversity across Europe. In a moment when AI systems shape how citizens interact with governments, public records, and cultural heritage, Pangeanic has become a trusted architect of multilingual digital infrastructure.
- Modernizing Democratic Institutions with AI
One of the clearest examples is Pangeanic’s role in helping the Spanish Parliament step into the multilingual digital age. By deploying AI-powered transcription and translation pipelines capable of processing Spanish and co-official languages such as Catalan, Basque, and Galician, Pangeanic supported a historic modernization effort: making parliamentary debates more accessible, transparent, and inclusive.
Where other providers focus on generic large-language-model deployments, Pangeanic built domain-adapted, privacy-controlled systems capable of running within sensitive governmental environments—demonstrating that European institutions can adopt advanced AI without compromising sovereignty or linguistic rights. - Sustaining Multilingualism in Europe’s Digital Public Sphere
Across the EU, Pangeanic has taken leading positions in projects aimed at ensuring that linguistic diversity remains an asset—not an obstacle—in the digital transition.
In the NTEU project, the company delivered more than 500 high-quality neural translation engines, establishing a new benchmark for public-sector translation quality. These engines power internal workflows across EU bodies, enabling equal access to documents for all member states.
In the MOSAIC initiative, Pangeanic’s technologies help break language barriers in public broadcasting, offering scalable multilingual subtitling and content enrichment for audiovisual media. This ensures that European citizens can engage with cultural and political content regardless of their native language.
In cultural preservation, Pangeanic’s AI is at the core of Europeana and other heritage-digitisation efforts, where massive archives of books, manuscripts, and historical recordings require semantic enrichment, OCR, translation, and metadata generation. This work resonates with the Cervantes report’s emphasis on Spanish as a global cultural asset—showing how the language can remain digitally competitive thanks to robust AI infrastructure.
Together, these efforts position Pangeanic not just as a technology vendor, but as a cultural steward, helping Europe build the multilingual backbone of its future digital commons. - Breaking New Ground in Privacy-First AI Research
The Cervantes report also highlights the importance of secure, ethical AI for the sustainability of Spanish and other languages in digital ecosystems. Here, Pangeanic stands out as one of the few European companies delivering truly privacy-centric NLP.
A prime example is the recently completed R&D project on Retrieval-Augmented Generative AI for Privacy-Controlled Machine Translation, co-funded by the EU and CDTI. Instead of relying on opaque LLMs, Pangeanic develops architectures where sensitive data never leaves the organization’s environment, and retrieval mechanisms ensure contextual accuracy without sacrificing confidentiality. These small models are changing the way users, government, public administrations and industry approach AI and data sovereignty at their organizations, controlling processes and not releasing their knowledge and data to third party applications.
This research continues the legacy of solutions like Masker, the company’s state-of-the-art anonymization platform adopted by the European Commission in Luxembourg. It also aligns with the growing demand for AI systems that respect legal, linguistic, and cultural boundaries while delivering enterprise-grade performance.
In a decade defined by discussions about data sovereignty, algorithmic risk, and the role of AI in public services, Pangeanic has become a reference point for trustworthy, private multilingual AI.
Retrieval-Augmented Generative AI for Privacy-Controlled Machine Translation
This project, co-funded by Spain’s CDTI and the European Union, produced new methods for generating enterprise-grade translations with zero exposure of sensitive content, an area increasingly important for government, healthcare, and regulated industries.
Moreover, the speed of AI translation services allows companies to keep up with the fast-paced nature of digital communication. In an era where information is disseminated instantly, being able to translate content quickly ensures that businesses remain competitive and responsive. The reduced time-to-market for translated content can open new avenues for business expansion and customer engagement.
Also recognized as an AI Lab: Core language technologies behind the distinction
The Spanish Ministry’s Innovative SME / AI Lab distinction formally acknowledges Pangeanic’s proprietary ecosystem of AI technologies, including:
- Deep Adaptive AI Translation®
Reaches up to 90–95% human parity by learning from domain-specific data and client terminology. - The ECO Platform
A modular NLP suite offering enterprise translation, anonymization, summarization, data classification, and API-based automation. - ECOChat
A secure multilingual chatbot built on Retrieval-Augmented Generation (RAG), enabling organizations to interact privately with their knowledge bases. - Masker®
Pangeanic’s GDPR-compliant anonymization engine—one of Europe’s most widely adopted privacy solutions for text, documents, and large datasets.
These technologies are not only widely used in industry but also contribute to Spanish and European digital sovereignty, one of the themes highlighted in the Cervantes report when examining AI’s role in the future of Spanish as a global language.
Keep learning:
Pangeanic’s Deep Adaptive AI Technology Revolutionizes Translation for BYD AUTO JAPANBYD Auto Japan - Saving 70% of time in translation management
The Future: Beyond translation, toward language iIntelligence
As Pangeanic CEO Manuel Herranz stated at the TAUS Massively Multilingual Summit 2025:
“The future of language AI is not only translation, it is becoming the infrastructure of reasoning for organizations.”
This vision aligns with Europe's shift, and governments worldwide toward:
- agentic AI systems,
- multilingual RAG architectures,
- privacy-preserving AI,
- and domain-adapted LLM workflows.
Pangeanic’s mission is clear: to build secure, ethical, and multilingual AI that enhances understanding, protects data sovereignty, and expands equitable access to knowledge across languages and cultures.
Key takeaways
The dual recognition from the Instituto Cervantes and the Ministry of Science and Innovation confirms Pangeanic’s status as a leading force in European AI. It validates two decades of continuous innovation and positions the company at the forefront of global efforts to fuse linguistic diversity with technological excellence.
As the Cervantes report underscores, the future of Spanish (and multilingual communication more broadly) will depend on the responsible development of advanced AI technologies. Pangeanic is proud to be one of the organizations shaping that future.
Frequently Asked Questions (FAQ)
Who is the Instituto Cervantes?
The Instituto Cervantes is Spain’s official public institution dedicated to promoting the Spanish language and Hispanic cultures worldwide. With a presence in more than 40 countries, it is the leading global authority for Spanish language education, linguistic research, and cultural diplomacy.
What is the report Spanish: A Language to the World 2025?
Spanish: A Language to the World 2025 is the Instituto Cervantes’ annual flagship report that analyzes global Spanish-speaking demographics, economic and cultural impact, and the role of Spanish in technology and digital ecosystems, including artificial intelligence.
Why was Pangeanic featured in the Cervantes Institute report?
Pangeanic was featured for its pioneering work in NLP, secure machine translation, privacy-by-design AI, and multilingual communication technologies, as well as its contribution to large European projects that support public institutions, cultural heritage, and digital multilingualism.
What does the Innovative SME / AI Lab seal mean for Pangeanic?
The Innovative SME / AI Lab seal, awarded by Spain’s Ministry of Science and Innovation, formally recognizes Pangeanic as a highly innovative company with strong, ongoing investment in R&D. It places Pangeanic among Spain’s most advanced AI laboratories and provides strategic advantages in funding and collaboration.
Which flagship AI projects contributed to this recognition?
Key projects include NTEU (Neural Translation Engines for the EU), MOSAIC for multilingual public broadcasting, Europeana-related cultural heritage digitisation, and the R&D project “Retrieval-Augmented Generative AI for Privacy-Controlled Machine Translation,” co-funded by CDTI and the European Union.
How is Pangeanic helping to modernize democratic institutions?
Pangeanic deploys AI-powered transcription and translation pipelines, including for the Spanish Parliament, to process Spanish and co-official languages. These systems make debates more accessible, transparent, and inclusive while respecting sovereignty and linguistic rights.
How does Pangeanic address privacy and data sovereignty in AI?
Pangeanic designs privacy-first architectures where sensitive data never leaves the organization’s environment. Retrieval-augmented generation, on-premise deployments, and tools like Masker® ensure GDPR compliance and full control over data, models, and knowledge bases.
What is Deep Adaptive AI Translation®?
Deep Adaptive AI Translation® is Pangeanic’s neural MT technology that adapts to a client’s domain, terminology, and style, reaching up to 90–95% parity with human translation quality for many use cases.
What is Masker® and who uses it?
Masker® is Pangeanic’s GDPR-compliant anonymization engine for text and documents. It is used by European institutions and enterprises, including deployments for the European Commission in Luxembourg, to protect personal data while enabling analytics and AI workflows.
How can organizations work with Pangeanic after this recognition?
Organizations can engage with Pangeanic through its ECO Platform, ECOChat, Deep Adaptive AI Translation®, and custom AI and data projects. The company supports governments, public administrations, and enterprises that require secure, multilingual, and privacy-preserving AI solutions.

