Towards the Primary Platform for
Language Technologies in Europe

Towards Digital Language Equality: META-FORUM 2022 presents project results and newest release of European Language Grid

Three years after the last in-person meeting, the annual European conference on Language Technology returns to Brussels: META-FORUM 2022 takes place as a hybrid event on 8 and 9 June and focuses on “Joining the European Language Grid – Together Towards Digital Language Equality”. The conference presents the results of the European Language Grid and European Language Equality projects, but also includes highlights such as an LT industry session and insights into the needs and future demands of the European Language Technology community. Registration for online and on-site participation is open and, as usual, free of charge.

The 11th META-FORUM will be a very special edition of the international conference series on powerful and innovative language technologies for the information society: With both projects, European Language Grid and European Language Equality, coming to an end in June, the two-day event will be divided into presentations of the final results of the two initiatives, their continuation and the next steps. META-FORUM 2022: Joining the European Language Grid – Together Towards European Language Equality takes place on 8 and 9 June 2022 in Brussels as well as online and is as usual free of charge.

The first conference day will be devoted to the European Language Grid as well as the European Language Technology (LT) industry, with highlights such as a demonstration of the newest ELG platform and its features and use cases, a panel discussion on language-centric Artificial Intelligence (AI), an LT industry session, an outlook on the future of ELG and more. On the second day, the focus lies on the results and continuation of the European Language Equality project. The project team will present the measurement and monitoring of Digital Language Equality over the next years, in order to achieve the goal of full Digital Language Equality in Europe by 2030. A strategic research agenda according to this goal will be presented at META-FORUM 2022 and illustrated with numerous project results, such as the state of national LT and AI strategies in Europe and a demonstration of the Digital Language Equality Metric and Dashboard.

Georg Rehm, coordinator of ELG and co-coordinator of ELE: “We are extremely happy to return to Brussels for META-FORUM 2022 to celebrate the successful end of our two projects. For both of them, this year’s conference is more of a milestone than a curtain call. With the newest release of the ELG platform, a wide array of findings on the state of Digital Language Equality in Europe and a range of speakers from industry and research, the 11th META-FORUM will be more insightful and – thanks to its hybrid form – more accessible to the European LT community than ever. For both projects, we will also present our plans for the next steps and continuation of the initiatives.”

The registration is, as usual, free of charge. For further information, the full programme and the opportunity to register, please visit www.meta-forum.eu. Current updates about the conference and the named initiatives are shared through the European Language Technology social media channels on Twitter and LinkedIn and featured in the ELT Newsletter.

Overview
META-FORUM 2022
Joining the European Language Grid – Towards Digital Language Equality
Hybrid conference
BluePoint Brussels (Boulevard Auguste Reyers 80, 1030 Brussels) and online
8-9 June 2022
Free of charge
Programme, info and registration:
www.meta-forum.eu

Contact
Prof. Dr. Georg Rehm
Coordinator European Language Grid, Co-Coordinator European Language Equality
German Research Center for Artificial Intelligence (DFKI)
Speech and Language Technology (SLT)
Georg.Rehm@dfki.de
+49 30 23895 1833


Doubling the database: How research on Digital Language Equality led to 6,000 new resources for the European Language Grid

Over the course of a weekend in the middle of January 2022, the European Language Grid (ELG) doubled in size. More than 6,000 new data resources, tools and services for 87 different languages were added to the ELG platform, pushing the ELG much closer to one of its central objectives: developing into a joint European language technology platform in which ideally all relevant language resources and technologies are registered. With the update, we are now confident that the majority of resources available in Europe can be found in and through the ELG, whether they are corpora, tools, conceptual resources or models. How did that happen? A look at the beginnings of ELG’s sister project, European Language Equality (ELE), might help.

Prospering languages in a digital world

The ELE project’s main goal is to achieve Digital Language Equality in Europe by 2030. According to the preliminary definition, Digital Language Equality describes the state in which all languages have the technological support and situational context necessary for them to continue to exist and to prosper as living languages in the digital age. While this definition paints a clear and desirable picture for the future of multilingualism in Europe, the main work was still lying ahead: developing a strategic research, innovation and implementation agenda and roadmap that leads towards this desired state.

One of the key parts of the strategy agenda is the DLE metric, a measure or quantified index that allows to compare the levels of digital readiness of and across Europe’s languages. This metric combines several factors about each language taken into consideration, such as the number of its speakers, its recognition in the EU, but most importantly the level of technological support it currently receives. In order to suggest how digital language equality can become a reality for all European languages, detailed knowledge about the current state of technological support for each language is necessary. But how does one gather this amount of data for 87 different languages?

Creating the primary platform for European language technology

The task was part of the ELE investigation into the current LT support for Europe’s languages, in which 33 project partners from different countries described the status quo of their respective language, based on empirical data and findings. In addition to these national institutions with expertise in language technology, several associations such as the European Language Equality Network (ELEN) and the European Civil Society Platform (ECSPM) focussed on smaller languages within the European Union. Altogether, the ELE consortium gathered metadata from around 1,000 organisations such as LT companies, universities and research institutions in a total of 87 different languages. 4,147 new data resources and 2,216 new tools and services were identified and their metadata documented.

These are new tools and resources because the approximately 6,000 resources gathered by the ELE consortium had not been available in the European Language Grid yet, which already consisted of more than 5,000 resources from the European LT landscape. Including the additional 6,000 resources collected by ELE, the ELG platform now provides information about more than 11,000 language technology resources – either as ELG-compatible services that can be downloaded and used directly through the ELG, or in the form of metadata including links to the original hosting platform.

All data leads to Athens

The import itself was handled by the Institute of Language and Speech Processing (ILSP) of the “Athena” Research Center in Greece. The team in Athens, which forms part of both projects, coordinated the metadata collection effort, ensured the compatibility with the ELG platform, homogenised and curated all the metadata records to prepare them for the import. The new import includes both public as well as on-demand data and services, hosted directly by their providers or through platforms such as Huggingface or GitHub.

The ELE resource import represents a prime example for the effective collaboration between the two projects and the reason why we consider them sister projects: the development of the strategic research, innovation and implementation agenda and roadmap for full digital language equality requires a comprehensive and empirical overview of the current technological support of Europe’s languages. While the European Language Grid provides exactly this kind of service, the new data in return pushes it much closer towards one of its central objectives in becoming the primary platform for European language technology.

Join the European Language Grid




How to use the ELG: Video tutorial for the European Language Grid

The ELG Video Tutorial hands you the basics of the European Language Grid: How to browse it, register, become a provider and upload, store and share resources like Language Technology tools and corpora. It also touches upon the ELG Python SDK, which is explained in detail in the ELG Documentation. For a shorter introduction into the functionalities and many advantages of the European Language Grid, have a look!