The fourth annual ELG conference
Joining the European Language Grid:
Together Towards Digital Language Equality
© Adobe Stock – Sergij Figurnyi
Project abbreviation: CEFRSERV
Project name: CEFR Labelling and Assessment Services
Project coordinator: Mark Breuker
Project consortium: EDIA
Funding: ELG (European Language Grid) Pilot Projects Open Call 2: 137.560 Euro
Project duration: 1 year
Main key words: authoring, CEFR, readability, content curation
Goal of the project: Develop a set of data collection and annotation tools to facilitate the creation of data sets for CEFR readability assessment.
Project abstract: This project aims to develop a set of data collection and annotation tools to facilitate the creation of data sets which can be used to train text classification models. These models can automatically assess a text’s reading difficulty against the Common European Framework of Reference (CEFR). The ability to accurately and consistently check the readability level of texts is crucial to authors and teachers. It will allow them to create and curate content that meets the needs of students with different backgrounds and skill levels.
EDIA already provides automated readability assessment technology (available as API and authoring tool) for the CEFR which is currently available for English. Through this project, additional languages will be supported (ie. Dutch, German, French and Spanish). As part of the project, we will also build an infrastructure that will pave the way for adding other languages in the future.