Navigation auf uzh.ch

Suche

LiRI - Linguistic Research Infrastructure

FAIR-FI-LD

Description

Moving towards a national FAIR-compliant ecosystem of Federated Infrastructure for Language Data, short FAIR-FI-LD, is a swissuniversities ORD-funded 12 months project (July 2024-June 2025) hosted by the University of Zurich, with the participation of CLARIN-CH, LiRI, ZHAW and USI. 

In the last 5-10 years, Swiss higher education institutions (HEI) have been working on building national services for language data. They include, up to now, the Linguistic Research Infrastructure (UZH), the Swiss-AL Platform for Applied Sciences (ZHAW), a national repository for the publication and long-term preservation of language data LaRS@SWISSUbase (UNIL, UZH), and various smaller tools and services. These units however are not all interoperable, which reduces the potential for collaboration and data reuse. In addition, fields such as interactional linguistics or second language acquisition lack adequate infrastructure.

With the foundation of the CLARIN-CH consortium in 2020 (9 HEIs and the SAGW), the HEI's efforts took a new direction: Work together to build a FAIR-compliant, sustainable and expandable CLARIN-CH ecosystem of federated infrastructure to answer the needs of researchers and professionals using language data in Switzerland and beyond; an ecosystem that must be interoperable at the national and European levels.

Principal Investigators

  • Dr. Cristina Grisot (CLARIN-CH)
  • Prof. Dr. Noah Bubenhofer (LiRI)
  • Prof. Dr. Julia Krasselt (ZHAW)
  • Prof. Dr. Johanna Miecznikowski-Fuenfschilling (USI)
  • Project coordinator: Dr. Letizia Volpin

Goals

The present project aims at realizing important steps towards this mid- and long-term goal, in compliance with the Swiss ORD strategy,

  • by prototyping
    • interoperable underlying software using NLP techniques and exploratory AI techniques
    • harmonized metadata between the existing Swiss infrastructure components and the European CLARIN infrastructure
    • the CLARIN federated content search technology (FCS) to query each component of the infrastructure
    • a FCS multilingual landing page hosted on the CLARIN-CH website
    • a frontend of the Videoscope@LiRI environment to visualize, query and analyze multimodal talk-in-interaction data, hosted at USI
  • by producing
    • documentation and training to support the use of the infrastructure and inform about legal and ethical issues related to language data in the context of Open Science
  • by planning
    • the future collaboration with further stakeholders and aggregation of further tools and services
    • a final workshop (June 2025) to disseminate information about the project outputs

Weiterführende Informationen

Call for participation

Are you a member of the Swiss scientific community working with language resources and you feel concerned about the topics addressed in this project?

Would you like to get involved?

Please drop an email to Cristina Grisot.

CLARIN-CH logo with the text "Common Language Resources and Technology Infrastructure"

Common Language Resources and Technology Infrastructure

More about Common Language Resources and Technology Infrastructure

CLARIN – Common Language Resources and Technology Infrastructure – is a pan-European research infrastructure aiming to render accessible all digital language resources and tools from all over Europe through a single sign-on online environment. Several Swiss academic institutions have manifested their intention to join CLARIN, first as an Observer member and later as Full member. For this, they have founded the consortium CLARIN-CH in 2020.