Navigation auf uzh.ch

Suche

LiRI – Linguistic Research Infrastructure

Upgrading the linguistic ORD-ecosystem | UpLORD

CLARIN-CH Training Sessions in May–June 2025: Swiss-AL, LaRS, Swissdox@LiRI, LCP and more!

We invite you to participate in a series of training sessions designed to introduce and deepen your knowledge of the CLARIN-CH ecosystem and its resources for language data research. Covering topics such as corpus querying, media data collection, multimodal data modeling, and data sharing, these sessions will provide hands-on insights into key tools and methodologies.

The sessions run from March to June 2025, featuring expert speakers from the CLARIN-CH network. Whether you are new to these tools or looking to refine your skills, we welcome you to join us!

Sessions are held every second Monday, from 15:30 to 17:00, online (via Zoom). For detailed information, please refer to the CLARIN-CH news page


Description

UpLORD is a swissuniversities ORD-funded 2-year project (2023-2024) hosted by the University of Zurich, with the support of the Zurich University Library and the CLARIN-CH Consortium. Since 2018, a consortium of partners has been working on building a national ecosystem of infrastructures, which covers the whole linguistic data lifecycle according to ORD requirements (FAIR principles: Findable, Accessible, Interoperable, Reusable) from data generating, processing and analyzing to data sharing and archiving. This ecosystem includes the national technology platform LiRI and the national repository for publishing and archiving linguistic data (SWISSUbase) as service providers, a database of Swiss media texts and a platform for hosting of and searching in large text and audio/video corpora. 

The project focuses on upgrading workflows and interoperability of existing infrastructure services, establishing working groups on the national level, documenting and promoting best practices, raising awareness and training about ORD practices in the context of teaching, research and publishing, and building a robust practice of data curation. In the long-term, this project will significantly contribute to a strong foundation for a sustainable ORD strategy for linguistic data in Switzerland. Here (PDF, 172 KB) you can find details about the Steering Committee and the governance of the project.

Principal Investigators

Prof. Dr. Noah Bubenhofer (LiRI)

Dr. Andrea Malits (Universitätsbibliothek Zürich)

Dr. Cristina Grisot (CLARIN-CH)

Project coordinator: Dr. Letizia Volpin

Main outcomes (October 2024)

In the context of the requirements of Open Science and of FAIR principles, on the one hand, and of that of more challenging data sets (such as, sensitive or with copyright issues), on the other hand, we identified several gaps regarding the current situation in Switzerland that are going to be addressed thanks to ORD project. Here (PDF, 82 KB) you can find detailed information about the gaps we identified and how we have addressed the gaps from the launch of the project until October 2024. Here is a panorama of our main outcomes up to October 2024:

1. Upgrading workflows and interoperability of existing infrastructure services

  • Implementation of data curation workflow (LaRS@SWISSUbase)
  • Construction of API to automatically deposit datasets on SWISSUbase and new SWISSUbase Info Website have been launched: info.swissubase.ch
  • Construction of a Swiss metadata profile that is interoperable with the system used by CLARIN (CMDI)
  • Construction of API for SWISSUbase to be harvested by the CLARIN Virtual Language Observatory
  • Building of data converters (into standard formats) (LiRI NLP)
  • Development of a national ORD corpus platform for text, video and audio language data: LCP@LiRI with the official presentation of the LiRI Corpus platform on November 1st 2024

2. Identifying and promoting standard data formats

3. CLARIN-CH Working Groups

  • Increasing the FAIRness of (Swiss) Learner Corpora and SLA
  • Management of Sensitive and Personal data, Ethical and Legal issues for linguistic data

4. CLARIN-CH Documentation Platform

5. Hands-on workshops and trainings

6. CLARIN-CH Day on September 9, 2024

Dissemination

  1. Bubenhofer, N., Malits, A., Strebel, S., Gräen, J., Buerli, S., & Grisot, C. (2023, December). Building and consolidating a FAIR-compliant ecosystem of infrastructures. In CLARIN Annual Conference Proceedings (p. 95-99)
  2. Schaber, J., Graën, J., McDonald, D., Mustac, I., Rajovic, N., Schneider, G., ... & Kontino, T. (2023, October). The LiRI Corpus Platform. In CLARIN annual conference proceedings (pp. 145-149). 
  3. Schaber, J., Graën, J., Mustač, I., Rajović, N., Schneider, G., Zehr, J., & Bubenhofer, N. Swissdox@ LiRI–a large database of media articles made accessible to researchers. CLARIN annual conference proceedings (pp. 111-115). 
  4. Poster at Open Access Week 2023 at UZH 

  5. Presentation at 2023 SWISSUbase Annual event at UZH (November 2023)

  6. Presentation of UpLORD project (PDF, 1 MB) at 2024 CLARIN-CH Day at University of Neuchâtel (September 9, 2024) 

  7. Swissuniversities P5 Open Science closing event (November 18, 2024) 

Weiterführende Informationen

CLARIN-CH logo with the text "Common Language Resources and Technology Infrastructure"

Common Language Resources and Technology Infrastructure

More about Common Language Resources and Technology Infrastructure

CLARIN is a pan-European research infrastructure aiming to render accessible all digital language resources and tools from all over Europe through a single sign-on online environment. Swiss academic institutions founded the CLARIN-CH consortium in 2020. 

SWISSUbase is a national repository that facilitates access to research data and projects across different disciplines and provides Swiss research institutions with a reliable data infrastructure.

Call for participation

Are you a member of the Swiss scientific community working with language resources and you feel concerned about the topics addressed in this project?

Would you like to get involved?

Please drop an email to Cristina Grisot.