Gerold Schneider
8050 Zürich
Campus Oerlikon
Gerold Schneider is Titulary Professor of Computational Linguistics and co-coordinator of LiRI's service area "Natural Language Processing". His doctoral degree is on large-scale dependency parsing, his habilitation on using computational models for corpus linguistics. His research interests include corpus linguistics, cognitive linguistics, statistical approaches, Digital Humanities, learner language, text mining, automated content analysis and language modeling. He has published over 130 articles on these topics, including a book on statistics for linguists available here.
He also works with NLP methods and hate speech detection for the URPP Digital Religion(s) project. Find out more about Gerolds work on his GoogleScholar page or his personal webpage.
Publications
ZORA Publication List
Publications
-
SpiritRAG: A Q&A System for Religion and Spirituality in the United Nations Archive (I. Habernal, P. Schulam, & J. Tiedemann, Eds.; pp. 26–41). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.emnlp-demos.3
-
ESCMID workshop: Artificial intelligence and machine learning in medical microbiology diagnostics Microbes and Infection, 105562. https://doi.org/10.1016/j.micinf.2025.105562
-
In patients’ words: natural language processing of reports from patients experiencing orofacial pain and dysfunction Journal of Headache and Pain, 26(1), 172. https://doi.org/10.1186/s10194-025-02095-z
-
Detecting and Mapping Hate in Religious Contexts In T. Schlag & K. Yadav (Eds.), Religious Communication, Interaction and Transformation in a Culture of Digitality : Insights into the Zurich University Research Priority Program “Digital Religion(s)” (pp. 153–183). De Gruyter. https://doi.org/10.1515/9783111721729
-
The ‘Spiritual’ and the ‘Religious’ in the Twittersphere: A Topic Model and Semantic Map Journal of Religion, Media & Digital Culture, 14(1), 1–22. https://doi.org/10.1163/21659214-bja10123
-
Investigating Linguistic Abilities of LLMs for Native Language Identification Proceedings of the 14th Workshop on NLP for Computer Assisted Language Learning. 2025., Talin. https://spraakbanken.gu.se/en/research/themes/icall/nlp4call-workshop-series/nlp4call2025
-
Refining Established Practices for Research Question Definition to Foster Interdisciplinary Research Skills in a Digital Age: Consensus Study With Nominal Group Technique JMIR Medical Education, 11, e56369. https://doi.org/10.2196/56369
-
Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech (O. Rambow, L. Wanner, M. Apidianaki, H. Al-Khalifa, B. Di Eugenio, & S. Schockaert, Eds.; pp. 1850–1864). Association for Computational Linguistics. https://aclanthology.org/2025.coling-main.126/
-
Digital Dickens: An automated content analysis of Charles Dickens’ novels In: Buschfeld, Sarah; Ronan, Patricia; Neumaier, Theresa; Wellinghoff, Andreas; Westermayer, Lisa . Crossing Boundaries through Corpora: Innovative corpus approaches within and beyond linguistics. Amsterdam: John Benjamins Publishing, 62-98.
-
Automatically detecting directives with SPICE Ireland In: Schweinberger, Martin; Ronan, Patricia . Socio-Pragmatic Variation in Ireland: Using Pragmatic Variation to Construct Social Identities. Berlin: De Gruyter, 205-234.
-
The LiRI Corpus Platform In: CLARIN Annual Conference 2023, Leuven, Belgium, 16 October 2023 - 18 October 2023. Linköping University Electronic Press, 62-75.
-
Evaluating Transformers on the Ethical Question of Euthanasia In: SwissText 2024, Chur, Switzerland, 10 Juni 2024 - 11 Juni 2024, 241-246.
-
The Influence of Automatic Speech Recognition on Linguistic Features and Automatic Alzheimer’s Disease Detection from Spontaneous Speech In: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italy, 20 May 2024 - 25 May 2024. Association for Computational Linguistics, 15955-15969.
-
Text Analytics for Corpus Linguistics and Digital Humanities: Simple R Scripts and Tools London: Bloomsbury Academic.
-
The Visualisation and Evaluation of Semantic and Conceptual Maps In: Laitinen, Mikko; Tyrkkö, Jukka . Linguistics across Disciplinary Borders: The March of Data. London: Bloomsbury Publishing, 67-94.
-
Investigating child language acquisition from a joint perspective: A comparison of traditional and new L1 speakers of English In: Schmalz, Mirjam; Vida-Mannl, Manuela; Buschfeld, Sarah . Acquisition and Variation in World Englishes: Bridging Paradigms and Rethinking Approaches. Berlin: De Gruyter, 133-157.
-
Native Language Identification Improves Authorship Attribution In: Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), Trento, Italy, 2024. Association for Computational Linguistics, 289-296.
-
Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset In: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Mexico City, Mexico, 1 June 2024. Association for Computational Linguistics, 4405-4424.
-
Exploring Hybrid Linguistic Features for Turkish Text Readability In: 6th International Conference on Natural Language and Speech Processing (ICNLSP-2023), virtual, 16 December 2023 - 17 December 2023, 223-232.
-
Turkish Native Language Identification In: 6th International Conference on Natural Language and Speech Processing (ICNLSP-2023), virtual, 16 December 2023 - 17 December 2023, 303-307.