Gerold Schneider

8050 Zürich
Campus Oerlikon
Gerold Schneider is Titulary Professor of Computational Linguistics and co-coordinator of LiRI's service area "Natural Language Processing". His doctoral degree is on large-scale dependency parsing, his habilitation on using computational models for corpus linguistics. His research interests include corpus linguistics, cognitive linguistics, statistical approaches, Digital Humanities, learner language, text mining, automated content analysis and language modeling. He has published over 130 articles on these topics, including a book on statistics for linguists available here.
He also works with NLP methods and hate speech detection for the URPP Digital Religion(s) project. Find out more about Gerolds work on his GoogleScholar page or his personal webpage.
Publications
ZORA Publication List
Publications
-
Investigating Linguistic Abilities of LLMs for Native Language Identification. In: Proceedings of the 14th Workshop on NLP for Computer Assisted Language Learning. 2025., Talin, Estonia, 5 März 2025.
-
Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech. In: The 31st International Conference on Computational Linguistics, Abu Dhabi, UAE, 19 January 2025 - 24 January 2025. Association for Computational Linguistics, 1850-1864.
-
Digital Dickens: An automated content analysis of Charles Dickens’ novels. In: Buschfeld, Sarah; Ronan, Patricia; Neumaier, Theresa; Wellinghoff, Andreas; Westermayer, Lisa. Crossing Boundaries through Corpora: Innovative corpus approaches within and beyond linguistics. Amsterdam: John Benjamins Publishing, 62-98.
-
Automatically detecting directives with SPICE Ireland. In: Schweinberger, Martin; Ronan, Patricia. Socio-Pragmatic Variation in Ireland: Using Pragmatic Variation to Construct Social Identities. Berlin: De Gruyter, 205-234.
-
The LiRI Corpus Platform. In: CLARIN Annual Conference 2023, Leuven, Belgium, 16 October 2023 - 18 October 2023. Linköping University Electronic Press, 62-75.
-
Evaluating Transformers on the Ethical Question of Euthanasia. In: SwissText 2024, Chur, Switzerland, 10 Juni 2024 - 11 Juni 2024, 241-246.
-
The Influence of Automatic Speech Recognition on Linguistic Features and Automatic Alzheimer’s Disease Detection from Spontaneous Speech. In: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italy, 20 May 2024 - 25 May 2024. Association for Computational Linguistics, 15955-15969.
-
Text Analytics for Corpus Linguistics and Digital Humanities: Simple R Scripts and Tools. London: Bloomsbury Academic.
-
The Visualisation and Evaluation of Semantic and Conceptual Maps. In: Laitinen, Mikko; Tyrkkö, Jukka. Linguistics across Disciplinary Borders: The March of Data. London: Bloomsbury Publishing, 67-94.
-
Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset. In: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Mexico City, Mexico, 1 June 2024. Association for Computational Linguistics, 4405-4424.
-
Native Language Identification Improves Authorship Attribution. In: Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), Trento, Italy, 2024. Association for Computational Linguistics, 289-296.
-
Investigating child language acquisition from a joint perspective: A comparison of traditional and new L1 speakers of English. In: Schmalz, Mirjam; Vida-Mannl, Manuela; Buschfeld, Sarah. Acquisition and Variation in World Englishes: Bridging Paradigms and Rethinking Approaches. Berlin: De Gruyter, 133-157.
-
Turkish Native Language Identification. In: 6th International Conference on Natural Language and Speech Processing (ICNLSP-2023), virtual, 16 December 2023 - 17 December 2023, 303-307.
-
Exploring Hybrid Linguistic Features for Turkish Text Readability. In: 6th International Conference on Natural Language and Speech Processing (ICNLSP-2023), virtual, 16 December 2023 - 17 December 2023, 223-232.
-
The LiRI Corpus Platform. In: CLARIN Annual Conference 2023, Leuven, Belgium, 16 October 2023 - 18 October 2023. CLARIN ERIC, 145-149.
-
“To boldly go where no man has gone before”: how iconic is the Star Trek split infinitive?. Linguistics Vanguard, 9(s3):247-255.
-
Exploring the role of AI in classifying, analyzing, and generating case reports on assisted suicide cases: feasibility and ethical implications. Frontiers in Artificial Intelligence, 6:1328865.
-
Colloquialisation, compression and democratisation in British parliamentary debates. In: Korhonen, Minna; Kotze, Haidee; Tyrkkö, Jukka. Exploring Language and Society with Big Data: Parliamentary discourse across time and space. Amsterdam: John Benjamins Publishing, 336-372.
-
Swissdox@ LiRI–a large database of media articles made accessible to researchers. In: CLARIN Annual Conference 2023, Leuven, 16 October 2023 - 18 October 2023. CLARIN ERIC, 111-115.