Gerold Schneider

Data scientist with focus on NLP and Human Language Technology

Office: AND-4.84

Andreasstrasse 15
8050 Zürich
Campus Oerlikon

Gerold Schneider is Titulary Professor of Computational Linguistics and co-coordinator of LiRI's service area "Natural Language Processing". His doctoral degree is on large-scale dependency parsing, his habilitation on using computational models for corpus linguistics. His research interests include corpus linguistics, cognitive linguistics, statistical approaches, Digital Humanities, learner language, text mining, automated content analysis and language modeling. He has published over 130 articles on these topics, including a book on statistics for linguists available here.

He also works with NLP methods and hate speech detection for the URPP Digital Religion(s) project. Find out more about Gerolds work on his GoogleScholar page or his personal webpage.

Publications

ZORA Publication List

Publications

Goldzycher, Janis; Schneider, Gerold (2022). Hypothesis Engineering for Zero-Shot Hate Speech Detection. In: Proceedings of the Third Workshop on Threat, Aggression and Cyberbullying (TRAC 2022), Gyeongju, Republic of Korea, 12 October 2022 - 17 October 2022. ACL, 75-90.
Reveilhac, Maud; Schneider, Gerold (2022). Comparing the coverage of the “marriage for all” vote on Twitter and in the newspapers. In: 2nd Workshop on Computational Linguistics for Political Text Analysis (CPSS-2022), Potsdam, Germany, 12 September 2022. CPSS, 55-62.
Schneider, Gerold (2022). Correlations and predictions of reading times using language models and surprisal. In: Krug, Manfred; Schützler, Ole; Vetter, Fabian; Werner, Valentin. Perspectives on Contemporary English : Structure, Variation, Cognition. Berlin, Bern, Bruxelles, New York, Oxford, Warszawa, Wien: Peter Lang, 209-243.
Schneider, Gerold (2022). Medical topics and style from 1500 to 2018. In: Hiltunen, Turo; Taavitsainen, Irma. Corpus pragmatic studies on the history of medical discourse. Amsterdam: Benjamins, 49-78.
Schneider, Gerold (2022). Recent changes in spoken British English according to spoken BNC2014. In: Flach, Susanne; Hilpert, Martin. Broadening the spectrum of corpus linguistics: New approaches to variability and change. Amsterdam: John Benjamins Publishing, 173-195.
Schneider, Gerold; Reveilhac, Maud (2022). Measuring Attitudes to Migration in the Media automatically with Complementary Data Sources and Methods. In: Ronan, Patricia; Ziegler, Evelyn. Approaches to Migration and Language Identity. Oxford, Bern, Berlin, Bruxelles, New York, Wien: Peter Lang, 207-252.
Schneider, Gerold (2022). Comparing data-driven to corpus-based approaches for diachronic variation: document-classification and overuse metrics. In: Schlüter, Julia; Schützler, Ole. Data and Methods in Corpus Linguistics: Comparative Approaches. Cambridge: Cambridge University Press, 291-322.
Schneider, Gerold (2022). Syntactic changes in verbal clauses and noun phrases from 1500 onwards. In: Los, Bettelou; Cowie, Claire; Honeybone, Patrick. English Historical Linguistics: Change in Structure and Meaning. Amsterdam: John Benjamins Publishing, 163-200.
Sedlakova, Jana; Daniore, Paola; Horn Wintsch, Andrea; Wolf, Markus; Stanikic, Mina; Haag, Christina; Sieber, Chloé; Schneider, Gerold; Staub, Kaspar; Ettlin, Dominik Alois; Grübner, Oliver; Rinaldi, Fabio; von Wyl, Viktor (2022). Challenges and best practices for digital unstructured data enrichment in health research: a systematic narrative review. medRxiv 22278137, Cold Spring Harbor Laboratory.
Luo, Minxia; Debelak, Rudolf; Schneider, Gerold; Martin, Mike; Demiray, Burcu (2021). With a little help from familiar interlocutors: real-world language use in young and older adults. Aging & Mental Health, 25(12):2310-2319.
Schneider, Gerold; Hundt, Marianne; Schreier, Daniel (2020). Pluralized non-count nouns across Englishes: a corpus-linguistic approach to dialect typology. Corpus Linguistics and Linguistic Theory, 16(3):515-546.
Luo, Minxia; Neysari, Mona; Schneider, Gerold; Martin, Mike; Demiray, Burcu (2020). Linear and Non-Linear Age Trajectories of Language Use: A Laboratory Observation Study of Couples' Conflict Conversations. Journals of Gerontology, Series B: Psychological Sciences and Social Sciences, 75(9):e206-e214.
Schneider, Gerold (2020). Changes in society and language: charting poverty. In: Rautinaho, Paula; Nurmi, Arja; Klemola, Juhani. Corpora and the changing society: studies in the evolution of English. Amsterdam: John Benjamins Publishing, 29-56.
Graën, Johannes; Alfter, David; Schneider, Gerold (2020). Using Multilingual Resources to Evaluate CEFRLex for Learner Applications. In: 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, 11 May 2020 - 16 May 2020. European Language Resources Association, 346-355.
Schneider, Gerold (2020). Spelling normalisation of Late Modern English: comparison and combination of VARD and character-based statistical machine translation. In: Kytö, Merja; Smitterberg, Eric. Late Modern English: novel encounters. Amsterdam: John Benjamins Publishing, 243-268.
Ronan, Patricia; Schneider, Gerold (2020). A Man who Was Just an Incredible Man, an Incredible Man: Age Factors and Coherence in Donald Trump’s Spontaneous Speech. In: Schneider, Ulrike; Eitelmann, Matthias. Linguistic Inquiries into Donald Trump’s Language : From ‘Fake News’ to ‘Tremendous Success’. London: Bloomsbury, 62-84.
Schneider, Gerold; Lauber, Max (2019). Statistics for Linguists: A patient, slow-paced introduction to statistics and to the programming language R. Zurich: Digitale Lehre und Forschung UZH.
Luo, Minxia; Schneider, Gerold; Martin, Mike; Demiray, Burcu (2019). Cognitive Aging Effects on Language Use in Real-Life Contexts: A Naturalistic Observation Study. In: The 41st Annual Meeting of the Cognitive Science Society, Montreal, QC, 24 July 2019 - 27 July 2019, CogSci.
Smith, Nick; Schneider, Gerold; Hoffmann, Sebastian; Lehmann, Hans Martin (2019). Enhancing the linguistic discovery potential of historical corpora: a twin-track approach using ARCHER. In: CL 2019 International Corpus Linguistics Conference, Cardiff, Wales, UK, 22 Juli 2019 - 26 Juli 2019, Gossip Theme.
Taavitsainen, Irma; Schneider, Gerold; Jones, Peter Murray (2019). Topics of eighteenth-century medical writing with triangulation of methods: LMEMT and the underlying reality. In: Taavitsainen, Irma; Hiltunen, Turo. Late Modern English medical texts: writing medicine in the eighteenth century (Including the LMEMT Corpus). Amsterdam: John Benjamins Publishing, 31-74.

LiRI - Linguistic Research Infrastructure

Quicklinks und Sprachwechsel

Main navigation

Gerold Schneider

Publications

ZORA Publication List

Publications

Pagination