Roland Schäfer

Professor of Linguistics | German Grammar | Jena

Menu

Skip to content
  • Research
    • Projects
    • External Funding
    • Software
  • CV
    • Education
    • Employment
  • Teaching
    • General Linguistics
    • German Linguistics
    • English Linguistics
    • Computational Linguistics
    • Languages
  • Publications
    • Incubator
    • Books
    • Papers
    • Theses
    • Chapters and Encyclopedia Articles
  • Talks
  • Confs
    • Workshops
    • Tutorials/Courses
  • Refereeing
    • Journals
    • Edited Volumes
    • Books
    • Conferences
  • Impressum (DE)
  • Datenschutz (DE)

Token-level noise in large Web corpora and non-destructive normalization for linguistic applications (2013)

Felix Bildhauer & Roland Schäfer: Token-level noise in large Web corpora and non-destructive normalization for linguistic applications. Corpus Analysis with Noise in the Signal (CANS 2013). Corpus Linguistics 2013, Lancaster.

Post navigation

← Inflectional Alternations in German Weak Nouns (2013) Einführung in die germanistische Sprachwissenschaft →

Informationen zur Lehre

SE Morphologie und Lexikologie
VL Deutsche Syntax

VL Deutsche Graphematik
Modul Examensvorbereitung

What happened to webcorpora.org?

My Einführung in die grammatische Beschreibung was downloaded 99,864 times and is the second best-downloading monograph of LangSci Press (as of 31 October 2025). The fourth edition will be out shortly after both the 100,000th download and the 10th anniversary in 2025. [Information and Errata]

Recent Posts

  • Desintegration attributiver Adjektivphrasen (Zeitschrift für Sprachwissenschaft 2025)21 July 2025
  • What happened to webcorpora.org?25 May 2025
  • Statistical Inference for Everybody and a Linguist (in progress)12 January 2025
  • Between syntax and morphology (Glossa)8 May 2024
  • Bei Bedarf … (Praxis Deutsch)20 August 2023

Office Address

Prof. Dr. Roland Schäfer
Germanistische Sprachwissenschaft
Fürstengraben 30
07743 Jena

Email address

Richtlinen für Arbeiten

Empfehlungen für Emails

Sprechstunden

Secretary: Nadin Friebe