Category Archives: Publications
The plural interpretability of German linking elements (Schäfer & Pankratz, submitted)
Univerbation of N+V constructions in German is probabilistic and driven by morphosyntactic prototypes (Schäfer & Sayatz)
This publication is in the INCUBATOR section.
Roland Schäfer & Ulrike Sayatz (in preparation) Univerbation of N+V constructions in German is probabilistic and driven by morphosyntactic prototypes. To be submitted in Q1 or Q2/2018.
A Usage-based Account of Sequences of Oblique Attributive Adjectives in German
This publication is in the INCUBATOR section.
Roland Schäfer (in preparation) A Usage-based Account of Sequences of Oblique Attributive Adjectives in German. To be submitted in Q1 or Q2/2017.
Full dataset and scripts on GitHub. (To be released after the paper has been accepted.)
Statistical Modeling in Linguistics
This publication is in the INCUBATOR section.
Roland Schäfer (in preparation) Statistical Modeling in Linguistics. To be submitted in Q2/2018 to Language Science Press.
Generalised Linear Mixed Models (Practical Handbook of Corpus Linguistics)
Wie viel Grammatik braucht das Germanistikstudium? (2017)
Roland Schäfer & Ulrike Sayatz (2017) Wie viel Grammatik braucht das Germanistikstudium? Zeitschrift für germanistische Linguistik 45(2). [BibTeX]
Ulrike Sayatz and I hold the copyright to this paper because we refused to transfer our rights to De Gruyter. We simply did not sign the form. You can do the same!
Punctuation and Syntactic Structure in “obwohl” and “weil” Clauses in Nonstandard Written German (2016)
Induktive Topikmodellierung und extrinsische Topikdomänen (2017)
Introduction to the Grammar of German, 2nd edition (2016)
Roland Schäfer. 2016. Einführung in die grammatische Beschreibung des Deutschen (Introduction to the Grammar of German), 2nd, revised edition. Language Science Press, Textbooks in Language Sciences, No. 2. [BibTeX]
The book can be downloaded for free under an open access license (CC-BY). If you use the book, why not do me a favor and tweet about it to @codeslapper or send me a message? Or visit the book website grammatick.de.
Buy paperback at Amazon.de (Germany).
Prototype-driven Alternations: The Case of German Weak Nouns (2016, ahead of print)
Proceedings of the 10th Web as Corpus Workshop and the EmpiriST shared task (2016)
Paul Cook, Stefan Evert, Roland Schäfer and Egon Stemle (eds) 2016. Proceedings of the 10th Web as Corpus Workshop (WAC-X). ACL: Stroudsburg. [BibTeX]
Automatic Classification by Topic Domain for Meta Data Generation, Web Corpus Evaluation, and Corpus Comparison (2016)
On Bias-free Crawling and Representative Web Corpora (2016)
Accurate and Efficient General-Purpose Boilerplate Detection for Crawled Web Corpora (2016)
CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws (2016)
Processing and Querying Large Web Corpora with the COW14 Architecture (2015)
Roland Schäfer. Processing and Querying Large Web Corpora with the COW14 Architecture. In Proceedings of Challenges in the Management of Large Corpora (CMLC-3) (IDS publication server). 28–34. [BibTeX]
Die Kurzformen des Indefinitartikels im Deutschen (2014)
Roland Schäfer & Ulrike Sayatz (2014) Die Kurzformen des Indefinitartikels im Deutschen (Cliticization of the indefinite article in German). Zeitschrift für Sprachwissenschaft (ZS) 33(2). [BibTeX]
Focused Web Corpus Crawling (2014)
Proceedings of the 9th Web as Corpus Workshop (2014)
Felix Bildhauer & Roland Schäfer (eds) 2014. Proceedings of the 9th Web as Corpus Workshop (WAC-9). ACL: Stroudsburg. [BibTeX]
The Good, the Bad, and the Hazy: Design Decisions in Web Corpus Construction (2013)
Web Corpus Construction (2013)
Roland Schäfer & Felix Bildhauer (2013) Web Corpus Construction. Morgan & Claypool. [BibTeX]
Websites: Morgan & Claypool (official), companion website (additional information, errata, etc.)
Reviews: Serge Sharoff in Computational Linguistics 41(1) (2015), Mats Wirén in Nordic Journal of Linguistics 37(3) (2014)
Scalable Construction of High-quality Web Corpora (2013)
Building Large Corpora from the Web Using a New Efficient Tool Chain (2012)
Roland Schäfer & Felix Bildhauer (2012) Building Large Corpora from the Web Using a New Efficient Tool Chain. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12). 486–493. [BibTeX]
Please cite this paper if you use the COW corpora up to version COW16.
Arguments and Adjuncts at the Syntax-Semantics Interface (2010)
Roland Schäfer (2010) Arguments and Adjuncts at the Syntax-Semantics Interface. Dissertation. Georg-August-Universität Göttingen. [BibTeX]
On Frequency Adjectives (2007)
Ossetic (2006)
Michael Job & Roland Schäfer (2006) Ossetic. In Encyclopedia of Language and Linguistics, 2nd edition. 109–115. [BibTeX]