Felix Bildhauer & Roland Schäfer: Token-level noise in large Web corpora and non-destructive normalization for linguistic applications. Corpus Analysis with Noise in the Signal (CANS 2013). Corpus Linguistics 2013, Lancaster.
Felix Bildhauer & Roland Schäfer: Token-level noise in large Web corpora and non-destructive normalization for linguistic applications. Corpus Analysis with Noise in the Signal (CANS 2013). Corpus Linguistics 2013, Lancaster.