Publications

Mayer, Thomas, Johann-Mattis List, Anselm Terhalle and Matthias Urban. 2014. An interactive visualization of crosslinguistic colexification patterns. In Proceedings of the VisLR Workshop at LREC 2014, 1-8. [pdf]

Mayer, Thomas and Michael Cysouw. 2014. Creating a massively parallel Bible corpus. In Proceedings of LREC 2014, 3158-3163. [pdf]

Mayer, Thomas, Bernhard Wälchli, Christian Rohrdantz and Michael Hund. 2014. From the extraction of continuous features in parallel texts to visual analytics of heterogeneous areal-typological datasets: An extended functional and algorithmic processing pipeline. In Language processing and grammars: The role of functionally oriented computational models (SLCS) (Series: Studies in Language). Amsterdam: John Benjamins, 13-38.

Mayer, Thomas. 2014. Inducing place distinctions of consonants from their distribution in words. In Szmrecsanyi, Benedikt and Bernhard Wälchli (eds.). Aggregating Dialectology, Typology, and Register Analysis. Linguistic variation in text and speech (Series: Linguae et Litterae: Publications of the School of Language and Literature, Freiburg Institute for Advanced Studies). Berlin: de Gruyter, 394-425.

Mayer, Thomas, Michael Spagnol and Florian Schönhuber. 2013. Fixing the broken plural in Maltese. In Proceedings of the Third International Conference on Maltese Linguistics (International Association for Maltese Linguistics).

Mayer, Thomas and Christian Rohrdantz. 2013. PhonMatrix: Visualizing co-occurrence constraints in sounds. In Proceedings of the ACL 2013 System Demonstration. [pdf]

Mayer, Thomas and Michael Cysouw. 2012. Language comparison through sparse multilingual word alignment. In Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH, 54–62. [pdf]

Butt, Miriam, Jelena Prokić, Thomas Mayer and Michael Cysouw. 2012. Introduction. In Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH, 1–6. [pdf]

Rohrdantz, Christian, Michael Hund, Thomas Mayer, Bernhard Wälchli and Daniel A. Keim. 2012. The World’s Languages Explorer: Visual analysis of language features in genealogical and areal contexts. In Computer Graphics Forum, 31(3), 935–944. [link]

Biemann, C., Bildhauer, F., Evert, S., Goldhahn, D., Quasthoff, U., Schäfer, R., Simon, J., Swiezinski, L., & Zesch, T. 2013. Scalable Construction of High-Quality Web Corpora. In: Special Issue of the Journal for Language Technology and Computational Linguistics (JLCL), Gesellschaft für Sprachtechnologie und Computerlinguistik.

Eckart, T., Quasthoff, U., & Goldhahn, D. 2012. Language Statistics-Based Quality Assurance for Large Corpora. In: Proceedings of Asia Pacific Corpus Linguistics Conference 2012, Auckland, New Zealand.

Eckart, T., Quasthoff, U., & Goldhahn, D. 2012. The Influence of Corpus Quality on Statistical Measurements on Language Resources. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12).

Eckart, T., Hallsteinsdóttir, E., Helgadóttir, S., Quasthoff, U., & Goldhahn, D. 2014. A 500 Million Word POS-Tagged Icelandic Corpus. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC).

Eckart, T., Alshargi, F., Quasthoff, U., & Goldhahn, D. 2014. Large Arabic Web Corpora of High Quality: The Dimensions Time and Origin. In: Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools, LREC, Reykjavík.

Goldhahn, D., Eckart, T., & Quasthoff, U. 2012. Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12).

Goldhahn, D. 2013. Quantitative Methoden in der Sprachtypologie: Nutzung korpusbasierter Statistiken. Dissertation, University of Leipzig.

Goldhahn, D., & Quasthoff, U. 2014. Vocabulary-Based Language Similarity using Web Corpora. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC).

Goldhahn, D., Quasthoff, U., & Heyer, G. 2014. Corpus-Based Linguistic Typology: A Comprehensive Approach. In: Proceedings of Konvens 2014. Hildesheim, Germany.

Goldhahn, D., Remus, S., Quasthoff, U., & Biemann, C. 2014. Top-Level Domain Crawling for Producing Comprehensive Monolingual Corpora from the Web. In: Workshop on Challenges in the Management of Large Corpora (CMLC-2), LREC, Reykjavík.

Quasthoff, U., Mitra, R., Mitra, S., Eckart, T., Goldhahn, D., Goyal, P., & Mukherjee, A. 2014. Large Web Corpora of High Quality for Indian Languages. In: 2nd Workshop on Indian Language Data: Resources and Evaluation, LREC, Reykjavík.

 

Last updated on December 29th, 2014