Corpus Linguistics

Description

  

Publications

  • Kilgarriff, A., Pomikálek, J., Rychlý, P. & Suchomel, V (2012). Measuring Distance between Language Varieties. In Proceedings of the Sixth Inter-Varietal Applied Corpus Studies (IVACS2012). Leeds, UK. [More] 
  • Kilgarriff, A., Rychlý, P., Vojtěch, K. & Baisa, V (2012). Finding Multiwords of More Than Two Words. In Proceedings of the 15th EURALEX International Congress. Oslo, Norway. [More] 
  • Bharat Ram, A., Siva, R. & Kilgarriff, A (2012). Word Sketches for Turkish. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC2012), pages 2945-2950. Istanbul, Turkey. [More] 
  • Kilgarriff, A (2012). Getting to know your corpus. In Sojka, P., Horak, A., Kopecek, I. et al (editors), Proceedings of the 15th International Conference on Text, Speech and Dialogue (TSD2012), pages 3-15. Brno, Czech Republic : Springer. [More] 
  • Pomikálek, J., Jakubíček, M. & Rychlý, P (2012). Building a 70 billion word corpus of English from ClueWeb. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC2012), pages 502-506. Istanbul, Turkey. [More] 
  • Pomikálek, J (2011). Corpus Architect developments and CCBC (Comparable Corpus BootCat). In. Brighton, UK. [More] 
  • Kilgarriff, A (2011). Terminology, translation, and PRESEMT; word frequency lists and KELLY. In. Brighton, UK. [More] 
  • Kilgarriff, A., PVS, A. & Pomikálek, J (2011). BootCatting Comparable Corpora. In Proceedings of the 9th International Conference on Terminology and Artificial Intelligence, pages 123-126. Paris, France. [More] 
  • Jakubíček, M (2011). Effective Parsing Using Competing CFG Rules. In Habernal, I. & Matoušek, V. (editors), Proceedings of the 14th international conference on Text, Speech and Dialogue (TSD2011), pages 115-122. Plzeň, Czech Republic : Springer Verlag. [More] 
  • Jakubíček, M., Kilgarriff, A., McCarthy, D. & Rychlý, P (2010). Fast syntactic searching in very large corpora for many languages. In Otoguro, R., Ishikawa, K., Umemoto, H., Yoshimoto, K. & Harada, Y. (editors), Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation (PACLIC 24), pages 741-747. Tokyo, Japan. [More] 
  • Kilgarriff, A (2010). Comparable Corpora Within and Across Languages, Word Frequency Lists and the KELLY Project. In Rapp, R., Zweigenbaum, P. & Sharoff, S. (editors), Proceedings of the 3rd Workshop on Building and Using Comparable Corpora (BUCC2010) [held in conjunction with LREC2010], pages 1-5. Valletta, Malta. [More]