Publications
Year: 2010
- Kilgarriff, A (2010). Comparable Corpora Within and Across Languages, Word Frequency Lists and the KELLY Project. In Rapp, R., Zweigenbaum, P. & Sharoff, S. (editors), Proceedings of the 3rd Workshop on Building and Using Comparable Corpora (BUCC2010) [held in conjunction with LREC2010], pages 1-5. Valletta, Malta. [More] [Online version]
- Bungum, L. & Gambäck, B (2010). Evolutionary Algorithms in Natural Language Processing. In Yildirim, Ş. & Kofod-Petersen, A. (editors), Proceedings of the Second Norwegian Artificial Intelligence Symposium (NAIS 2010), pages 7-18. Gjøvik, Norway. [More] [JRESEARCH_FULLTEXT]
- Jakubíček, M., Kilgarriff, A., McCarthy, D. & Rychlý, P (2010). Fast syntactic searching in very large corpora for many languages. In Otoguro, R., Ishikawa, K., Umemoto, H., Yoshimoto, K. & Harada, Y. (editors), Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation (PACLIC 24), pages 741-747. Tokyo, Japan. [More] [Online version]
Year: 2011
- Tambouratzis, G., Sofianopoulos, S., Vassiliou, M., Simistira, F. & Tsimboukakis, N (2011). A resource-light phrase scheme for language-portable MT. In Forcada, M. L., Depraetere, H. & Vadeghinste, V. (editors), Proceedings of the 15th International Conference of the European Association for Machine Translation, pages 185-192. Leuven, Belgium. [More] [JRESEARCH_FULLTEXT]
- Bungum, L. & Gambäck, B (2011). A Survey of Domain Adaptation in Machine Translation: Towards a refinement of domain space. In Proceedings of the India-Norway Workshop on Web Concepts and Technologies. Trondheim, Norway. [More] [JRESEARCH_FULLTEXT]
- Kilgarriff, A., PVS, A. & Pomikálek, J (2011). BootCatting Comparable Corpora. In Proceedings of the 9th International Conference on Terminology and Artificial Intelligence, pages 123-126. Paris, France. [More] [JRESEARCH_FULLTEXT]
- Pomikálek, J (2011). Corpus Architect developments and CCBC (Comparable Corpus BootCat). In. Brighton, UK. [More] [Online version]
- Jakubíček, M (2011). Effective Parsing Using Competing CFG Rules. In Habernal, I. & Matoušek, V. (editors), Proceedings of the 14th international conference on Text, Speech and Dialogue (TSD2011), pages 115-122. Plzeň, Czech Republic : Springer Verlag. [More] [Online version]
- Sofianopoulos, S. & Tambouratzis, G (2011). Studying the SPEA2 Algorithm for Optimising a Pattern-Recognition Based Machine Translation System. In Proceedings of the 2011 IEEE Symposium on Computational Intelligence in Multicriteria Decision-Making (MCDM 2011), pages 97-104. Paris, France : IEEE PRESS. [More] [JRESEARCH_FULLTEXT]
- Kilgarriff, A (2011). Terminology, translation, and PRESEMT; word frequency lists and KELLY. In. Brighton, UK. [More] [JRESEARCH_FULLTEXT]
- Preuss, S., Keffer, H. & Schmidt, P (2011). Using annotated corpora for rapid development of new language pairs in MT. In Proceedings of GSCL 2011. Hamburg, Germany. [More] [JRESEARCH_FULLTEXT]
- Marsi, E., Lynum, A., Bungum, L. & Gambäck, B (2011). Word Translation Disambiguation without Parallel Texts. In Proceedings of the International Workshop on Using Linguistic Information for Hybrid Machine Translation. Barcelona, Spain. [More] [JRESEARCH_FULLTEXT]
Year: 2012
- Pomikálek, J., Jakubíček, M. & Rychlý, P (2012). Building a 70 billion word corpus of English from ClueWeb. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC2012), pages 502-506. Istanbul, Turkey. [More] [JRESEARCH_FULLTEXT]
- Lynum, A., Marsi, E., Bungum, L. & Gambäck, B (2012). Disambiguating word translations with target language models. In Proceedings of the Hybrid Machine Translation Workshop [held in conjunction with the 15th International Conference on Text, Speech and Dialogue [(TSD2012)], pages 378-385. Brno, Czech Republic : Springer. [More] [Online version]
- Bungum, L. & Gambäck, B (2012). Efficient N-gram Language Modeling for Billion Word Web-Corpora. In Proceedings of the workshop 'Challenges in the Management of Large Corpora' (CMLC) [held in conjunction with LREC2012], pages 6-12. Istanbul, Turkey. [More] [JRESEARCH_FULLTEXT]
- Tambouratzis, G., Troullinos, M., Sofianopoulos, S. & Vassiliou, M (2012). Accurate phrase alignment in a bilingual corpus for EBMT systems. In Proceedings of the 5th Workshop on Building and Using Comparable Corpora (BUCC2012) [held in conjunction with LREC2012], pages 104-111. Istanbul, Turkey. [More] [JRESEARCH_FULLTEXT]
- Bharat Ram, A., Siva, R. & Kilgarriff, A (2012). Word Sketches for Turkish. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC2012), pages 2945-2950. Istanbul, Turkey. [More] [JRESEARCH_FULLTEXT]
- Kilgarriff, A (2012). Getting to know your corpus. In Sojka, P., Horak, A., Kopecek, I. et al (editors), Proceedings of the 15th International Conference on Text, Speech and Dialogue (TSD2012), pages 3-15. Brno, Czech Republic : Springer. [More] [Online version]
- Sofianopoulos, S., Vassiliou, M. & Tambouratzis, G (2012). Implementing a language-independent MT Methodology. In, pages 1-10. Jeju Island, Korea. [More] [JRESEARCH_FULLTEXT]
- Tambouratzis, G., Vassiliou, M. & Sofianopoulos, S (2012). PRESEMT: Pattern Recognition-based Statistically Enhanced MT. In, pages 65-68. Avignon, France. [More] [JRESEARCH_FULLTEXT]
- Preuss, S., Keffer, H., Schmidt, P., Goumas, G., Asiki, A. & Konstantinou, I (2012). User Adaptation in a Hybrid MT System: Feeding User Corrections into Synchronous Grammars and System Dictionaries. In Proceedings of the Hybrid Machine Translation Workshop [held in conjunction with the 15th International Conference on Text, Speech and Dialogue [(TSD2012)], pages 362-369. Brno, Czech Republic : Springer. [More] [Online version]
- Kilgarriff, A. & Tambouratzis, G (2012). The PRESEMT Project. In, pages 27-28. Istanbul, Turkey. [More] [JRESEARCH_FULLTEXT]
- Kilgarriff, A., Rychlý, P., Vojtěch, K. & Baisa, V (2012). Finding Multiwords of More Than Two Words. In Proceedings of the 15th EURALEX International Congress. Oslo, Norway. [More] [JRESEARCH_FULLTEXT]
- Tambouratzis, G., Tsatsanifos, G., Dologlou, I. & Tsimboukakis, N (2012). SOM-based corpus modeling for disambiguation purposes in MT. In Proceedings of the Hybrid Machine Translation Workshop [held in conjunction with the 15th International Conference on Text, Speech and Dialogue [(TSD2012)]. Brno, Czech Republic. [More] [Online version]
- Kilgarriff, A., Pomikálek, J., Rychlý, P. & Suchomel, V (2012). Measuring Distance between Language Varieties. In Proceedings of the Sixth Inter-Varietal Applied Corpus Studies (IVACS2012). Leeds, UK. [More] [JRESEARCH_FULLTEXT]