BUCC 2021 Website | RANLP 2021 Website

Proceedings of the 14th Workshop on Building and Using Comparable Corpora (BUCC 2021)

Chairs
Reinhard Rapp (Athena R.C., Greece, Magdeburg-Stendal University of Applied Sciences and University of Mainz, Germany)
Serge Sharoff (University of Leeds, UK)
Pierre Zweigenbaum (Université Paris-Saclay, CNRS, LISN, Orsay, France)

BUCC 2021 Proceedings Home (HTML)
Full Proceedings Volume (PDF)
Author Index (HTML)
Bibliography (BibTeX)


pdf bib Front matter pages
pdf bib Invited Presentation
Pushpak Bhattacharyya
pp. 1‑1
pdf bib Mining Bilingual Word Pairs from Comparable Corpus using Apache Spark Framework
Sanjanasri JP, Vijay Krishna Menon, Soman KP and Krzysztof Wolk
pp. 2‑7
pdf bib Effective Bitext Extraction From Comparable Corpora Using a Combination of Three Different Approaches
Steinþór Steingrímsson, Pintu Lohar, Hrafn Loftsson and Andy Way
pp. 8‑17
pdf bib Syntax-aware Transformers for Neural Machine Translation: The Case of Text to Sign Gloss Translation
Santiago Egea Gómez, Euan McGill and Horacio Saggion
pp. 18‑27
pdf bib Employing Wikipedia as a resource for Named Entity Recognition in Morphologically complex under-resourced languages
Aravind Krishnan, Stefan Ziehe, Franziska Pannach and Caroline Sporleder
pp. 28‑39
pdf bib Semi-Automated Labeling of Requirement Datasets for Relation Extraction
Jeremias Bohn, Jannik Fischbach, Martin Schmitt, Hinrich Schütze and Andreas Vogelsang
pp. 40‑45
pdf bib Majority Voting with Bidirectional Pre-translation For Bitext Retrieval
Alexander Jones and Derry Tanti Wijaya
pp. 46‑59
pdf bib EM Corpus: a comparable corpus for a less-resourced language pair Manipuri-English
Rudali Huidrom, Yves Lepage and Khogendra Khomdram
pp. 60‑67
pdf bib On Pronunciations in Wiktionary: Extraction and Experiments on Multilingual Syllabification and Stress Prediction
Winston Wu and David Yarowsky
pp. 68‑74
pdf bib A Dutch Dataset for Cross-lingual Multilabel Toxicity Detection
Ben Burtenshaw and Mike Kestemont
pp. 75‑79

Last modified on October 13, 2021, 7:34 a.m.