Front matter pages
BPoMP: The Benchmark of Poetic Minimal Pairs – Limericks, Rhyme, and Narrative Coherence
Almas Abdibayev, Allen Riddell and Daniel Rockmore
pp. 1‑9
Ontology Population Reusing Resources for Dialogue Intent Detection: Generic and Multilingual Approach
Cristina Aceta, Izaskun Fernández and Aitor Soroa
pp. 10‑18
Efficient Multilingual Text Classification for Indian Languages
Salil Aggarwal, Sourav Kumar and Radhika Mamidi
pp. 19‑25
Domain Adaptation for Hindi-Telugu Machine Translation Using Domain Specific Back Translation
Hema Ala, Vandan Mujadia and Dipti Sharma
pp. 26‑34
ArabGlossBERT: Fine-Tuning BERT on Context-Gloss Pairs for WSD
Moustafa Al-Hajj and Mustafa Jarrar
pp. 35‑43
English-Arabic Cross-language Plagiarism Detection
Naif Alotaibi and Mike Joy
pp. 44‑52
Towards a Better Understanding of Noise in Natural Language Processing
Khetam Al Sharou, Zhenhao Li and Lucia Specia
pp. 53‑62
Comparing Supervised Machine Learning Techniques for Genre Analysis in Software Engineering Research Articles
Felipe Araújo de Britto, Thiago Castro Ferreira, Leonardo Pereira Nunes and Fernando Silva Parreiras
pp. 63‑72
Enriching the Transformer with Linguistic Factors for Low-Resource Machine Translation
Jordi Armengol-Estapé, Marta R. Costa-jussà and Carlos Escolano
pp. 73‑78
A Multi-Pass Sieve Coreference Resolution for Indonesian
Valentina Kania Prameswara Artari, Rahmad Mahendra, Meganingrum Arista Jiwanggi, Adityo Anggraito and Indra Budi
pp. 79‑85
Solving SCAN Tasks with Data Augmentation and Input Embeddings
Michal Auersperger and Pavel Pecina
pp. 86‑91
PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors
Andrei-Marius Avram, Vasile Pais and Dan Ioan Tufis
pp. 92‑101
TEASER: Towards Efficient Aspect-based SEntiment Analysis and Recognition
Vaibhav Bajaj, Kartikey Pant, Ishan Upadhyay, Srinath Nair and Radhika Mamidi
pp. 102‑110
Interactive Learning Approach for Arabic Target-Based Sentiment Analysis
Husamelddin Balla, Marisa Llorens Salvador and Sarah Jane Delany
pp. 111‑120
Litescale: A Lightweight Tool for Best-worst Scaling Annotation
Valerio Basile and Christian Cagnazzo
pp. 121‑127
Probabilistic Ensembles of Zero- and Few-Shot Learning Models for Emotion Classification
Angelo Basile, Guillermo Pérez-Torró and Marc Franco-Salvador
pp. 128‑137
Cross-Lingual Wolastoqey-English Definition Modelling
Diego Bear and Paul Cook
pp. 138‑146
Neural Network-Based Generation of Sport Summaries: A Preliminary Study
David Stéphane Belemkoabga, Aurélien Bossard, Abdallah Essa, Christophe Rodrigues and Kévin Sylla
pp. 147‑154
Split-and-Rephrase in a Cross-Lingual Manner: A Complete Pipeline
Paulo Berlanga Neto and Evandro Eduardo Seron Ruiz
pp. 155‑164
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages with Fewer Resources than English
Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis and Arantza Casillas
pp. 165‑172
Can the Transformer Be Used as a Drop-in Replacement for RNNs in Text-Generating GANs?
Kevin Blin and Andrei Kucharavy
pp. 173‑181
Predicting the Factuality of Reporting of News Media Using Observations about User Attention in Their YouTube Channels
Krasimira Bozhanova, Yoan Dinkov, Ivan Koychev, Maria Castaldo, Tommaso Venturini and Preslav Nakov
pp. 182‑189
OCR Processing of Swedish Historical Newspapers Using Deep Hybrid CNN–LSTM Networks
Molly Brandt Skelbye and Dana Dannélls
pp. 190‑198
A Psychologically Informed Part-of-Speech Analysis of Depression in Social Media
Ana-Maria Bucur, Ioana R. Podina and Liviu P. Dinu
pp. 199‑207
InFoBERT: Zero-Shot Approach to Natural Language Understanding Using Contextualized Word Embedding
Pavel Burnyshev, Andrey Bout, Valentin Malykh and Irina Piontkovskaya
pp. 208‑215
Active Learning for Assisted Corpus Construction: A Case Study in Knowledge Discovery from Biomedical Text
Hian Cañizares-Díaz, Alejandro Piad-Morffis, Suilan Estevez-Velarde, Yoan Gutiérrez, Yudivián Almeida Cruz, Andres Montoyo and Rafael Muñoz-Guillena
pp. 216‑225
Unsupervised Text Style Transfer with Content Embeddings
Keith Carlson, Allen Riddell and Daniel Rockmore
pp. 226‑233
Evaluating Recognizing Question Entailment Methods for a Portuguese Community Question-Answering System about Diabetes Mellitus
Thiago Castro Ferreira, João Victor de Pinho Costa, Isabela Rigotto, Vitoria Portella, Gabriel Frota, Ana Luisa A. R. Guimarães, Adalberto Penna, Isabela Lee, Tayane A. Soares, Sophia Rolim, Rossana Cunha, Celso França, Ariel Santos, Rivaney F. Oliveira, Abisague Langbehn, Daniel Hasan Dalip, Marcos André Gonçalves, Rodrigo Bastos Fóscolo and Adriana Pagano
pp. 234‑243
On the Usability of Transformers-based Models for a French Question-Answering Task
Oralie Cattan, Christophe Servan and Sophie Rosset
pp. 244‑255
Classification of Code-Mixed Text Using Capsule Networks
Shanaka Chathuranga and Surangika Ranathunga
pp. 256‑263
Character-based Thai Word Segmentation with Multiple Attentions
Thodsaporn Chay-intr, Hidetaka Kamigaito and Manabu Okumura
pp. 264‑273
Are Language-Agnostic Sentence Representations Actually Language-Agnostic?
Yu Chen and Tania Avgustinova
pp. 274‑280
Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting
Hee-Soo Choi, Bruno Guillaume, Karën Fort and Guy Perrier
pp. 281‑290
RED: A Novel Dataset for Romanian Emotion Detection from Tweets
Alexandra Ciobotaru and Liviu P. Dinu
pp. 291‑300
Assessing the Eligibility of Backtranslated Samples Based on Semantic Similarity for the Paraphrase Identification Task
Jean-Philippe Corbeil and Hadi Abdi Ghavidel
pp. 301‑308
Fine-tuning Neural Language Models for Multidimensional Opinion Mining of English-Maltese Social Data
Keith Cortis, Kanishk Verma and Brian Davis
pp. 309‑314
Towards an Etymological Map of Romanian
Alina Maria Cristea, Anca Dinu, Liviu P. Dinu, Simona Georgescu, Ana Sabina Uban and Laurentiu Zoicas
pp. 315‑323
A Syntax-Aware Edit-based System for Text Simplification
Oscar M. Cumbicus-Pineda, Itziar Gonzalez-Dios and Aitor Soroa
pp. 324‑334
On Generating Fact-Infused Question Variations
Arthur Deschamps, Sujatha Das Gollapalli and See-Kiong Ng
pp. 335‑345
Event Prominence Extraction Combining a Knowledge-Based Syntactic Parser and a BERT Classifier for Dutch
Thierry Desot, Orphee De Clercq and Veronique Hoste
pp. 346‑357
Automatic Detection and Classification of Mental Illnesses from General Social Media Texts
Anca Dinu and Andreea-Codrina Moldovan
pp. 358‑366
A Pre-trained Transformer and CNN Model with Joint Language ID and Part-of-Speech Tagging for Code-Mixed Social-Media Text
Suman Dowlagar and Radhika Mamidi
pp. 367‑374
Tracing Source Language Interference in Translation with Graph-Isomorphism Measures
Koel Dutta Chowdhury, Cristina España-Bonet and Josef van Genabith
pp. 375‑385
Decoupled Transformer for Scalable Inference in Open-domain Question Answering
Haytham Elfdaeel and Stanislav Peshterliev
pp. 386‑393
Towards Task-Agnostic Privacy- and Utility-Preserving Models
Yaroslav Emelyanov
pp. 394‑401
Knowledge Discovery in COVID-19 Research Literature
Ernesto L. Estevanell-Valladares, Suilan Estevez-Velarde, Alejandro Piad-Morffis, Yoan Gutierrez, Andres Montoyo, Rafael Muñoz and Yudivián Almeida Cruz
pp. 402‑410
Online Learning over Time in Adaptive Neural Machine Translation
Thierry Etchegoyhen, David Ponce, Harritxu Gete and Victor Ruiz
pp. 411‑420
Improving Character-Aware Neural Language Model by Warming up Character Encoder under Skip-gram Architecture
Yukun Feng, Chenlong Hu, Hidetaka Kamigaito, Hiroya Takamura and Manabu Okumura
pp. 421‑427
Interpretable Identification of Cybersecurity Vulnerabilities from News Articles
Pierre Frode de la Foret, Stefan Ruseti, Cristian Sandescu, Mihai Dascalu and Sebastien Travadel
pp. 428‑436
Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of Marathi
Saurabh Sampatrao Gaikwad, Tharindu Ranasinghe, Marcos Zampieri and Christopher Homan
pp. 437‑443
Relying on Discourse Analysis to Answer Complex Questions by Neural Machine Reading Comprehension
Boris Galitsky, Dmitry Ilvovsky and Elizaveta Goncharova
pp. 444‑453
A Dynamic Head Importance Computation Mechanism for Neural Machine Translation
Akshay Goindani and Manish Shrivastava
pp. 454‑462
Syntax and Themes: How Context Free Grammar Rules and Semantic Word Association Influence Book Success
Henry Gorelick, Biddut Sarker Bijoy, Syeda Jannatus Saba, Sudipta Kar, Md Saiful Islam and Mohammad Ruhul Amin
pp. 463‑474
SocialVisTUM: An Interactive Visualization Toolkit for Correlated Neural Topic Models on Social Media Opinion Mining
Gerhard Hagerer, Martin Kirchhoff, Hannah Danner, Robert Pesch, Mainak Ghosh, Archishman Roy, Jiaxi Zhao and Georg Groh
pp. 475‑482
Apples to Apples: A Systematic Evaluation of Topic Models
Ismail Harrando, Pasquale Lisena and Raphael Troncy
pp. 483‑493
Claim Verification Using a Multi-GAN Based Model
Amartya Hatua, Arjun Mukherjee and Rakesh Verma
pp. 494‑503
Semi-Supervised and Unsupervised Sense Annotation via Translations
Bradley Hauer, Grzegorz Kondrak, Yixing Luan, Arnob Mallik and Lili Mou
pp. 504‑513
Personality Predictive Lexical Cues and Their Correlations
Xiaoli He and Gerard de Melo
pp. 514‑523
Evaluation Datasets for Cross-lingual Semantic Textual Similarity
Tomáš Hercig and Pavel Kral
pp. 524‑529
Relation Extraction Using Multiple Pre-Training Models in Biomedical Domain
Satoshi Hiai, Kazutaka Shimada, Taiki Watanabe, Akiva Miura and Tomoya Iwakura
pp. 530‑537
Discussion Structure Prediction Based on a Two-step Method
Takumi Himeno and Kazutaka Shimada
pp. 538‑546
On the Usefulness of Personality Traits in Opinion-oriented Tasks
Marjan Hosseinia, Eduard Dragut, Dainis Boumber and Arjun Mukherjee
pp. 547‑556
Application of Deep Learning Methods to SNOMED CT Encoding of Clinical Texts: From Data Collection to Extreme Multi-Label Text-Based Classification
Anton Hristov, Aleksandar Tahchiev, Hristo Papazov, Nikola Tulechki, Todor Primov and Svetla Boytcheva
pp. 557‑565
Syntax Matters! Syntax-Controlled in Text Style Transfer
Zhiqiang Hu, Roy Ka-Wei Lee and Charu C. Aggarwal
pp. 566‑575
Transfer Learning for Czech Historical Named Entity Recognition
Helena Hubková and Pavel Kral
pp. 576‑582
Personality Trait Identification Using the Russian Feature Extraction Toolkit
James R. Hull, Valerie Novak, C. Anton Rytting, Paul Rodrigues, Victor M. Frank and Matthew Swahn
pp. 583‑592
Semi-Supervised Learning Based on Auto-generated Lexicon Using XAI in Sentiment Analysis
Hohyun Hwang and Younghoon Lee
pp. 593‑600
Multiple Teacher Distillation for Robust and Greener Models
Artur Ilichev, Nikita Sorokin, Irina Piontkovskaya and Valentin Malykh
pp. 601‑610
BERT Embeddings for Automatic Readability Assessment
Joseph Marvin Imperial
pp. 611‑618
Semantic-Based Opinion Summarization
Marcio Inácio and Thiago Pardo
pp. 619‑628
Using Collaborative Filtering to Model Argument Selection
Sagar Indurkhya
pp. 629‑639
Domain-Specific Japanese ELECTRA Model Using a Small Corpus
Youki Itoh and Hiroyuki Shinnou
pp. 640‑646
BERT-PersNER: A New Model for Persian Named Entity Recognition
Farane Jalali Farahani and Gholamreza Ghassem-Sani
pp. 647‑654
Cross-lingual Fine-tuning for Abstractive Arabic Text Summarization
Mram Kahla, Zijian Győző Yang and Attila Novák
pp. 655‑663
Behavior of Modern Pre-trained Language Models Using the Example of Probing Tasks
Ekaterina Kalyaeva, Oleg Durandin and Alexey Malafeev
pp. 664‑670
Towards Quantifying Magnitude of Political Bias in News Articles Using a Novel Annotation Schema
Lalitha Kameswari and Radhika Mamidi
pp. 671‑678
Application of Mix-Up Method in Document Classification Task Using BERT
Naoki Kikuta and Hiroyuki Shinnou
pp. 679‑683
Translation Memory Retrieval Using Lucene
Kwang-hyok Kim, Myong-ho Cho, Chol-ho Ryang, Ju-song Im, Song-yong Cho and Yong-jun Han
pp. 684‑691
Now, It’s Personal: The Need for Personalized Word Sense Disambiguation
Milton King and Paul Cook
pp. 692‑700
Multilingual Image Corpus: Annotation Protocol
Svetla Koeva
pp. 701‑707
ELERRANT: Automatic Grammatical Error Type Classification for Greek
Katerina Korre, Marita Chatzipanagiotou and John Pavlopoulos
pp. 708‑717
Neural Machine Translation for Sinhala-English Code-Mixed Text
Archchana Kugathasan and Sagara Sumathipala
pp. 718‑726
Multilingual Multi-Domain NMT for Indian Languages
Sourav Kumar, Salil Aggarwal and Dipti Sharma
pp. 727‑733
Fiction in Russian Translation: A Translationese Study
Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski and Ruslan Mitkov
pp. 734‑743
Corpus Creation and Language Identification in Low-Resource Code-Mixed Telugu-English Text
Siva Subrahamanyam Varma Kusampudi, Anudeep Chaluvadi and Radhika Mamidi
pp. 744‑752
Sentiment Analysis in Code-Mixed Telugu-English Text with Unsupervised Data Normalization
Siva Subrahamanyam Varma Kusampudi, Preetham Sathineni and Radhika Mamidi
pp. 753‑760
From Constituency to UD-Style Dependency: Building the First Conversion Tool of Turkish
Aslı Kuzgun, Oğuz Kerem Yıldız, Neslihan Cesur, Büşra Marşan, Arife Betül Yenice, Ezgi Sanıyar, Oguzhan Kuyrukçu, Bilge Nas Arıcan and Olcay Taner Yıldız
pp. 761‑769
Making Your Tweets More Fancy: Emoji Insertion to Texts
Jingun Kwon, Naoki Kobayashi, Hidetaka Kamigaito, Hiroya Takamura and Manabu Okumura
pp. 770‑779
Addressing Slot-Value Changes in Task-oriented Dialogue Systems through Dialogue Domain Adaptation
Tiziano Labruna and Bernardo Magnini
pp. 780‑789
Developing a Clinical Language Model for Swedish: Continued Pretraining of Generic BERT with In-Domain Data
Anastasios Lamproudis, Aron Henriksson and Hercules Dalianis
pp. 790‑797
Text Retrieval for Language Learners: Graded Vocabulary vs. Open Learner Model
John Lee and Chak Yan Yeung
pp. 798‑804
Transforming Multi-Conditioned Generation from Meaning Representation
Joosung Lee
pp. 805‑813
Frustration Level Annotation in Latvian Tweets with Non-Lexical Means of Expression
Viktorija Leonova and Janis Zuters
pp. 814‑823
System Combination for Grammatical Error Correction Based on Integer Programming
Ruixi Lin and Hwee Tou Ng
pp. 824‑829
Multilingual Learning for Mild Cognitive Impairment Screening from a Clinical Speech Task
Hali Lindsay, Philipp Müller, Insa Kröger, Johannes Tröger, Nicklas Linz, Alexandra Konig, Radia Zeghari, Frans RJ Verhey and Inez HGB Ramakers
pp. 830‑838
Naturalness Evaluation of Natural Language Generation in Task-oriented Dialogues Using BERT
Ye Liu, Wolfgang Maier, Wolfgang Minker and Stefan Ultes
pp. 839‑845
Towards the Application of Calibrated Transformers to the Unsupervised Estimation of Question Difficulty from Text
Ekaterina Loginova, Luca Benedetto, Dries Benoit and Paolo Cremonesi
pp. 846‑855
GeSERA: General-domain Summary Evaluation by Relevance Analysis
Jessica López Espejel, Gaël de Chalendar, Jorge Garcia Flores, Thierry Charnois and Ivan Vladimir Meza Ruiz
pp. 856‑867
On the Interaction between Annotation Quality and Classifier Performance in Abusive Language Detection
Holly Lopez Long, Alexandra O’Neil and Sandra Kübler
pp. 868‑875
NEREL: A Russian Dataset with Nested Named Entities, Relations and Events
Natalia Loukachevitch, Ekaterina Artemova, Tatiana Batura, Pavel Braslavski, Ilia Denisov, Vladimir Ivanov, Suresh Manandhar, Alexander Pugachev and Elena Tutubalina
pp. 876‑885
Active Learning for Interactive Relation Extraction in a French Newspaper’s Articles
Cyrielle Mallart, Michel Le Nouy, Guillaume Gravier and Pascale Sébillot
pp. 886‑894
ROFF - A Romanian Twitter Dataset for Offensive Language
Mihai Manolescu and Çağrı Çöltekin
pp. 895‑900
Monitoring Fact Preservation, Grammatical Consistency and Ethical Behavior of Abstractive Summarization Neural Models
Iva Marinova, Yolina Petrova, Milena Slavcheva, Petya Osenova, Ivaylo Radev and Kiril Simov
pp. 901‑909
Cultural Topic Modelling over Novel Wikipedia Corpora for South-Slavic Languages
Filip Markoski, Elena Markoska, Nikola Ljubešić, Eftim Zdravevski and Ljupco Kocarev
pp. 910‑917
Discovery of Multiword Expressions with Loanwords and Their Equivalents in the Persian Language
Katarzyna Marszałek-Kowalewska
pp. 918‑928
The Impact of Text Normalization on Multiword Expressions Discovery in Persian
Katarzyna Marszałek-Kowalewska
pp. 929‑939
Improving Neural Language Processing with Named Entities
Kyoumoto Matsushita, Takuya Makino and Tomoya Iwakura
pp. 940‑949
TREMoLo-Tweets: A Multi-Label Corpus of French Tweets for Language Register Characterization
Jade Mekki, Gwénolé Lecorvé, Delphine Battistelli and Nicolas Béchet
pp. 950‑958
Ranking Online Reviews Based on Their Helpfulness: An Unsupervised Approach
Alimuddin Melleng, Anna Jurek-Loughrey and Deepak P
pp. 959‑967
incom.py 2.0 - Calculating Linguistic Distances and Asymmetries in Auditory Perception of Closely Related Languages
Marius Mosbach, Irina Stenger, Tania Avgustinova, Bernd Möbius and Dietrich Klakow
pp. 968‑977
Not All Linearizations Are Equally Data-Hungry in Sequence Labeling Parsing
Alberto Muñoz-Ortiz, Michalina Strzyz and David Vilares
pp. 978‑988
Pre-training a BERT with Curriculum Learning by Increasing Block-Size of Input Text
Koichi Nagatsuka, Clifford Broni-Bediako and Masayasu Atsumi
pp. 989‑996
COVID-19 in Bulgarian Social Media: Factuality, Harmfulness, Propaganda, and Framing
Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino and Yifan Zhang
pp. 997‑1009
A Second Pandemic? Analysis of Fake News about COVID-19 Vaccines in Qatar
Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino and Yifan Zhang
pp. 1010‑1021
A Hierarchical Entity Graph Convolutional Network for Relation Extraction across Documents
Tapas Nayak and Hwee Tou Ng
pp. 1022‑1030
Improving Distantly Supervised Relation Extraction with Self-Ensemble Noise Filtering
Tapas Nayak, Navonil Majumder and Soujanya Poria
pp. 1031‑1039
Learning Entity-Likeness with Multiple Approximate Matches for Biomedical NER
An Nguyen Le, Hajime Morita and Tomoya Iwakura
pp. 1040‑1049
Extending a Text-to-Pictograph System to French and to Arasaac
Magali Norré, Vincent Vandeghinste, Pierrette Bouillon and Thomas François
pp. 1050‑1059
Transfer-based Enrichment of a Hungarian Named Entity Dataset
Attila Novák and Borbála Novák
pp. 1060‑1067
One Size Does Not Fit All: Finding the Optimal Subword Sizes for FastText Models across Languages
Vít Novotný, Eniafe Festus Ayetiran, Dalibor Bačovský, Dávid Lupták, Michal Štefánik and Petr Sojka
pp. 1068‑1074
CLexIS2: A New Corpus for Complex Word Identification Research in Computing Studies
Jenny A. Ortiz Zambrano and Arturo Montejo-Ráez
pp. 1075‑1083
Towards Precise Lexicon Integration in Neural Machine Translation
Ogün Öz and Maria Sukhareva
pp. 1084‑1095
OffendES: A New Corpus in Spanish for Offensive Language Research
Flor Miriam Plaza-del-Arco, Arturo Montejo-Ráez, L. Alfonso Ureña-López and María-Teresa Martín-Valdivia
pp. 1096‑1108
On Machine Translation of User Reviews
Maja Popović, Alberto Poncelas, Marija Brkic and Andy Way
pp. 1109‑1118
Multilingual Coreference Resolution with Harmonized Annotations
Ondřej Pražák, Miloslav Konopík and Jakub Sido
pp. 1119‑1123
Predicting Informativeness of Semantic Triples
Judita Preiss
pp. 1124‑1129
Unknown Intent Detection Using Multi-Objective Optimization on Deep Learning Classifiers
Prerna Prem, Zishan Ahmad, Asif Ekbal, Shubhashis Sengupta, Sakshi C. Jain and Roshni Ramnani
pp. 1130‑1137
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň and Josef Steinberger
pp. 1138‑1149
Metric Learning in Multilingual Sentence Similarity Measurement for Document Alignment
Charith Rajitha, Lakmali Piyarathna, Dilan Sachintha and Surangika Ranathunga
pp. 1150‑1157
Multi-label Diagnosis Classification of Swedish Discharge Summaries – ICD-10 Code Assignment Using KB-BERT
Sonja Remmer, Anastasios Lamproudis and Hercules Dalianis
pp. 1158‑1166
Siamese Networks for Inference in Malayalam Language Texts
Sara Renjit and Sumam Mary Idicula
pp. 1167‑1173
A Call for Clarity in Contemporary Authorship Attribution Evaluation
Allen Riddell, Haining Wang and Patrick Juola
pp. 1174‑1179
Varieties of Plain Language
Allen Riddell and Yohei Igarashi
pp. 1180‑1187
Word Discriminations for Vocabulary Inventory Prediction
Frankie Robertson
pp. 1188‑1195
FrenLyS: A Tool for the Automatic Simplification of French General Language Texts
Eva Rolin, Quentin Langlois, Patrick Watrin and Thomas François
pp. 1196‑1205
Spelling Correction for Russian: A Comparative Study of Datasets and Methods
Alla Rozovskaya
pp. 1206‑1216
Sentiment-Aware Measure (SAM) for Evaluating Sentiment Transfer by Machine Translation Systems
Hadeel Saadany, Constantin Orăsan, Emad Mohamed and Ashraf Tantavy
pp. 1217‑1226
Multilingual Epidemic Event Extraction: From Simple Classification Methods to Open Information Extraction (OIE) and Ontology
Sihem Sahnoun and Gaël Lejeune
pp. 1227‑1233
Exploiting Domain-Specific Knowledge for Judgment Prediction Is No Panacea
Olivier Salaün, Philippe Langlais and Karim Benyekhlef
pp. 1234‑1243
Masking and Transformer-based Models for Hyperpartisanship Detection in News
Javier Sánchez-Junquera, Paolo Rosso, Manuel Montes-y-Gómez and Simone Paolo Ponzetto
pp. 1244‑1251
Serbian NER&Beyond: The Archaic and the Modern Intertwinned
Branislava Šandrih Todorović, Cvetana Krstev, Ranka Stanković and Milica Ikonić Nešić
pp. 1252‑1260
A Semi-Supervised Approach to Detect Toxic Comments
Ghivvago Damas Saraiva, Rafael Anchiêta, Francisco Assis Ricarte Neto and Raimundo Moura
pp. 1261‑1267
Graph-based Argument Quality Assessment
Ekaterina Saveleva, Volha Petukhova, Marius Mosbach and Dietrich Klakow
pp. 1268‑1280
A Hybrid Approach of Opinion Mining and Comparative Linguistic Analysis of Restaurant Reviews
Salim Sazzed
pp. 1281‑1288
A Lexicon for Profane and Obscene Text Identification in Bengali
Salim Sazzed
pp. 1289‑1296
A Case Study of Deep Learning-Based Multi-Modal Methods for Labeling the Presence of Questionable Content in Movie Trailers
Mahsa Shafaei, Christos Smailis, Ioannis Kakadiaris and Thamar Solorio
pp. 1297‑1307
A Domain-Independent Holistic Approach to Deception Detection
Sadat Shahriar, Arjun Mukherjee and Omprakash Gnawali
pp. 1308‑1317
Towards Domain-Generalizable Paraphrase Identification by Avoiding the Shortcut Learning
Xin Shen and Wai Lam
pp. 1318‑1325
Czert – Czech BERT-like Model for Language Representation
Jakub Sido, Ondřej Pražák, Pavel Přibáň, Jan Pašek, Michal Seják and Miloslav Konopík
pp. 1326‑1338
Exploring German Multi-Level Text Simplification
Nicolas Spring, Annette Rios and Sarah Ebling
pp. 1339‑1349
Exploring Reliability of Gold Labels for Emotion Detection in Twitter
Sanja Stajner
pp. 1350‑1359
How to Obtain Reliable Labels for MBTI Classification from Texts?
Sanja Stajner and Seren Yenikent
pp. 1360‑1368
Watching a Language Model Learning Chess
Andreas Stöckl
pp. 1369‑1379
Tackling Multilinguality and Internationality in Fake News
Andrey Tagarev, Krasimira Bozhanova, Ivelina Nikolova-Koleva and Ivan Ivanov
pp. 1380‑1386
Learning and Evaluating Chinese Idiom Embeddings
Minghuan Tan and Jing Jiang
pp. 1387‑1396
Does BERT Understand Idioms? A Probing-Based Empirical Study of BERT Encodings of Idioms
Minghuan Tan and Jing Jiang
pp. 1397‑1407
An Empirical Analysis of Topic Models: Uncovering the Relationships between Hyperparameters, Document Length and Performance Measures
Silvia Terragni and Elisabetta Fersini
pp. 1408‑1416
TR-SEQ: Named Entity Recognition Dataset for Turkish Search Engine Queries
Berkay Topçu and İlknur Durgar El-Kahlout
pp. 1417‑1422
Opinion Prediction with User Fingerprinting
Kishore Tumarada, Yifan Zhang, Fan Yang, Eduard Dragut, Omprakash Gnawali and Arjun Mukherjee
pp. 1423‑1431
Can Multilingual Transformers Fight the COVID-19 Infodemic?
Lasitha Uyangodage, Tharindu Ranasinghe and Hansi Hettiarachchi
pp. 1432‑1437
Contextual-Lexicon Approach for Abusive Language Detection
Francielle Vargas, Fabiana Rodrigues de Góes, Isabelle Carvalho, Fabrício Benevenuto and Thiago Pardo
pp. 1438‑1447
Comparative Analysis of Fine-tuned Deep Learning Language Models for ICD-10 Classification Task for Bulgarian Language
Boris Velichkov, Sylvia Vassileva, Simeon Gerginov, Boris Kraychev, Ivaylo Ivanov, Philip Ivanov, Ivan Koychev and Svetla Boytcheva
pp. 1448‑1454
Mistake Captioning: A Machine Learning Approach for Detecting Mistakes and Generating Instructive Feedback
Anton Vinogradov, Andrew Miles Byrd and Brent Harrison
pp. 1455‑1462
A Novel Machine Learning Based Approach for Post-OCR Error Detection
Shafqat Mumtaz Virk, Dana Dannélls and Azam Sheikh Muhammad
pp. 1463‑1470
A Data-Driven Semi-Automatic Framenet Development Methodology
Shafqat Mumtaz Virk, Dana Dannélls, Lars Borin and Markus Forsberg
pp. 1471‑1479
A Deep Learning System for Automatic Extraction of Typological Linguistic Information from Descriptive Grammars
Shafqat Mumtaz Virk, Daniel Foster, Azam Sheikh Muhammad and Raheela Saleem
pp. 1480‑1489
Recognizing and Splitting Conditional Sentences for Automation of Business Processes Management
Ngoc Phuoc An Vo, Irene Manotas, Octavian Popescu, Algimantas Černiauskas and Vadim Sheinin
pp. 1490‑1497
“Don’t discuss”: Investigating Semantic and Argumentative Features for Supervised Propagandist Message Detection and Classification
Vorakit Vorakitphan, Elena Cabrio and Serena Villata
pp. 1498‑1507
ComboNER: A Lightweight All-In-One POS Tagger, Dependency Parser and NER
Aleksander Wawer
pp. 1508‑1514
Investigating Annotator Bias in Abusive Language Datasets
Maximilian Wich, Christian Widmer, Gerhard Hagerer and Georg Groh
pp. 1515‑1525
Rules Ruling Neural Networks - Neural vs. Rule-Based Grammar Checking for a Low Resource Language
Linda Wiechetek, Flammie Pirinen, Mika Hämäläinen and Chiara Argese
pp. 1526‑1535
Transformer with Syntactic Position Encoding for Machine Translation
Yikuan Xie, Wenyong Wang, Mingqian Du and Qing He
pp. 1536‑1544
Towards Sentiment Analysis of Tobacco Products’ Usage in Social Media
Venkata Himakar Yanamandra, Kartikey Pant and Radhika Mamidi
pp. 1545‑1552
Improving Evidence Retrieval with Claim-Evidence Entailment
Fan Yang, Eduard Dragut and Arjun Mukherjee
pp. 1553‑1558
Sentence Structure and Word Relationship Modeling for Emphasis Selection
Haoran Yang and Wai Lam
pp. 1559‑1566
Utterance Position-Aware Dialogue Act Recognition
Yuki Yano, Akihiro Tamura, Takashi Ninomiya and Hiroaki Obayashi
pp. 1567‑1574
Tell Me What You Read: Automatic Expertise-Based Annotator Assignment for Text Annotation in Expert Domains
Hiyori Yoshikawa, Tomoya Iwakura, Kimi Kaneko, Hiroaki Yoshida, Yasutaka Kumano, Kazutaka Shimada, Rafal Rzepka and Patrycja Swieczkowska
pp. 1575‑1585
Abstractive Document Summarization with Word Embedding Reconstruction
Jingyi You, Chenlong Hu, Hidetaka Kamigaito, Hiroya Takamura and Manabu Okumura
pp. 1586‑1596
Interpretable Propaganda Detection in News Articles
Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James Glass and Preslav Nakov
pp. 1597‑1605
Generic Mechanism for Reducing Repetitions in Encoder-Decoder Models
Ying Zhang, Hidetaka Kamigaito, Tatsuya Aoki, Hiroya Takamura and Manabu Okumura
pp. 1606‑1615
Knowledge Distillation with BERT for Image Tag-Based Privacy Prediction
Chenye Zhao and Cornelia Caragea
pp. 1616‑1625
Delexicalized Cross-lingual Dependency Parsing for Xibe
He Zhou and Sandra Kübler
pp. 1626‑1635
AutoChart: A Dataset for Chart-to-Text Generation Task
Jiawen Zhu, Jinye Ran, Roy Ka-Wei Lee, Zhi Li and Kenny Choo
pp. 1636‑1644
A Comparative Study on Abstractive and Extractive Approaches in Summarization of European Legislation Documents
Valentin Zmiycharov, Milen Chechev, Gergana Lazarova, Todor Tsonkov and Ivan Koychev
pp. 1645‑1651
Not All Comments Are Equal: Insights into Comment Moderation from a Topic-Aware Model
Elaine Zosa, Ravi Shekhar, Mladen Karan and Matthew Purver
pp. 1652‑1662

