Front matter |
pages |
BPoMP: The Benchmark of Poetic Minimal Pairs – Limericks, Rhyme, and Narrative Coherence Almas Abdibayev, Allen Riddell and Daniel Rockmore |
pp. 1‑9 |
Ontology Population Reusing Resources for Dialogue Intent Detection: Generic and Multilingual Approach Cristina Aceta, Izaskun Fernández and Aitor Soroa |
pp. 10‑18 |
Efficient Multilingual Text Classification for Indian Languages Salil Aggarwal, Sourav Kumar and Radhika Mamidi |
pp. 19‑25 |
Domain Adaptation for Hindi-Telugu Machine Translation Using Domain Specific Back Translation Hema Ala, Vandan Mujadia and Dipti Sharma |
pp. 26‑34 |
ArabGlossBERT: Fine-Tuning BERT on Context-Gloss Pairs for WSD Moustafa Al-Hajj and Mustafa Jarrar |
pp. 35‑43 |
English-Arabic Cross-language Plagiarism Detection Naif Alotaibi and Mike Joy |
pp. 44‑52 |
Towards a Better Understanding of Noise in Natural Language Processing Khetam Al Sharou, Zhenhao Li and Lucia Specia |
pp. 53‑62 |
Comparing Supervised Machine Learning Techniques for Genre Analysis in Software Engineering Research Articles Felipe Araújo de Britto, Thiago Castro Ferreira, Leonardo Pereira Nunes and Fernando Silva Parreiras |
pp. 63‑72 |
Enriching the Transformer with Linguistic Factors for Low-Resource Machine Translation Jordi Armengol-Estapé, Marta R. Costa-jussà and Carlos Escolano |
pp. 73‑78 |
A Multi-Pass Sieve Coreference Resolution for Indonesian Valentina Kania Prameswara Artari, Rahmad Mahendra, Meganingrum Arista Jiwanggi, Adityo Anggraito and Indra Budi |
pp. 79‑85 |
Solving SCAN Tasks with Data Augmentation and Input Embeddings Michal Auersperger and Pavel Pecina |
pp. 86‑91 |
PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors Andrei-Marius Avram, Vasile Pais and Dan Ioan Tufis |
pp. 92‑101 |
TEASER: Towards Efficient Aspect-based SEntiment Analysis and Recognition Vaibhav Bajaj, Kartikey Pant, Ishan Upadhyay, Srinath Nair and Radhika Mamidi |
pp. 102‑110 |
Interactive Learning Approach for Arabic Target-Based Sentiment Analysis Husamelddin Balla, Marisa Llorens Salvador and Sarah Jane Delany |
pp. 111‑120 |
Litescale: A Lightweight Tool for Best-worst Scaling Annotation Valerio Basile and Christian Cagnazzo |
pp. 121‑127 |
Probabilistic Ensembles of Zero- and Few-Shot Learning Models for Emotion Classification Angelo Basile, Guillermo Pérez-Torró and Marc Franco-Salvador |
pp. 128‑137 |
Cross-Lingual Wolastoqey-English Definition Modelling Diego Bear and Paul Cook |
pp. 138‑146 |
Neural Network-Based Generation of Sport Summaries: A Preliminary Study David Stéphane Belemkoabga, Aurélien Bossard, Abdallah Essa, Christophe Rodrigues and Kévin Sylla |
pp. 147‑154 |
Split-and-Rephrase in a Cross-Lingual Manner: A Complete Pipeline Paulo Berlanga Neto and Evandro Eduardo Seron Ruiz |
pp. 155‑164 |
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages with Fewer Resources than English Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis and Arantza Casillas |
pp. 165‑172 |
Can the Transformer Be Used as a Drop-in Replacement for RNNs in Text-Generating GANs? Kevin Blin and Andrei Kucharavy |
pp. 173‑181 |
Predicting the Factuality of Reporting of News Media Using Observations about User Attention in Their YouTube Channels Krasimira Bozhanova, Yoan Dinkov, Ivan Koychev, Maria Castaldo, Tommaso Venturini and Preslav Nakov |
pp. 182‑189 |
OCR Processing of Swedish Historical Newspapers Using Deep Hybrid CNN–LSTM Networks Molly Brandt Skelbye and Dana Dannélls |
pp. 190‑198 |
A Psychologically Informed Part-of-Speech Analysis of Depression in Social Media Ana-Maria Bucur, Ioana R. Podina and Liviu P. Dinu |
pp. 199‑207 |
InFoBERT: Zero-Shot Approach to Natural Language Understanding Using Contextualized Word Embedding Pavel Burnyshev, Andrey Bout, Valentin Malykh and Irina Piontkovskaya |
pp. 208‑215 |
Active Learning for Assisted Corpus Construction: A Case Study in Knowledge Discovery from Biomedical Text Hian Cañizares-Díaz, Alejandro Piad-Morffis, Suilan Estevez-Velarde, Yoan Gutiérrez, Yudivián Almeida Cruz, Andres Montoyo and Rafael Muñoz-Guillena |
pp. 216‑225 |
Unsupervised Text Style Transfer with Content Embeddings Keith Carlson, Allen Riddell and Daniel Rockmore |
pp. 226‑233 |
Evaluating Recognizing Question Entailment Methods for a Portuguese Community Question-Answering System about Diabetes Mellitus Thiago Castro Ferreira, João Victor de Pinho Costa, Isabela Rigotto, Vitoria Portella, Gabriel Frota, Ana Luisa A. R. Guimarães, Adalberto Penna, Isabela Lee, Tayane A. Soares, Sophia Rolim, Rossana Cunha, Celso França, Ariel Santos, Rivaney F. Oliveira, Abisague Langbehn, Daniel Hasan Dalip, Marcos André Gonçalves, Rodrigo Bastos Fóscolo and Adriana Pagano |
pp. 234‑243 |
On the Usability of Transformers-based Models for a French Question-Answering Task Oralie Cattan, Christophe Servan and Sophie Rosset |
pp. 244‑255 |
Classification of Code-Mixed Text Using Capsule Networks Shanaka Chathuranga and Surangika Ranathunga |
pp. 256‑263 |
Character-based Thai Word Segmentation with Multiple Attentions Thodsaporn Chay-intr, Hidetaka Kamigaito and Manabu Okumura |
pp. 264‑273 |
Are Language-Agnostic Sentence Representations Actually Language-Agnostic? Yu Chen and Tania Avgustinova |
pp. 274‑280 |
Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting Hee-Soo Choi, Bruno Guillaume, Karën Fort and Guy Perrier |
pp. 281‑290 |
RED: A Novel Dataset for Romanian Emotion Detection from Tweets Alexandra Ciobotaru and Liviu P. Dinu |
pp. 291‑300 |
Assessing the Eligibility of Backtranslated Samples Based on Semantic Similarity for the Paraphrase Identification Task Jean-Philippe Corbeil and Hadi Abdi Ghavidel |
pp. 301‑308 |
Fine-tuning Neural Language Models for Multidimensional Opinion Mining of English-Maltese Social Data Keith Cortis, Kanishk Verma and Brian Davis |
pp. 309‑314 |
Towards an Etymological Map of Romanian Alina Maria Cristea, Anca Dinu, Liviu P. Dinu, Simona Georgescu, Ana Sabina Uban and Laurentiu Zoicas |
pp. 315‑323 |
A Syntax-Aware Edit-based System for Text Simplification Oscar M. Cumbicus-Pineda, Itziar Gonzalez-Dios and Aitor Soroa |
pp. 324‑334 |
On Generating Fact-Infused Question Variations Arthur Deschamps, Sujatha Das Gollapalli and See-Kiong Ng |
pp. 335‑345 |
Event Prominence Extraction Combining a Knowledge-Based Syntactic Parser and a BERT Classifier for Dutch Thierry Desot, Orphee De Clercq and Veronique Hoste |
pp. 346‑357 |
Automatic Detection and Classification of Mental Illnesses from General Social Media Texts Anca Dinu and Andreea-Codrina Moldovan |
pp. 358‑366 |
A Pre-trained Transformer and CNN Model with Joint Language ID and Part-of-Speech Tagging for Code-Mixed Social-Media Text Suman Dowlagar and Radhika Mamidi |
pp. 367‑374 |
Tracing Source Language Interference in Translation with Graph-Isomorphism Measures Koel Dutta Chowdhury, Cristina España-Bonet and Josef van Genabith |
pp. 375‑385 |
Decoupled Transformer for Scalable Inference in Open-domain Question Answering Haytham Elfdaeel and Stanislav Peshterliev |
pp. 386‑393 |
Towards Task-Agnostic Privacy- and Utility-Preserving Models Yaroslav Emelyanov |
pp. 394‑401 |
Knowledge Discovery in COVID-19 Research Literature Ernesto L. Estevanell-Valladares, Suilan Estevez-Velarde, Alejandro Piad-Morffis, Yoan Gutierrez, Andres Montoyo, Rafael Muñoz and Yudivián Almeida Cruz |
pp. 402‑410 |
Online Learning over Time in Adaptive Neural Machine Translation Thierry Etchegoyhen, David Ponce, Harritxu Gete and Victor Ruiz |
pp. 411‑420 |
Improving Character-Aware Neural Language Model by Warming up Character Encoder under Skip-gram Architecture Yukun Feng, Chenlong Hu, Hidetaka Kamigaito, Hiroya Takamura and Manabu Okumura |
pp. 421‑427 |
Interpretable Identification of Cybersecurity Vulnerabilities from News Articles Pierre Frode de la Foret, Stefan Ruseti, Cristian Sandescu, Mihai Dascalu and Sebastien Travadel |
pp. 428‑436 |
Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of Marathi Saurabh Sampatrao Gaikwad, Tharindu Ranasinghe, Marcos Zampieri and Christopher Homan |
pp. 437‑443 |
Relying on Discourse Analysis to Answer Complex Questions by Neural Machine Reading Comprehension Boris Galitsky, Dmitry Ilvovsky and Elizaveta Goncharova |
pp. 444‑453 |
A Dynamic Head Importance Computation Mechanism for Neural Machine Translation Akshay Goindani and Manish Shrivastava |
pp. 454‑462 |
Syntax and Themes: How Context Free Grammar Rules and Semantic Word Association Influence Book Success Henry Gorelick, Biddut Sarker Bijoy, Syeda Jannatus Saba, Sudipta Kar, Md Saiful Islam and Mohammad Ruhul Amin |
pp. 463‑474 |
SocialVisTUM: An Interactive Visualization Toolkit for Correlated Neural Topic Models on Social Media Opinion Mining Gerhard Hagerer, Martin Kirchhoff, Hannah Danner, Robert Pesch, Mainak Ghosh, Archishman Roy, Jiaxi Zhao and Georg Groh |
pp. 475‑482 |
Apples to Apples: A Systematic Evaluation of Topic Models Ismail Harrando, Pasquale Lisena and Raphael Troncy |
pp. 483‑493 |
Claim Verification Using a Multi-GAN Based Model Amartya Hatua, Arjun Mukherjee and Rakesh Verma |
pp. 494‑503 |
Semi-Supervised and Unsupervised Sense Annotation via Translations Bradley Hauer, Grzegorz Kondrak, Yixing Luan, Arnob Mallik and Lili Mou |
pp. 504‑513 |
Personality Predictive Lexical Cues and Their Correlations Xiaoli He and Gerard de Melo |
pp. 514‑523 |
Evaluation Datasets for Cross-lingual Semantic Textual Similarity Tomáš Hercig and Pavel Kral |
pp. 524‑529 |
Relation Extraction Using Multiple Pre-Training Models in Biomedical Domain Satoshi Hiai, Kazutaka Shimada, Taiki Watanabe, Akiva Miura and Tomoya Iwakura |
pp. 530‑537 |
Discussion Structure Prediction Based on a Two-step Method Takumi Himeno and Kazutaka Shimada |
pp. 538‑546 |
On the Usefulness of Personality Traits in Opinion-oriented Tasks Marjan Hosseinia, Eduard Dragut, Dainis Boumber and Arjun Mukherjee |
pp. 547‑556 |
Application of Deep Learning Methods to SNOMED CT Encoding of Clinical Texts: From Data Collection to Extreme Multi-Label Text-Based Classification Anton Hristov, Aleksandar Tahchiev, Hristo Papazov, Nikola Tulechki, Todor Primov and Svetla Boytcheva |
pp. 557‑565 |
Syntax Matters! Syntax-Controlled in Text Style Transfer Zhiqiang Hu, Roy Ka-Wei Lee and Charu C. Aggarwal |
pp. 566‑575 |
Transfer Learning for Czech Historical Named Entity Recognition Helena Hubková and Pavel Kral |
pp. 576‑582 |
Personality Trait Identification Using the Russian Feature Extraction Toolkit James R. Hull, Valerie Novak, C. Anton Rytting, Paul Rodrigues, Victor M. Frank and Matthew Swahn |
pp. 583‑592 |
Semi-Supervised Learning Based on Auto-generated Lexicon Using XAI in Sentiment Analysis Hohyun Hwang and Younghoon Lee |
pp. 593‑600 |
Multiple Teacher Distillation for Robust and Greener Models Artur Ilichev, Nikita Sorokin, Irina Piontkovskaya and Valentin Malykh |
pp. 601‑610 |
BERT Embeddings for Automatic Readability Assessment Joseph Marvin Imperial |
pp. 611‑618 |
Semantic-Based Opinion Summarization Marcio Inácio and Thiago Pardo |
pp. 619‑628 |
Using Collaborative Filtering to Model Argument Selection Sagar Indurkhya |
pp. 629‑639 |
Domain-Specific Japanese ELECTRA Model Using a Small Corpus Youki Itoh and Hiroyuki Shinnou |
pp. 640‑646 |
BERT-PersNER: A New Model for Persian Named Entity Recognition Farane Jalali Farahani and Gholamreza Ghassem-Sani |
pp. 647‑654 |
Cross-lingual Fine-tuning for Abstractive Arabic Text Summarization Mram Kahla, Zijian Győző Yang and Attila Novák |
pp. 655‑663 |
Behavior of Modern Pre-trained Language Models Using the Example of Probing Tasks Ekaterina Kalyaeva, Oleg Durandin and Alexey Malafeev |
pp. 664‑670 |
Towards Quantifying Magnitude of Political Bias in News Articles Using a Novel Annotation Schema Lalitha Kameswari and Radhika Mamidi |
pp. 671‑678 |
Application of Mix-Up Method in Document Classification Task Using BERT Naoki Kikuta and Hiroyuki Shinnou |
pp. 679‑683 |
Translation Memory Retrieval Using Lucene Kwang-hyok Kim, Myong-ho Cho, Chol-ho Ryang, Ju-song Im, Song-yong Cho and Yong-jun Han |
pp. 684‑691 |
Now, It’s Personal: The Need for Personalized Word Sense Disambiguation Milton King and Paul Cook |
pp. 692‑700 |
Multilingual Image Corpus: Annotation Protocol Svetla Koeva |
pp. 701‑707 |
ELERRANT: Automatic Grammatical Error Type Classification for Greek Katerina Korre, Marita Chatzipanagiotou and John Pavlopoulos |
pp. 708‑717 |
Neural Machine Translation for Sinhala-English Code-Mixed Text Archchana Kugathasan and Sagara Sumathipala |
pp. 718‑726 |
Multilingual Multi-Domain NMT for Indian Languages Sourav Kumar, Salil Aggarwal and Dipti Sharma |
pp. 727‑733 |
Fiction in Russian Translation: A Translationese Study Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski and Ruslan Mitkov |
pp. 734‑743 |
Corpus Creation and Language Identification in Low-Resource Code-Mixed Telugu-English Text Siva Subrahamanyam Varma Kusampudi, Anudeep Chaluvadi and Radhika Mamidi |
pp. 744‑752 |
Sentiment Analysis in Code-Mixed Telugu-English Text with Unsupervised Data Normalization Siva Subrahamanyam Varma Kusampudi, Preetham Sathineni and Radhika Mamidi |
pp. 753‑760 |
From Constituency to UD-Style Dependency: Building the First Conversion Tool of Turkish Aslı Kuzgun, Oğuz Kerem Yıldız, Neslihan Cesur, Büşra Marşan, Arife Betül Yenice, Ezgi Sanıyar, Oguzhan Kuyrukçu, Bilge Nas Arıcan and Olcay Taner Yıldız |
pp. 761‑769 |
Making Your Tweets More Fancy: Emoji Insertion to Texts Jingun Kwon, Naoki Kobayashi, Hidetaka Kamigaito, Hiroya Takamura and Manabu Okumura |
pp. 770‑779 |
Addressing Slot-Value Changes in Task-oriented Dialogue Systems through Dialogue Domain Adaptation Tiziano Labruna and Bernardo Magnini |
pp. 780‑789 |
Developing a Clinical Language Model for Swedish: Continued Pretraining of Generic BERT with In-Domain Data Anastasios Lamproudis, Aron Henriksson and Hercules Dalianis |
pp. 790‑797 |
Text Retrieval for Language Learners: Graded Vocabulary vs. Open Learner Model John Lee and Chak Yan Yeung |
pp. 798‑804 |
Transforming Multi-Conditioned Generation from Meaning Representation Joosung Lee |
pp. 805‑813 |
Frustration Level Annotation in Latvian Tweets with Non-Lexical Means of Expression Viktorija Leonova and Janis Zuters |
pp. 814‑823 |
System Combination for Grammatical Error Correction Based on Integer Programming Ruixi Lin and Hwee Tou Ng |
pp. 824‑829 |
Multilingual Learning for Mild Cognitive Impairment Screening from a Clinical Speech Task Hali Lindsay, Philipp Müller, Insa Kröger, Johannes Tröger, Nicklas Linz, Alexandra Konig, Radia Zeghari, Frans RJ Verhey and Inez HGB Ramakers |
pp. 830‑838 |
Naturalness Evaluation of Natural Language Generation in Task-oriented Dialogues Using BERT Ye Liu, Wolfgang Maier, Wolfgang Minker and Stefan Ultes |
pp. 839‑845 |
Towards the Application of Calibrated Transformers to the Unsupervised Estimation of Question Difficulty from Text Ekaterina Loginova, Luca Benedetto, Dries Benoit and Paolo Cremonesi |
pp. 846‑855 |
GeSERA: General-domain Summary Evaluation by Relevance Analysis Jessica López Espejel, Gaël de Chalendar, Jorge Garcia Flores, Thierry Charnois and Ivan Vladimir Meza Ruiz |
pp. 856‑867 |
On the Interaction between Annotation Quality and Classifier Performance in Abusive Language Detection Holly Lopez Long, Alexandra O’Neil and Sandra Kübler |
pp. 868‑875 |
NEREL: A Russian Dataset with Nested Named Entities, Relations and Events Natalia Loukachevitch, Ekaterina Artemova, Tatiana Batura, Pavel Braslavski, Ilia Denisov, Vladimir Ivanov, Suresh Manandhar, Alexander Pugachev and Elena Tutubalina |
pp. 876‑885 |
Active Learning for Interactive Relation Extraction in a French Newspaper’s Articles Cyrielle Mallart, Michel Le Nouy, Guillaume Gravier and Pascale Sébillot |
pp. 886‑894 |
ROFF - A Romanian Twitter Dataset for Offensive Language Mihai Manolescu and Çağrı Çöltekin |
pp. 895‑900 |
Monitoring Fact Preservation, Grammatical Consistency and Ethical Behavior of Abstractive Summarization Neural Models Iva Marinova, Yolina Petrova, Milena Slavcheva, Petya Osenova, Ivaylo Radev and Kiril Simov |
pp. 901‑909 |
Cultural Topic Modelling over Novel Wikipedia Corpora for South-Slavic Languages Filip Markoski, Elena Markoska, Nikola Ljubešić, Eftim Zdravevski and Ljupco Kocarev |
pp. 910‑917 |
Discovery of Multiword Expressions with Loanwords and Their Equivalents in the Persian Language Katarzyna Marszałek-Kowalewska |
pp. 918‑928 |
The Impact of Text Normalization on Multiword Expressions Discovery in Persian Katarzyna Marszałek-Kowalewska |
pp. 929‑939 |
Improving Neural Language Processing with Named Entities Kyoumoto Matsushita, Takuya Makino and Tomoya Iwakura |
pp. 940‑949 |
TREMoLo-Tweets: A Multi-Label Corpus of French Tweets for Language Register Characterization Jade Mekki, Gwénolé Lecorvé, Delphine Battistelli and Nicolas Béchet |
pp. 950‑958 |
Ranking Online Reviews Based on Their Helpfulness: An Unsupervised Approach Alimuddin Melleng, Anna Jurek-Loughrey and Deepak P |
pp. 959‑967 |
incom.py 2.0 - Calculating Linguistic Distances and Asymmetries in Auditory Perception of Closely Related Languages Marius Mosbach, Irina Stenger, Tania Avgustinova, Bernd Möbius and Dietrich Klakow |
pp. 968‑977 |
Not All Linearizations Are Equally Data-Hungry in Sequence Labeling Parsing Alberto Muñoz-Ortiz, Michalina Strzyz and David Vilares |
pp. 978‑988 |
Pre-training a BERT with Curriculum Learning by Increasing Block-Size of Input Text Koichi Nagatsuka, Clifford Broni-Bediako and Masayasu Atsumi |
pp. 989‑996 |
COVID-19 in Bulgarian Social Media: Factuality, Harmfulness, Propaganda, and Framing Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino and Yifan Zhang |
pp. 997‑1009 |
A Second Pandemic? Analysis of Fake News about COVID-19 Vaccines in Qatar Preslav Nakov, Firoj Alam, Shaden Shaar, Giovanni Da San Martino and Yifan Zhang |
pp. 1010‑1021 |
A Hierarchical Entity Graph Convolutional Network for Relation Extraction across Documents Tapas Nayak and Hwee Tou Ng |
pp. 1022‑1030 |
Improving Distantly Supervised Relation Extraction with Self-Ensemble Noise Filtering Tapas Nayak, Navonil Majumder and Soujanya Poria |
pp. 1031‑1039 |
Learning Entity-Likeness with Multiple Approximate Matches for Biomedical NER An Nguyen Le, Hajime Morita and Tomoya Iwakura |
pp. 1040‑1049 |
Extending a Text-to-Pictograph System to French and to Arasaac Magali Norré, Vincent Vandeghinste, Pierrette Bouillon and Thomas François |
pp. 1050‑1059 |
Transfer-based Enrichment of a Hungarian Named Entity Dataset Attila Novák and Borbála Novák |
pp. 1060‑1067 |
One Size Does Not Fit All: Finding the Optimal Subword Sizes for FastText Models across Languages Vít Novotný, Eniafe Festus Ayetiran, Dalibor Bačovský, Dávid Lupták, Michal Štefánik and Petr Sojka |
pp. 1068‑1074 |
CLexIS2: A New Corpus for Complex Word Identification Research in Computing Studies Jenny A. Ortiz Zambrano and Arturo Montejo-Ráez |
pp. 1075‑1083 |
Towards Precise Lexicon Integration in Neural Machine Translation Ogün Öz and Maria Sukhareva |
pp. 1084‑1095 |
OffendES: A New Corpus in Spanish for Offensive Language Research Flor Miriam Plaza-del-Arco, Arturo Montejo-Ráez, L. Alfonso Ureña-López and María-Teresa Martín-Valdivia |
pp. 1096‑1108 |
On Machine Translation of User Reviews Maja Popović, Alberto Poncelas, Marija Brkic and Andy Way |
pp. 1109‑1118 |
Multilingual Coreference Resolution with Harmonized Annotations Ondřej Pražák, Miloslav Konopík and Jakub Sido |
pp. 1119‑1123 |
Predicting Informativeness of Semantic Triples Judita Preiss |
pp. 1124‑1129 |
Unknown Intent Detection Using Multi-Objective Optimization on Deep Learning Classifiers Prerna Prem, Zishan Ahmad, Asif Ekbal, Shubhashis Sengupta, Sakshi C. Jain and Roshni Ramnani |
pp. 1130‑1137 |
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers Pavel Přibáň and Josef Steinberger |
pp. 1138‑1149 |
Metric Learning in Multilingual Sentence Similarity Measurement for Document Alignment Charith Rajitha, Lakmali Piyarathna, Dilan Sachintha and Surangika Ranathunga |
pp. 1150‑1157 |
Multi-label Diagnosis Classification of Swedish Discharge Summaries – ICD-10 Code Assignment Using KB-BERT Sonja Remmer, Anastasios Lamproudis and Hercules Dalianis |
pp. 1158‑1166 |
Siamese Networks for Inference in Malayalam Language Texts Sara Renjit and Sumam Mary Idicula |
pp. 1167‑1173 |
A Call for Clarity in Contemporary Authorship Attribution Evaluation Allen Riddell, Haining Wang and Patrick Juola |
pp. 1174‑1179 |
Varieties of Plain Language Allen Riddell and Yohei Igarashi |
pp. 1180‑1187 |
Word Discriminations for Vocabulary Inventory Prediction Frankie Robertson |
pp. 1188‑1195 |
FrenLyS: A Tool for the Automatic Simplification of French General Language Texts Eva Rolin, Quentin Langlois, Patrick Watrin and Thomas François |
pp. 1196‑1205 |
Spelling Correction for Russian: A Comparative Study of Datasets and Methods Alla Rozovskaya |
pp. 1206‑1216 |
Sentiment-Aware Measure (SAM) for Evaluating Sentiment Transfer by Machine Translation Systems Hadeel Saadany, Constantin Orăsan, Emad Mohamed and Ashraf Tantavy |
pp. 1217‑1226 |
Multilingual Epidemic Event Extraction: From Simple Classification Methods to Open Information Extraction (OIE) and Ontology Sihem Sahnoun and Gaël Lejeune |
pp. 1227‑1233 |
Exploiting Domain-Specific Knowledge for Judgment Prediction Is No Panacea Olivier Salaün, Philippe Langlais and Karim Benyekhlef |
pp. 1234‑1243 |
Masking and Transformer-based Models for Hyperpartisanship Detection in News Javier Sánchez-Junquera, Paolo Rosso, Manuel Montes-y-Gómez and Simone Paolo Ponzetto |
pp. 1244‑1251 |
Serbian NER&Beyond: The Archaic and the Modern Intertwinned Branislava Šandrih Todorović, Cvetana Krstev, Ranka Stanković and Milica Ikonić Nešić |
pp. 1252‑1260 |
A Semi-Supervised Approach to Detect Toxic Comments Ghivvago Damas Saraiva, Rafael Anchiêta, Francisco Assis Ricarte Neto and Raimundo Moura |
pp. 1261‑1267 |
Graph-based Argument Quality Assessment Ekaterina Saveleva, Volha Petukhova, Marius Mosbach and Dietrich Klakow |
pp. 1268‑1280 |
A Hybrid Approach of Opinion Mining and Comparative Linguistic Analysis of Restaurant Reviews Salim Sazzed |
pp. 1281‑1288 |
A Lexicon for Profane and Obscene Text Identification in Bengali Salim Sazzed |
pp. 1289‑1296 |
A Case Study of Deep Learning-Based Multi-Modal Methods for Labeling the Presence of Questionable Content in Movie Trailers Mahsa Shafaei, Christos Smailis, Ioannis Kakadiaris and Thamar Solorio |
pp. 1297‑1307 |
A Domain-Independent Holistic Approach to Deception Detection Sadat Shahriar, Arjun Mukherjee and Omprakash Gnawali |
pp. 1308‑1317 |
Towards Domain-Generalizable Paraphrase Identification by Avoiding the Shortcut Learning Xin Shen and Wai Lam |
pp. 1318‑1325 |
Czert – Czech BERT-like Model for Language Representation Jakub Sido, Ondřej Pražák, Pavel Přibáň, Jan Pašek, Michal Seják and Miloslav Konopík |
pp. 1326‑1338 |
Exploring German Multi-Level Text Simplification Nicolas Spring, Annette Rios and Sarah Ebling |
pp. 1339‑1349 |
Exploring Reliability of Gold Labels for Emotion Detection in Twitter Sanja Stajner |
pp. 1350‑1359 |
How to Obtain Reliable Labels for MBTI Classification from Texts? Sanja Stajner and Seren Yenikent |
pp. 1360‑1368 |
Watching a Language Model Learning Chess Andreas Stöckl |
pp. 1369‑1379 |
Tackling Multilinguality and Internationality in Fake News Andrey Tagarev, Krasimira Bozhanova, Ivelina Nikolova-Koleva and Ivan Ivanov |
pp. 1380‑1386 |
Learning and Evaluating Chinese Idiom Embeddings Minghuan Tan and Jing Jiang |
pp. 1387‑1396 |
Does BERT Understand Idioms? A Probing-Based Empirical Study of BERT Encodings of Idioms Minghuan Tan and Jing Jiang |
pp. 1397‑1407 |
An Empirical Analysis of Topic Models: Uncovering the Relationships between Hyperparameters, Document Length and Performance Measures Silvia Terragni and Elisabetta Fersini |
pp. 1408‑1416 |
TR-SEQ: Named Entity Recognition Dataset for Turkish Search Engine Queries Berkay Topçu and İlknur Durgar El-Kahlout |
pp. 1417‑1422 |
Opinion Prediction with User Fingerprinting Kishore Tumarada, Yifan Zhang, Fan Yang, Eduard Dragut, Omprakash Gnawali and Arjun Mukherjee |
pp. 1423‑1431 |
Can Multilingual Transformers Fight the COVID-19 Infodemic? Lasitha Uyangodage, Tharindu Ranasinghe and Hansi Hettiarachchi |
pp. 1432‑1437 |
Contextual-Lexicon Approach for Abusive Language Detection Francielle Vargas, Fabiana Rodrigues de Góes, Isabelle Carvalho, Fabrício Benevenuto and Thiago Pardo |
pp. 1438‑1447 |
Comparative Analysis of Fine-tuned Deep Learning Language Models for ICD-10 Classification Task for Bulgarian Language Boris Velichkov, Sylvia Vassileva, Simeon Gerginov, Boris Kraychev, Ivaylo Ivanov, Philip Ivanov, Ivan Koychev and Svetla Boytcheva |
pp. 1448‑1454 |
Mistake Captioning: A Machine Learning Approach for Detecting Mistakes and Generating Instructive Feedback Anton Vinogradov, Andrew Miles Byrd and Brent Harrison |
pp. 1455‑1462 |
A Novel Machine Learning Based Approach for Post-OCR Error Detection Shafqat Mumtaz Virk, Dana Dannélls and Azam Sheikh Muhammad |
pp. 1463‑1470 |
A Data-Driven Semi-Automatic Framenet Development Methodology Shafqat Mumtaz Virk, Dana Dannélls, Lars Borin and Markus Forsberg |
pp. 1471‑1479 |
A Deep Learning System for Automatic Extraction of Typological Linguistic Information from Descriptive Grammars Shafqat Mumtaz Virk, Daniel Foster, Azam Sheikh Muhammad and Raheela Saleem |
pp. 1480‑1489 |
Recognizing and Splitting Conditional Sentences for Automation of Business Processes Management Ngoc Phuoc An Vo, Irene Manotas, Octavian Popescu, Algimantas Černiauskas and Vadim Sheinin |
pp. 1490‑1497 |
“Don’t discuss”: Investigating Semantic and Argumentative Features for Supervised Propagandist Message Detection and Classification Vorakit Vorakitphan, Elena Cabrio and Serena Villata |
pp. 1498‑1507 |
ComboNER: A Lightweight All-In-One POS Tagger, Dependency Parser and NER Aleksander Wawer |
pp. 1508‑1514 |
Investigating Annotator Bias in Abusive Language Datasets Maximilian Wich, Christian Widmer, Gerhard Hagerer and Georg Groh |
pp. 1515‑1525 |
Rules Ruling Neural Networks - Neural vs. Rule-Based Grammar Checking for a Low Resource Language Linda Wiechetek, Flammie Pirinen, Mika Hämäläinen and Chiara Argese |
pp. 1526‑1535 |
Transformer with Syntactic Position Encoding for Machine Translation Yikuan Xie, Wenyong Wang, Mingqian Du and Qing He |
pp. 1536‑1544 |
Towards Sentiment Analysis of Tobacco Products’ Usage in Social Media Venkata Himakar Yanamandra, Kartikey Pant and Radhika Mamidi |
pp. 1545‑1552 |
Improving Evidence Retrieval with Claim-Evidence Entailment Fan Yang, Eduard Dragut and Arjun Mukherjee |
pp. 1553‑1558 |
Sentence Structure and Word Relationship Modeling for Emphasis Selection Haoran Yang and Wai Lam |
pp. 1559‑1566 |
Utterance Position-Aware Dialogue Act Recognition Yuki Yano, Akihiro Tamura, Takashi Ninomiya and Hiroaki Obayashi |
pp. 1567‑1574 |
Tell Me What You Read: Automatic Expertise-Based Annotator Assignment for Text Annotation in Expert Domains Hiyori Yoshikawa, Tomoya Iwakura, Kimi Kaneko, Hiroaki Yoshida, Yasutaka Kumano, Kazutaka Shimada, Rafal Rzepka and Patrycja Swieczkowska |
pp. 1575‑1585 |
Abstractive Document Summarization with Word Embedding Reconstruction Jingyi You, Chenlong Hu, Hidetaka Kamigaito, Hiroya Takamura and Manabu Okumura |
pp. 1586‑1596 |
Interpretable Propaganda Detection in News Articles Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James Glass and Preslav Nakov |
pp. 1597‑1605 |
Generic Mechanism for Reducing Repetitions in Encoder-Decoder Models Ying Zhang, Hidetaka Kamigaito, Tatsuya Aoki, Hiroya Takamura and Manabu Okumura |
pp. 1606‑1615 |
Knowledge Distillation with BERT for Image Tag-Based Privacy Prediction Chenye Zhao and Cornelia Caragea |
pp. 1616‑1625 |
Delexicalized Cross-lingual Dependency Parsing for Xibe He Zhou and Sandra Kübler |
pp. 1626‑1635 |
AutoChart: A Dataset for Chart-to-Text Generation Task Jiawen Zhu, Jinye Ran, Roy Ka-Wei Lee, Zhi Li and Kenny Choo |
pp. 1636‑1644 |
A Comparative Study on Abstractive and Extractive Approaches in Summarization of European Legislation Documents Valentin Zmiycharov, Milen Chechev, Gergana Lazarova, Todor Tsonkov and Ivan Koychev |
pp. 1645‑1651 |
Not All Comments Are Equal: Insights into Comment Moderation from a Topic-Aware Model Elaine Zosa, Ravi Shekhar, Mladen Karan and Matthew Purver |
pp. 1652‑1662 |
Last modified on October 9, 2021, 5:00 p.m.