RANLP 2025 Proceedings Home | RANLP 2025 Website | RANLP Website

Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing
Natural Language Processing in the Generative AI Era

Chairs
Galia Angelova
Maria Kunilovskaya
Marie Escribe
Ruslan Mitkov

Full proceedings volume (PDF)
Author index (HTML)
Bibliography (BibTeX)


pdf bib Front matter pages
pdf bib Harnessing Open-Source LLMs for Tender Named Entity Recognition
Asim Abbas, Venelin Kovatchev, Mark Lee, Niloofer Shanavas and Mubashir Ali
pp. 1‑10
pdf bib On the Limitations of Large Language Models (LLMs): False Attribution
Tosin Adewumi, Nudrat Habib, Lama Alkhaled and Elisa Barney
pp. 11‑21
pdf bib Candidate Profile Summarization: A RAG Approach with Synthetic Data Generation for Tech Jobs
Anum Afzal, Ishwor Subedi and Florian Matthes
pp. 22‑31
pdf bib PersianSciQA: A New Dataset for Bridging the Language Gap in Scientific Question Answering
Safoura Aghadavoud Jolfaei, Azadeh Mohebi and Zahra Hemmat
pp. 32‑37
pdf bib Multilingual Pre-training Meets Supervised Neural Machine Translation: A Reproducible Evaluation on English–French and Finnish Translation
Benyamin Ahmadnia, Yeswanth Soma and Hossein Sarrafzadeh
pp. 38‑47
pdf bib Advancing Clinical Translation in Nepali through Fine-Tuned Multilingual Models
Benyamin Ahmadnia, Sumaiya Shaikh, Bibek Poudel, Shazan Mohammed and Sahar Hooshmand
pp. 48‑56
pdf bib Advancing Active Learning with Ensemble Strategies
Naif Alatrush, Sultan Alsarra, Afraa Alshammari, Luay Abdeljaber, Niamat Zawad, Latifur Khan, Patrick T. Brandt, Javier Osorio and Vito D’Orazio
pp. 57‑66
pdf bib Evaluating Large Language Models on Sentiment Analysis in Arabic Dialects
Maram I. Alharbi, Saad Ezzini, Hansi Hettiarachchi, Tharindu Ranasinghe and Ruslan Mitkov
pp. 67‑74
pdf bib From Posts to Predictions: A User-Aware Framework for Faithful and Transparent Detection of Mental Health Risks on Social Media
Hessam Amini and Leila Kosseim
pp. 75‑84
pdf bib Beyond Methods and Datasets Entities: Introducing SH-NER for Hardware and Software Entity Recognition in Scientific Text
Aftab Anjum, Nimra Maqbool and Ralf Krestel
pp. 85‑94
pdf bib Toponym Resolution: Will Prompt Engineering Change Expectations?
Isuri Anuradha, Deshan Koshala Sumanathilaka, Ruslan Mitkov and Paul Rayson
pp. 95‑104
pdf bib HoloBERT: Pre-Trained Transformer Model for Historical Narratives
Isuri Anuradha, Le An Ha and Ruslan Mitkov
pp. 105‑110
pdf bib A Framework for Fine-Tuning LLMs Using Heterogeneous Feedback
Ryan Aponte, Ryan A. Rossi, Shunan Guo, Franck Dernoncourt, Tong Yu, Xiang Chen, Subrata Mitra and Nedim Lipka
pp. 111‑117
pdf bib Chakoshi: A Customizable Guardrail for LLMs with a Focus on Japanese-Language Moderation
Kazuhiro Arai, Ryota Matsui, Kenji Miyama, Yudai Yamamoto, Ren Shibamiya, Kaito Sugimoto and Yoshimasa Iwase
pp. 118‑124
pdf bib KoWit-24: A Richly Annotated Dataset of Wordplay in News Headlines
Alexander Baranov, Anna Palatkina, Yulia Makovka and Pavel Braslavski
pp. 125‑132
pdf bib Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets
Eduard Barbu, Meeri-Ly Muru and Sten Marcus Malva
pp. 133‑142
pdf bib Mitigating Bias in Text Classification via Prompt-Based Text Transformation
Charmaine Barker and Dimitar Kazakov
pp. 143‑149
pdf bib Towards CEFR-targeted Text Simplification for Question Adaptation
Luca Benedetto and Paula Buttery
pp. 150‑157
pdf bib Evaluation of Pretrained and Instruction-Based Pretrained Models for Emotion Detection in Arabic Social Media Text
Md. Rafiul Biswas, Shimaa Ibrahim, Mabrouka Bessghaier and Wajdi Zaghouani
pp. 158‑165
pdf bib Can LLMs Disambiguate Grounded Language? The Case of PP Attachment
John Blackmore and Matthew Stone
pp. 166‑174
pdf bib MLDataForge: Accelerating Large-Scale Dataset Preprocessing and Access for Multimodal Foundation Model Training
Andrea Blasi Núñez, Lukas Paul Achatius Galke and Peter Schneider-Kamp
pp. 175‑183
pdf bib The Impact of Named Entity Recognition on Transformer-Based Multi-Label Dietary Recipe Classification
Kemalcan Bora and Horacio Saggion
pp. 184‑193
pdf bib Balancing the Scales: Addressing Gender Bias in Social Media Toxicity Detection
Beatriz Botella-Gil, Juan Pablo Consuegra-Ayala, Alba Bonet-Jover and Paloma Moreda-Pozo
pp. 194‑203
pdf bib "Simple-Tool": A Tool for the Automatic Transformation of Spanish Texts into Easy-to-Read
Beatriz Botella-Gil, Isabel Espinosa-Zaragoza, Paloma Moreda Pozo and Manuel Palomar
pp. 204‑209
pdf bib QuARK: LLM-Based Domain-Specific Question Answering Using Retrieval Augmented Generation and Knowledge Graphs
Edward Burgin, Sourav Dutta and Mingxue Wang
pp. 210‑217
pdf bib Classifying Emotions in Tweets from the Financial Market: A BERT-based Approach
Wesley Pompeu Carvalho and Norton Trevisan Roman
pp. 218‑226
pdf bib Detecting Changes in Mental Health Status via Reddit Posts in Response to Global Negative Events
Zenan Chen, Judita Preiss and Peter A. Bath
pp. 227‑233
pdf bib APIO: Automatic Prompt Induction and Optimization for Grammatical Error Correction and Text Simplification
Artem Chernodub, Aman Saini, Yejin Huh, Vivek Kulkarni and Vipul Raheja
pp. 234‑239
pdf bib Integrating Archaic and Regional Lexicons to Improve the Readability of Old Romanian Texts
Madalina Chitez, Roxana Rogobete, Cristina Aura Udrea, Karla Csürös, Ana-Maria Bucur and Mihai Dascalu
pp. 240‑246
pdf bib ExPe: Exact Positional Encodings for Generative Transformer Models with Extrapolating Capabilities
Aleksis Ioannis Datseris, Sylvia Vassileva, Ivan K. Koychev and Svetla Boytcheva
pp. 247‑253
pdf bib End-to-End Deep Learning for Named Entity Recognition and Relation Extraction in Gut-Brain Axis PubMed Abstracts
Aleksis Ioannis Datseris, Mario Kuzmanov, Ivelina Nikolova-Koleva, Dimitar Taskov and Svetla Boytcheva
pp. 254‑259
pdf bib Enabling On-Premises Large Language Models for Space Traffic Management
Enrique De Alba
pp. 260‑267
pdf bib Top Ten from Lakhs: A Transformer-based Retrieval System for Identifying Previously Fact-Checked Claims across Multiple Languages
Srijani Debnath, Pritam Pal and Dipankar Das
pp. 268‑274
pdf bib Evaluating Bilingual Lexicon Induction without Lexical Data
Michaela Denisová and Pavel Rychly
pp. 275‑282
pdf bib Utilizing Large Language Models for Focused Conversational Assistants
Shruti Dhavalikar and Karthika Vijayan
pp. 283‑290
pdf bib AntiSemRO: Studying the Romanian Expression of Antisemitism
Anca Dinu, Andreea C. Moldovan and Adina Marincea
pp. 291‑298
pdf bib Towards a Map of Related Words in Romance Languages
Liviu P. Dinu, Ana Sabina Uban, Ioan-Bogdan Iordache, Claudia Vlad, Simona Georgescu, Laurentiu Zoicas and Anca Dinu
pp. 299‑305
pdf bib Decoding Emotion in Ancient Poetry: Leveraging Generative Models for Classical Chinese Sentiment Analysis
Quanqi Du, Loic De Langhe, Els Lefever and Veronique Hoste
pp. 306‑315
pdf bib GRILE: A Benchmark for Grammar Reasoning and Explanation in Romanian LLMs
Marius Dumitran, Angela Dumitran and Alexandra Mihaela Danila
pp. 316‑324
pdf bib PerSpaCor: Correcting Space and ZWNJ Errors in Persian Text with Transformer Models
Matin Ebrahimkhani and Ebrahim Ansari
pp. 325‑333
pdf bib Reddit-V: A Virality Prediction Dataset and Zero-Shot Evaluation with Large Language Models
Samir El-amrany, Matthias R. Brust, Salima Lamsiyah and Pascal Bouvry
pp. 334‑341
pdf bib Simplifications Are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions
Lukas Ellinger, Miriam Anschütz and Georg Groh
pp. 342‑351
pdf bib Multi-LLM Text Summarization
Jiangnan Fang, Cheng-Tse Liu, Jieun Kim, Yash Bhedaru, Ethan Liu, Nikhil Singh, Nedim Lipka, Puneet Mathur, Nesreen K. Ahmed, Franck Dernoncourt, Ryan Rossi and Hanieh Deilamsalehy
pp. 352‑362
pdf bib EDAudio: Easy Data Augmentation for Dialectal Audio
Lea Fischbach, Akbar Karimi, Alfred Lameli and Lucie Flek
pp. 363‑368
pdf bib Authorship Verification Using Cloze Test with Large Language Models
Tomáš Foltýnek, Tomáš Kancko and Pavel Rychly
pp. 369‑377
pdf bib A Culturally-Rich Romanian NLP Dataset from "Who Wants to Be a Millionaire?" Videos
Alexandru Ganea, Antonia-Adelina Popovici and Marius Dumitran
pp. 378‑387
pdf bib Graph-based RAG for Low-Resource Aromanian–Romanian Translation
Laurentiu G. Ghetoiu and Sergiu Nisioi
pp. 388‑394
pdf bib Differential Robustness in Transformer Language Models: Empirical Evaluation under Adversarial Text Attacks
Taniya Gidatkar, Oluwaseun Ajao and Matthew Shardlow
pp. 395‑402
pdf bib An Annotation Scheme for Factuality and Its Application to Parliamentary Proceedings
Gili Goldin, Shira Wigderson, Ella Rabinovich and Shuly Wintner
pp. 403‑412
pdf bib Can We Predict Innovation? Narrow Experts versus Competent Generalists
Amir Hazem and Motohashi Kazuyuki
pp. 413‑422
pdf bib Arabic to Romanian Machine Translation: A Case Study on Distant Language Pairs
Ioan Alexandru Hirica, Stefana Arina Tabusca and Sergiu Nisioi
pp. 423‑432
pdf bib BiGCAT: A Graph-Based Representation Learning Model with LLM Embeddings for Named Entity Recognition
Md. Akram Hossain, Abdul Aziz, Muhammad Anwarul Azim, Abu Nowshed Chy, Md Zia Ullah and Mohammad Khairul Islam
pp. 433‑440
pdf bib Measuring How (Not Just Whether) VLMs Build Common Ground
Saki Imai, Mert Inan, Anthony B. Sicilia and Malihe Alikhani
pp. 441‑451
pdf bib SiLVERScore: Semantically-Aware Embeddings for Sign Language Generation Evaluation
Saki Imai, Mert Inan, Anthony B. Sicilia and Malihe Alikhani
pp. 452‑461
pdf bib Alignment of Historical Manuscript Transcriptions and Translations
Maarten Janssen, Piroska Lendvai and Anna Jouravel
pp. 462‑470
pdf bib Zero-shot OCR Accuracy of Low-Resourced Languages: A Comparative Analysis on Sinhala and Tamil
Nevidu Jayatilleke and Nisansa de Silva
pp. 471‑480
pdf bib Detecting Gender Stereotypical Language Using Model-agnostic and Model-specific Explanations
Manuela Nayantara Jeyaraj and Sarah Jane Delany
pp. 481‑490
pdf bib Reversing Causal Assumptions: Explainability in Online Sports Dialogues
Asteria Kaeberlein and Malihe Alikhani
pp. 491‑500
pdf bib How LLMs Influence Perceived Bias in Journalism
Asteria Kaeberlein and Malihe Alikhani
pp. 501‑510
pdf bib Prompting Techniques for Reducing Social Bias in LLMs through System 1 and System 2 Cognitive Processes
Mahammed Kamruzzaman and Gene Louis Kim
pp. 511‑520
pdf bib Performance Gaps in Acted and Naturalistic Speech: Insights from Speech Emotion Recognition Strategies on Customer Service Calls
Lily Kawaoto, Hita Gupta, Ning Yu and Daniel Dakota
pp. 521‑530
pdf bib Synthetic vs. Gold: The Role of LLM Generated Labels and Data in Cyberbullying Detection
Arefeh Kazemi, Sri Balaaji Natarajan Kalaivendan, Joachim Wagner, Hamza Qadeer, Kanishk Verma and Brian Davis
pp. 531‑540
pdf bib FreeTxt: Analyse and Visualise Multilingual Qualitative Survey Data for Cultural Heritage Sites
Nouran Khallaf, Ignatius Ezeani, Dawn Knight, Paul Rayson, Mo El-Haj, John Vidler, James Davies and Fernando Alva-Manchego
pp. 541‑545
pdf bib GPT-Based Lexical Simplification for Multi-Word Expressions Using Prompt Engineering
Sardar Khan Khayamkhani and Matthew Shardlow
pp. 546‑556
pdf bib Instruction-Tuning LLaMA for Synthetic Medical Note Generation in Swedish and English
Lotta Kiefer, Jesujoba Alabi, Thomas Vakili, Hercules Dalianis and Dietrich Klakow
pp. 557‑566
pdf bib Output Trend Analysis in Semantic Classification of Katakana Words Using a Large Language Model
Kazuki Kodaki and Minoru Sasaki
pp. 567‑571
pdf bib Domain Knowledge Distillation for Multilingual Sentence Encoders in Cross-lingual Sentence Similarity Estimation
Risa Kondo, Hiroki Yamauchi, Tomoyuki Kajiwara, Marie Katsurai and Takashi Ninomiya
pp. 572‑577
pdf bib Am I Blue or Is My Hobby Counting the Teardrops? Expression Leakage in Large Language Models as a Symptom of Irrelevancy Disruption
Berkay Kopru, Mehrzad Mashal, Yigit Gurses, Akos Kadar, Maximilian Schmitt, Ditty Mathew, Felix Burkhardt, Florian Eyben and Björn W. Schuller
pp. 578‑586
pdf bib Fusion of Object-Centric and Linguistic Features for Domain-Adapted Multimodal Learning
Jordan Konstantinov Kralev
pp. 587‑594
pdf bib Multi-Agent Reinforcement Learning for Interactive Code Debugging with Human Feedback and Memory
Anjana Krishnamoorthy, Kartik Ivatury and Benyamin Ahmadnia
pp. 595‑603
pdf bib Integrating Large Language Models for Comprehensive Study and Sentiment Analysis of Student Feedback
Jana Kuzmanova, Katerina Zdravkova and Ivan Chorbev
pp. 604‑613
pdf bib Task-Oriented Dialogue Systems through Function Calling
Tiziano Labruna, Giovanni Bonetta and Bernardo Magnini
pp. 614‑622
pdf bib When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively
Tiziano Labruna, Jon Ander Campos and Gorka Azkune
pp. 623‑632
pdf bib Trust but Verify: A Comprehensive Survey of Faithfulness Evaluation Methods in Abstractive Text Summarization
Salima Lamsiyah, Aria Nourbakhsh and Christoph Schommer
pp. 633‑643
pdf bib Evaluating Large Language Models on Multiword Expressions in Multilingual and Code-Switched Contexts
Frances Adriana Laureano De Leon, Asim Abbas, Harish Tayyar Madabushi and Mark Lee
pp. 644‑653
pdf bib Instruction Finetuning to Attribute Language Stage, Dialect, and Provenance Region to Historical Church Slavic Texts
Piroska Lendvai, Uwe Reichel, Anna Jouravel, Achim Rabus and Elena Renje
pp. 654‑662
pdf bib MariATE: Automatic Term Extraction Using Large Language Models in the Maritime Domain
Shijie Liu, Els Lefever and Veronique Hoste
pp. 663‑673
pdf bib Exploring the Usage of Knowledge Graphs in Identifying Human and LLM-Generated Fake Reviews
Ming Liu and Massimo Poesio
pp. 674‑681
pdf bib The Evaluation of Medical Terms Complexity Using Lexical Features and Large Language Models
Liliya Makhmutova, Giancarlo Dondoni Salton, Fernando Perez-Tellez and Robert J. Ross
pp. 682‑693
pdf bib Where and How as Key Factors for Knowledge-Enhanced Constrained Commonsense Generation
Ivan Martinez-Murillo, Paloma Moreda Pozo and Elena Lloret
pp. 694‑703
pdf bib Forecasting Online Negativity Spikes with Multilingual Transformers for Strategic Decision-Making
Rowan Martnishn, Vishal Green, Varun Kadari, Shravan Athikinasetti, Zach miller, Julia Brady, Viraj Chawda and Nikhil Badlani
pp. 704‑710
pdf bib C-SHAP: Collocation-Aware Explanations for Financial NLP
Martina Menzio, Elisabetta Fersini and Davide Paris
pp. 711‑717
pdf bib Investigating Polarization in YouTube Comments via Aspect-Based Sentiment Analysis
Daniel Miehling, Daniel Dakota and Sandra Kübler
pp. 718‑728
pdf bib From the Tractatus Logico-Philosophicus to Later Wittgenstein: An NLP-Based Comparative Analysis
Andreiana Mihail, Silviu-Florin Gheorghe, Andrei Fotea and Liviu P. Dinu
pp. 729‑736
pdf bib Towards Intention-aligned Reviews Summarization: Enhancing LLM Outputs with Pragmatic Cues
Maria Miro Maestre, Robiert Sepulveda-Torres, Ernesto Luis Estevanell-Valladares, Armando Suarez Cueto and Elena Lloret
pp. 737‑747
pdf bib Subtle Shifts, Significant Threats: Leveraging XAI Methods and LLMs to Undermine Language Models Robustness
Adrián Moreno Muñoz, L. Alfonso Ureñ-López and Eugenio Martínez Cámara
pp. 748‑757
pdf bib Fast Thinking with Structured Prompts: Enabling LLM Reasoning without Chain-of-Thought Generation
Kirill Morozov, Liubov Chubarova and Irina Piontkovskaya
pp. 758‑766
pdf bib T2Know: Analysis and Trend Platform Using the Knowledge Extracted from Scientific Texts
Rafael Muñoz Guillena, Manuel Palomar, Yoan Gutiérrez and Mar Bonora
pp. 767‑770
pdf bib Investigating Large Language Models’ (LLMs) Capabilities for Sexism Detection on a Low-Resource Language
Lutfiye Seda Mut Altin and Horacio Saggion
pp. 771‑779
pdf bib PolyHope-M at RANLP2025 Subtask-1 Binary Hope Speech Detection: Spanish Language Classification Approach with Comprehensive Learning Using Transformer, and Traditional ML, and DL
Md. Julkar Naeen, Sourav Kumar Das, Sharun Akter Khushbu, Shahriar Sultan Ramit and Alaya Parven Alo
pp. 780‑786
pdf bib F-LoRA-QA: Finetuning LLaMA Models with Low-Rank Adaptation for French Botanical Question Generation and Answering
Ayoub Nainia, Régine Vignes-Lebbe, Hajar Mousannif and Jihad Zahir
pp. 787‑796
pdf bib Reverse Prompting: A Novel Computational Paradigm in Schizophrenia Based on Large Language Models
Ivan Nenchev, Christiane Montag and Sandra Anna Just
pp. 797‑806
pdf bib A Survey on Small Language Models
Chien Van Nguyen, Xuan Shen, Ryan Aponte, Yu Xia, Samyadeep Basu, Zhengmian Hu, Jian Chen, Mihir Parmar, Sasidhar Kunapuli, Joe Barrow3, Junda Wu, Ashish Singh, Yu Wang, Jiuxiang Gu, Nesreen K. Ahmed, Nedim Lipka, Ruiyi Zhang, Xiang Chen, Tong Yu, Sungchul Kim, Hanieh Deilamsalehy, Namyong Park, Michael Rimer, Zhehao Zhang, Huanrui Yang, Puneet Mathur, Gang Wu, Franck Dernoncourt, Ryan Rossi and Thien Huu Nguyen
pp. 807‑821
pdf bib Quantifying the Overlap: Attribution Maps and Linguistic Heuristics in Encoder-Decoder Machine Translation Models
Aria Nourbakhsh, Salima Lamsiyah and Christoph Schommer
pp. 822‑831
pdf bib The Illusion of a Perfect Metric: Why Evaluating AI´S Words Is Harder than It Looks
Maria Paz Oliva, Adriana D. Correia, Ivan Vankov and Viktor Botev
pp. 832‑842
pdf bib Multi-LLM Debiasing Framework
Deonna M. Owens, Ryan Rossi, Sungchul Kim, Tong Yu, Franck Dernoncourt, Xiang Chen, Ruiyi Zhang, Jiuxiang Gu, Hanieh Deilamsalehy and Nedim Lipka
pp. 843‑853
pdf bib Toward Quantum-Enhanced Natural Language Understanding: Sarcasm and Claim Detection with QLSTM
Pritam Pal and Dipankar Das
pp. 854‑859
pdf bib Legal Terminology Extraction in Spanish: Gold-standard Generation and LLM Evaluation
Lucia Palacios Palacios, Beatriz Guerrero García, Patricia Martín Chozas and Elena Montiel Ponsoda
pp. 860‑869
pdf bib Benchmarking Item Difficulty Classification in German Vocational Education and Training
Alonso Palomino and Benjamin Paassen
pp. 870‑875
pdf bib Isolating LLM Performance Gains in Pre-training versus Instruction-tuning for Mid-resource Languages: The Ukrainian Benchmark Study
Yurii Paniv
pp. 876‑883
pdf bib Evaluating LLMs on Deceptive Text across Cultures
Katerina Papantoniou, Panagiotis Papadakos and Dimitris Plexousakis
pp. 884‑893
pdf bib Annotating Hate Speech towards Identity Groups
Donnie Parent, Nina Georgiades, Charvi Mishra, Khaled Mohammed and Sandra Kübler
pp. 894‑899
pdf bib On the Interaction of Identity Hate Classification and Data Bias
Donnie Parent, Nina Georgiades, Charvi Mishra, Khaled Mohammed and Sandra Kübler
pp. 900‑906
pdf bib Financial News as a Proxy of European Central Bank Interest Rate Adjustments
Davide Paris, Martina Menzio and Elisabetta Fersini
pp. 907‑914
pdf bib Generating and Analyzing Disfluency in a Code-Mixed Setting
Aryan Paul, Tapabrata Mondal, Dipankar Das and Sivaji Bandyopadhyay
pp. 915‑924
pdf bib A Low-Resource Speech-Driven NLP Pipeline for Sinhala Dyslexia Assistance
Peshala Sandali Perera and Deshan Koshala Sumanathilaka
pp. 925‑933
pdf bib Evaluating Transliteration Ambiguity in Adhoc Romanized Sinhala: A Dataset for Transliteration Disambiguation
Sandun Sameera Perera and Deshan Koshala Sumanathilaka
pp. 934‑942
pdf bib Detecting Deception in Disinformation across Languages: The Role of Linguistic Markers
Alba Perez-Montero, Silvia Gargova, Elena Lloret and Paloma Moreda Pozo
pp. 943‑952
pdf bib Enhancing Transformer-Based Rerankers with Synthetic Data and LLM-Based Supervision
Dimitar Peshevski, Kiril Blazhevski, Martin Popovski and Gjorgji Madjarov
pp. 953‑961
pdf bib Q&A-LF : A French Question-Answering Benchmark for Measuring Fine-Grained Lexical Knowledge
Alexander Petrov, Alessandra Thais Mancas, Viviane Binet, Antoine Venant, Francois Lareau, Yves Lepage and Phillippe Langlais
pp. 962‑969
pdf bib Analysis of Vocabulary and Subword Tokenization Settings for Optimal Fine-tuning of MT: A Case Study of In-domain Translation
Javad Pourmostafa Roshan Sharami, Dimitar Shterionov and Pieter Spronck
pp. 970‑979
pdf bib LLM-based Embedders for Prior Case Retrieval
Damith Premasiri, Tharindu Ranasinghe and Ruslan Mitkov
pp. 980‑988
pdf bib Exploiting Primacy Effect to Improve Large Language Models
Bianca Raimondi and Maurizio Gabbrielli
pp. 989‑997
pdf bib Alankaar: A Dataset for Figurativeness Understanding in Bangla
Geetanjali Rakshit and Jeffrey Flanigan
pp. 998‑1002
pdf bib ASQ: Automatically Generating Question-Answer Pairs Using AMRs
Geetanjali Rakshit and Jeffrey Flanigan
pp. 1003‑1011
pdf bib Multi-LLM Verification for Question Answering under Conflicting Contexts
Geetanjali Rakshit and Jeffrey Flanigan
pp. 1012‑1021
pdf bib Comparative Analysis of Human and Large Language Model Performance in Pharmacology Multiple-Choice Questions
Ricardo Rodriguez, Stéphane Huet, Benoit Favre and Mickael Rouvier
pp. 1022‑1029
pdf bib Enhancing Textual Understanding: Automated Claim Span Identification in English, Hindi, Bengali, and CodeMix
Rudra Roy, Pritam Pal, Dipankar Das, Saptarshi Ghosh and Biswajit Paul
pp. 1030‑1035
pdf bib Detecting Fake News in the Era of Language Models
Muhammad Irfan Fikri Sabri, Hansi Hettiarachchi and Tharindu Ranasinghe
pp. 1036‑1043
pdf bib Cyberbullying Detection via Aggression-Enhanced Prompting
Aisha Saeid, Anu Sabu, Girish Koushik, Ferrante Neri and Diptesh Kanojia
pp. 1044‑1052
pdf bib Lingdex.org:Leveraging LLMs to Structure and Explore Linguistic Olympiad Puzzles for Learning and Teaching Linguistics
Jonathan Sakunkoo and Annabella Sakunkoo
pp. 1053‑1057
pdf bib When Does Language Transfer Help? Sequential Fine-Tuning for Cross-Lingual Euphemism Detection
Julia Sammartino, Libby Barak, Jing Peng and Anna Feldman
pp. 1058‑1065
pdf bib Modelling the Relative Contributions of Stylistic Features in Forensic Authorship Attribution
G. Çağatay Sat, John Blake and Evgeny Pyshkin
pp. 1066‑1073
pdf bib The Hidden Cost of Structure: How Constrained Decoding Affects Language Model Performance
Maximilian Schall and Gerard de Melo
pp. 1074‑1084
pdf bib A Question-Answering Based Framework/Metric for Evaluation of Newspaper Article Summarization
Vasanth Seemakurthy, Shashank Sundar, Siddharth Arvind, Siddhant Jagdish and Ashwini M. Joshi
pp. 1085‑1089
pdf bib Efficient Financial Fraud Detection on Mobile Devices Using Lightweight Large Language Models
Lakpriya Senevirathna and Deshan Koshala Sumanathilaka
pp. 1090‑1098
pdf bib Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems
Lia Shahnazaryan, Patrick Simianer and Joern Wuebker
pp. 1099‑1108
pdf bib Exposing Pink Slime Journalism: Linguistic Signatures and Robust Detection against LLM-Generated Threats
Sadat Shahriar, Navid Ayoobi, Arjun Mukherjee, Mostafa Musharrat and Sai Vishnu Vamsi Senagasetty
pp. 1109‑1117
pdf bib The Erosion of LLM Signatures: Can We Still Distinguish Human and LLM-Generated Scientific Ideas after Iterative Paraphrasing?
Sadat Shahriar, Navid Ayoobi and Arjun Mukherjee
pp. 1118‑1126
pdf bib Deep Language Geometry: Constructing a Metric Space from LLM Weights
Maksym Shamrai and Vladyslav Hamolia
pp. 1127‑1136
pdf bib Cross-Lingual Fact Verification: Analyzing LLM Performance Patterns across Languages
Hanna Shcharbakova, Tatiana Anikina, Natalia Skachkova and Josef van Genabith
pp. 1137‑1147
pdf bib ESAQueryRank: Ranking Query Interpretations for Document Retrieval Using Explicit Semantic Analysis
Avijeet Shil and Wei Jin
pp. 1148‑1152
pdf bib Personalized Author Obfuscation with Large Language Models
Mohammad Shokri, Sarah Ita Levitan and Rivka Levitan
pp. 1153‑1162
pdf bib Bulgarian Event Extraction with LLMs
Kiril Simov, Nikolay Paev, Petya Osenova and Stefan Marinov
pp. 1163‑1171
pdf bib FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Ashish Singh, Ashutosh Singh, Prateek Agarwal, Zixuan Huang, Arpita Singh, Tong Yu, Sungchul Kim, Victor Soares Bursztyn, Nesreen K. Ahmed, Puneet Mathur, Erik Learned-Miller, Franck Dernoncourt and Ryan Rossi
pp. 1172‑1182
pdf bib LLM Compression: How Far Can We Go in Balancing Size and Performance?
Sahil Sk, Debashish Dhal, Sonal Khosla, Akash Dhaka, Shantipriya Parida, Sk Shahid, Sambit Shekhar, Dilip Prasad and Ondrej Bojar
pp. 1183‑1187
pdf bib Pushing the (Generative) Envelope: Measuring the Effect of Prompt Technique and Temperature on the Generation of Model-based Systems Engineering Artifacts
Erin Smith Crabb, Cedric Bernard, Matthew Jones and Daniel Dakota
pp. 1188‑1194
pdf bib Dutch CrowS-Pairs: Adapting a Challenge Dataset for Measuring Social Biases in Language Models for Dutch
Elza Strazda and Gerasimos Spanakis
pp. 1195‑1204
pdf bib The Challenge of Performing Ontology-driven Entity Extraction in Real-world Unstructured Textual Data from the Domain of Dementia
Sumaiya Suravee, Carsten Oliver Schmidt and Kristina Yordanova
pp. 1205‑1214
pdf bib Recognizing the Structure and Content of Hungarian Civil Registers
Kata Ágnes Szűcs, Noémi Vadász and Zsolt Béla Záros
pp. 1215‑1223
pdf bib Optimism, Pessimism, and the Language between: Model Interpretability and Psycholinguistic Profiling
Stefana Arina Tabusca and Liviu P. Dinu
pp. 1224‑1231
pdf bib Demographic Features for Annotation-Aware Classification
Narjes Tahaei and Sabine Bergler
pp. 1232‑1236
pdf bib Exploring the Performance of Large Language Models for Event Detection and Extraction in the Health Domain
Hristo Tanev, Nicolas Stefanovitch, Tomáš Harmatha and Diana F. Sousa
pp. 1237‑1247
pdf bib Leveraging LLaMa for Abstractive Text Summarisation in Malayalam: An Experimental Study
Hristo Tanev, Anitha S. Pillai and Revathy V. R
pp. 1248‑1255
pdf bib Building a Clean Bartangi Language Corpus and Training Word Embeddings for Low-Resource Language Modeling
Warda Tariq, Victor Popov and Vasilii Gromov
pp. 1256‑1262
pdf bib A Deep Dive into Multi-Head Attention and Multi-Aspect Embedding
Maryam Teimouri, Jenna Kanerva and Filip Ginter
pp. 1263‑1270
pdf bib A Linguistically-informed Comparison between Multilingual BERT and Language-specific BERT Models: The Case of Differential Object Marking in Romanian
Maria Tepei and Jelke Bloem
pp. 1271‑1281
pdf bib PoliStance-TR: A Dataset for Turkish Stance Detection in Political Domain
Muhammed Cihat Unal, Yasemin Sarkın, Alper Karamanlioglu and Berkan Demirel
pp. 1282‑1288
pdf bib Towards Safer Hebrew Communication: A Dataset for Offensive Language Detoxification
Natalia Vanetik, Lior Liberov, Marina Litvak and Chaya Liebeskind
pp. 1289‑1298
pdf bib AIDEN: Automatic Speaker Notes Creation and Navigation for Enhancing Online Learning Experience
Stalin Varanasi, Umer Butt, Guenter Neumann and Josef van Genabith
pp. 1299‑1303
pdf bib Using LLMs for Multilingual Clinical Entity Linking to ICD-10
Sylvia Vassileva, Ivan K. Koychev and Svetla Boytcheva
pp. 1304‑1308
pdf bib Aspect–Sentiment Quad Prediction with Distilled Large Language Models
Filippos Karolos Ventirozos, Peter Appleby and Matthew Shardlow
pp. 1309‑1319
pdf bib SENTimental - a Simple Multilingual Sentiment Annotation Tool
John Vidler, Paul Rayson and Dawn Knight
pp. 1320‑1326
pdf bib Anonymise: A Tool for Multilingual Document Pseudonymisation
Rinalds Vīksna and Inguna Skadina
pp. 1327‑1332
pdf bib Revealing Gender Bias in Language Models through Fashion Image Captioning
Maria Villalba-Oses, Victoria Muñoz-Garcia and Juan Pablo Consuegra-Ayala
pp. 1333‑1340
pdf bib Benchmarking Korean Idiom Understanding: A Comparative Analysis of Local and Global Models
Xiaonan Wang, Seoyoon Park and Hansaem Kim
pp. 1341‑1351
pdf bib TinyMentalLLMs Enable Depression Detection in Chinese Social Media Texts
JINYUAN XU, Tian LAN, Mathieu Valette, Pierre Magistry and LEI LI
pp. 1352‑1363
pdf bib Prompt Engineering for Nepali NER: Leveraging Hindi-Capable LLMs for Low-Resource Languages
Dipendra Yadav, Sumaiya Suravee, Stefan Kemnitz, Tobias Strauss and Kristina Yordanova
pp. 1364‑1373
pdf bib Seeing, Signing, and Saying: A Vision-Language Model-Assisted Pipeline for Sign Language Data Acquisition and Curation from Social Media
Shakib Yazdani, Yasser HAMIDULLAH, Cristina España-Bonet and Josef van Genabith
pp. 1374‑1384
pdf bib Visual Priming Effect on Large-scale Vision Language Models
Daiki Yoshida, Haruki Sakajo, Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe
pp. 1385‑1395
pdf bib From Courtroom to Corpora: Building a Name Entity Corpus for Urdu Legal Texts
Adeel Zafar, Sohail Ashraf and Slawomir Nowaczyk
pp. 1396‑1405
pdf bib EmoHopeSpeech: An Annotated Dataset of Emotions and Hope Speech in English and Arabic
Wajdi Zaghouani and Md. Rafiul Biswas
pp. 1406‑1412
pdf bib An Annotated Corpus of Arabic Tweets for Hate Speech Analysis
Wajdi Zaghouani and Md. Rafiul Biswas
pp. 1413‑1419
pdf bib Strategies for Efficient Retrieval-augmented Generation in Clinical Domains with RAPTOR: A Benchmarking Study
Xumou Zhang, Qixuan Hu, Jinman Kim and Adam G. Dunn
pp. 1420‑1429
pdf bib LLM-Based Product Recommendation with Prospect Theoretic Self Alignment Strategy
Manying Zhang, Zehua Cheng and Damien Nouvel
pp. 1430‑1436
pdf bib Branching Out: Exploration of Chinese Dependency Parsing with Fine-tuned Large Language Models
He Zhou, Emmanuele Chersoni and Yu-Yin Hsu
pp. 1437‑1445

Last modified on December 15, 2025, 6:55 a.m.