Important Dates

* All deadlines are calculated at 11:59 pm
UTC-12 hours ("anywhere on Earth")

Pre-submission mentorship application Mar 27 (Thu), 2025
Pre-submission mentorship feedback May 1 (Thu), 2025
Submission deadline May 18 (Sun), 2025
Reviews due Jun 6 (Fri), 2025
Acceptance notification Jun 21 (Sat), 2025
Camera-ready due Jul 1 (Tue), 2025
Workshop July 27-Aug 1st (With Main Conference)

Archival

Advancing African-Accented English Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR Models
Bonaventure F. P. Dossou

Beyond the Gold Standard in Analytic Automated Essay Scoring
Gabrielle Gaudeau

Confidence and Stability of Global and Pairwise Scores in NLP Evaluation
Georgii Levtsov; Dmitry Ustalov

Zero-shot prompt-based classification: topic labeling in times of foundation models in German Tweets
Simon Münker; Kai Kugler; Achim Rettinger

Rethinking Full Finetuning from Pretraining Checkpoints in Active Learning for African Languages
Bonaventure F. P. Dossou; Ines Arous; Jackie CK Cheung

HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren; Yihong Liu; Hinrich Schuetze

SEPSIS: I Can Catch Your Lies – A New Paradigm for Deception Detection
Anku Rani; Dwip Dalal; Shreya Gautam; Pankaj Gupta; Vinija Jain; Aman Chadha; Amit Sheth; Amitava Das

Can Multi-turn Self-refined Single Agent LMs with Retrieval Solve Hard Coding Problems?
Md Tanzib Hosain; Md Kishor Morol

Do Androids Question Electric Sheep? A Multi-Agent Cognitive Simulation of Philosophical Reflection on Hybrid Table Reasoning
Yiran Rex Ma

Grouped Sequency-arranged Rotation: Optimizing Rotation Transformation for Quantization for Free
Euntae Choi; Sumin Song; Woosang Lim; Sungjoo Yoo

A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
Karahan Sarıtaş; Çağatay Yıldız

Transforming Brainwaves into Language: EEG Microstates Meet Text Embedding Models for Dementia Detection
Quoc-Toan Nguyen; Linh Le; Xuan-The Tran; Dorothy Bai; Nghia Duong-Trung; Thomas Do; Chin-teng Lin

Neuron-Level Language Tag Injection Improves Zero-Shot Translation Performance
Jay Orten; Ammon Shurtz; Nancy Fulda; Stephen D. Richardson

Voices of Dissent: A Multimodal Analysis of Protest Songs through Lyrics and Audio
Utsav Shekhar; Radhika Mamidi

Your Pretrained Model Tells the Difficulty Itself: A Self-Adaptive Curriculum Learning Paradigm for Natural Language Understanding
Qi Feng; Yihong Liu; Hinrich Schuetze

CausalGraphBench: a Benchmark for Evaluating Language Models capabilities of Causal Graph discovery
Nikolay Babakov; Ehud Reiter; Alberto Bugarín-Diz

Reasoning for Translation: Comparative Analysis of Chain-of-Thought and Tree-of-Thought Prompting for LLM Translation
Lam Nguyen; Yang Xu

iPrOp: Interactive Prompt Optimization for Large Language Models with a Human in the Loop
Jiahui Li; Roman Klinger

Evaluating Structured Output Robustness of Small Language Models for Open Attribute-Value Extraction from Clinical Notes
Nikita Neveditsin; Pawan Lingras; Vijay Kumar Mago

FaithfulSAE: Towards Capturing Faithful Features with Sparse Autoencoders without External Datasets Dependency
Seonglae Cho; Harryn Oh; Donghyun Lee; Luis Rodrigues Vieira; Andrew Bermingham; Ziad El Sayed

Translating Movie Subtitles by Large Language Models using Movie-meta Information
Ashmari Pramodya; Yusuke Sakai; Justin Vasselli; Hidetaka Kamigaito; Taro Watanabe

Pun2Pun: Benchmarking LLMs on Textual-Visual Chinese-English Pun Translation via Pragmatics Model and Linguistic Reasoning
Yiran Rex Ma; Shan Huang; Yuting Xu; Ziyu Zhou; Yuanxi Wei

Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages
Daniil Gurgurov; Ivan Vykopal; Josef van Genabith; Simon Ostermann

Exploring the Effect of Nominal Compound Structure in Scientific Texts on Reading Times of Experts and Novices
Isabell Landwehr; Marie-Pauline Krielke; Stefania Degaetano-Ortlieb

Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks
Amir Saeidi; Shivanshu Verma; Md Nayem Uddin; Chitta Baral

From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems
Youngjoon Jang; Seongtae Hong; Junyoung Son; Sungjin Park; Chanjun Park; Heuiseok Lim

Quantifying the Influence of Irrelevant Contexts on Political Opinions Produced by LLMs
Samuele D’Avenia; Valerio Basile

Making Sense of Korean Sentences: A Comprehensive Evaluation of LLMs through KoSEnd Dataset
Seunguk Yu; Kyeonghyun Kim; JungMin Yun; YoungBin Kim

Towards Multi-Perspective NLP Systems: A Thesis Proposal
Benedetta Muscato

Enhancing Software Requirements Engineering with Language Models and Prompting Techniques: Insights from the Current Research and Future Directions
Moemen Ebrahim; Shawkat Guirguis; Christine Basta

Question Decomposition for Retrieval-Augmented Generation
Paul J. L. Ammann; Jonas Golde; Alan Akbik

Neural Machine Translation for Agglutinative Languages via Data Rejuvenation
Chen Zhao; Yatu Ji; Ren Qing-Dao-Er-Ji; Nier Wu; Lei Shi; Fu Liu; Yepai Jia

StRuCom: A Novel Dataset of Structured Code Comments in Russian
Maria Dziuba; Valentin Malykh

A Semantic Uncertainty Sampling Strategy for Back-Translation in Low-Resources Neural Machine Translation
Yepai Jia, Yatu Ji, Xiang Xue, Lei Shi, Qing-Dao-Er-Ji Ren, Nier Wu, Na Liu, Chen Zhao, Fu Liu

Spanish Dialect Classification: A Comparative Study of Linguistically Tailored Features, Unigrams and BERT Embeddings
Laura Zeidler; Chris Jenkins; Filip Miletić; Sabine Schulte im Walde

SequentialBreak: Large Language Models Can be Fooled by Embedding Jailbreak Prompts into Sequential Prompt Chains
Bijoy Ahmed Saiem; MD Sadik Hossain Shanto; Rakib Ahsan; Md Rafi Ur Rashid

A Dual-Layered Evaluation of Geopolitical and Cultural Bias in LLMs
Sean Kim; Hyuhng Joon Kim

MA-COIR: Leveraging Semantic Search Index and Generative Models for Ontology-Driven Biomedical Concept Recognition
Shanshan liu; Noriki Nishida; Rumana Ferdous Munne; Narumi Tokunaga; Yuki Yamagata; Kouji Kozaki; Yuji Matsumoto

LibVulnWatch: A Deep Assessment Agent System and Leaderboard for Uncovering Hidden Vulnerabilities in Open-Source AI Libraries
Zekun Wu; Seonglae Cho; Umar Mohammed; CRISTIAN ENRIQUE MUNOZ VILLALOBOS; Kleyton Da Costa; Xin Guan; Theo King; Ze Wang; Emre Kazim; Adriano Koshiyama

Interactive Text Games: Lookahead Is All You Need!
Hosein Rezaei; James Alfred Walker; Frank Soboczenski

Evaluating Credibility and Political Bias in LLMs for News Outlets in Bangladesh
Tabia Tanzin Prama; Md. Saiful Islam

The Evolution of Gen Alpha Slang: Linguistic Patterns and AI Translation Challenges
Ishita; Radhika Mamidi

Light-Weight Hallucination Detection using Contrastive Learning for Conditional Text Generation
Miyu Yamada; Yuki Arase

Fact from Fiction: Finding Serialized Novels in Newspapers
Pascale Feldkamp; Alie Lassche; Katrine Frøkjær Baunvig; Kristoffer Nielbo; Yuri Bizzoni

Cross-Genre Learning for Old English Poetry POS Tagging
Irene Miani; Sara Stymne; Gregory R. Darwin

A Computational Framework to Identify Self-Aspects in Text
Jaya Caporusso; Matthew Purver; Senja Pollak

Prompting the Muse: Generating Prosodically-Correct Latin Speech with Large Language Models
Michele Ciletti

Can a Large Language Model Keep My Secrets? A Study on LLM-Controlled Agents
Niklas Hemken; Sai Koneru; Florian Jacob; Hannes Hartenstein; Jan Niehues

Chart Question Answering from Real-World Analytical Narratives
Maeve Hutchinson; Radu Jianu; Aidan Slingsby; Jo Wood; Pranava Madhyastha

Low-Perplexity LLM-Generated Sequences and Where To Find Them
Arthur Wuhrmann; Andrei Kucharavy; Anastasiia Kucherenko

CoLeM: A framework for semantic interpretation of Russian-language tables based on contrastive learning
Kirill Tobola; Nikita Dorodnykh

Mitigating Hallucination by Integrating Knowledge Graphs into LLM Inference – a Systematic Literature Review
Robin Wagner; Emanuel Kitzelmann; Ingo Boersch

Semantic alignment in hyperbolic space for fine-grained emotion classification
Ashish Kumar; Durga Toshniwal

I Speak for the Árboles: Developing a Dependency Treebank for Spanish L2 and Heritage Speakers
Emiliana Pulido; Robert Pugh; Zoey Liu

Evaluating Tokenizer Adaptation Methods for Large Language Models on Low-Resource Programming Languages
Georgy Andryushchenko; Vladimir V. Ivanov

Learning and Enforcing Context-Sensitive Control for LLMs
Mohammad Albinhassan; Pranava Madhyastha; Mark Law; Alessandra Russo

When Will the Tokens End? Graph-Based Forecasting for LLMs Output Length
Grzegorz Piotrowski; Mateusz Bystroński; Mikołaj Hołysz; Jakub Binkowski; Grzegorz Chodak; Tomasz Jan Kajdanowicz

Only for the Unseen Languages, Say the Llamas: On the Efficacy of Language Adapters for Cross-lingual Transfer in English-centric LLMs
Julian Schlenker; Jenny Kunz; Tatiana Anikina; Günter Neumann; Simon Ostermann

HyILR: Hyperbolic Instance-Specific Local Relationships for Hierarchical Text Classification
Ashish Kumar; Durga Toshniwal

Are LLMs Truly Graph-Savvy? A Comprehensive Evaluation of Graph Generation
Ege Demirci; Rithwik Kerur; Ambuj Singh

Pragmatic Perspective on Assessing Implicit Meaning Interpretation in Sentiment Analysis Models
Rashid Mustafin

Foundations of PEERS: Assessing LLM Role Performance in Educational Simulations
Jasper Meynard Arana; Kristine Ann M. Carandang; Ethan Robert Casin; Christian Alis; Daniel Stanley Tan; Erika Fille Legara; Christopher Monterola

The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering
Yi-Jie Cheng; Oscar Chew; Yun-Nung Chen

Bridging the Embodiment Gap in Agricultural Knowledge Representation for Language Models
Vasu Jindal; Huijin Ju; Zili Lyu

Building Japanese Creativity Benchmarks and Applying them to Enhance LLM Creativity
So Fukuda; Hayato Ogawa; Kaito Horio; Daisuke Kawahara; Tomohide Shibata

Towards Robust Sentiment Analysis of Temporally-Sensitive Policy-Related Online Text
Charles Alba; Benjamin C Warner; Akshar Saxena; Jiaxin Huang; Ruopeng An

Is Partial Linguistic Information Sufficient for Discourse Connective Disambiguation? A Case Study of Concession
Takuma Sato; Ai Kubota; Koji Mineshima

Semantic Frame Induction from a Real-World Corpus
Shogo Tsujimoto; Kosuke Yamada; Ryohei Sasano

Lost and Found: Computational Quality Assurance of Crowdsourced Knowledge on Morphological Defectivity in Wiktionary
Jonathan Sakunkoo; Annabella Sakunkoo

Improving Explainability of Sentence-level Metrics via Edit-level Attribution for Grammatical Error Correction
Takumi Goto; Justin Vasselli; Taro Watanabe

Proposal: From One-Fit-All to Perspective Aware Modeling
Leixin Zhang

Controlling Language Confusion in Multilingual LLMs
Nahyun Lee; Yeongseo Woo; Hyunwoo Ko; Guijin Son

Grammatical Error Correction via Sequence Tagging for Russian
Regina Nasyrova; Alexey Sorokin

DRUM: Learning Demonstration Retriever for Large MUlti-modal Models
Ellen Yi-Ge; Jiechao Gao; Wei Han; Wei Zhu

GerMedIQ: A Resource for Simulated and Synthesized Anamnesis Interview Responses in German
Justin Hofenbitzer; Sebastian Schöning; Sebastian Belle; Jacqueline Lammert; Luise Modersohn; Martin Boeker; Diego Frassinelli

Unstructured Minds, Predictable Machines: A Comparative Study of Narrative Cohesion in Human and LLM Stream-of-Consciousness Writing
Nellia Dzhubaeva; Katharina Trinley; Laura Pissani

Exploiting contextual information to improve stance detection in informal political discourse with LLMs
Arman Engin Sucu; Yixiang Zhou; Mario A. Nascimento; Tony Mullen

A Framework for Fine-Grained Complexity Control in Health Answer Generation
Daniel Jorge Bernardo Ferreira; Tiago Almeida; Sérgio Matos

QA Analysis in Medical and Legal Domains: A Survey of Data Augmentation in Low-Resource Settings
Benedictus Kent Rachmat; Thomas Gerald; Zheng Zhang SLB; Cyril Grouin

Time-LlaMA: Adapting Large Language Models for Time Series Modeling via Dynamic Low-rank Adaptation
Juyuan Zhang; Jiechao Gao; Wenwen Ouyang; Wei Zhu; Hui Yi Leong

RusConText Benchmark: A Russian Language Evaluation Benchmark for Understanding Context
Andrey Chirkin; Svetlana Kuznetsova; Maria Volina; Anna Dengina

GenDLN: Evolutionary Algorithm-Based Stacked LLM Framework for Joint Prompt Optimization
Pia Chouayfati; Niklas Herbster; Ábel Domonkos Sáfrán; Matthias Grabmair

Sign Language Video Segmentation Using Temporal Boundary Identification
Kavu Maithri Rao; Yasser HAMIDULLAH; Eleftherios Avramidis

LIP-NER: Literal Patterns Benefit LLM-Based NER
Ruiqi Li; Li Chen

Testing English News Articles for Lexical Homogenization Due to Widespread Use of Large Language Models
Sarah Fitterer; Dominik Gangl; Jannes Ulbrich

Bridging the Data Gap in Financial Sentiment: LLM-Driven Augmentation
Rohit Kumar; Chandan Nolbaria

Non-Archival

Bias Amplification: Large Language Models as Increasingly Biased Media
Ze Wang, Zekun Wu, Jeremy Zhang, Xin Guan, Navya Jain, Skylar Lu, Saloni Gupta, Adriano Koshiyama

LayerNorm vs RMSNorm: A Geometric Perspective and the Case Against Mean Subtraction
Akshat Gupta; Atahan Ozdemir; Caoqinwei Gong; Gopala Anumanchipalli

Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
Jaydeep Borkar, Matthew Jagielski, Katherine Lee, Niloofar Mireshghallah, David A. Smith, Christopher A. Choquette-Choo

CHENGYU-BENCH: Benchmarking Large Language Models for Chinese Idiom Understanding and Use
Yicheng Fu; Zhemin Huang; Liuxin Yang; Yumeng Lu; Zhongdongming Dai

SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
Tiancheng Hu, Joachim Baumann, Lorenzo Lupo, Nigel Collier, Dirk Hovy, Paul Röttger

Adversarial Tokenization
Renato Geh; Zilei Shao; Guy Van den Broeck

Unwrapping Circularity: Can Transformers Learn Languages with Circular Schemes?
Xiutian Zhao; Aulia Rafi; Siying Chen; Xiulin Yang

Tree-of-Report: Table-to-Text Generation for Sports Game Reports with Tree-Structured Prompting
Shang-Hsuan Chiang; Tsan-Tsung Yang; Kuang-Da Wang; Wei-Yao Wang; An-Zi Yen; Wen-Chih Peng

From Directions to Cones: Multidimensional Representations of Propositional Facts in LLMs
Stanley Yu; Vaidehi Bulusu; Clayton Lau; Oscar S. Yasunaga; Cole Blondin; Vasu Sharma; Kevin Zhu; Sean O’Brien

NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts
Abhay Gupta, Michael Lu, Kevin Zhu, Sean O’Brien, Vasu Sharma

Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning
Seungjun Yi; Joakim Nguyen; Huimin Xu; Terence Lim; Andrew Well; Mia Markey; Ying Ding

Causal Language Control in Multilingual Transformers via Sparse Feature Steering
Cheng-Ting Chou; George Liu; Jessica Sun; Cole Blondin; Kevin Zhu; Vasu Sharma; Sean O’Brien

Semantic Convergence: Investigating Shared Representations Across Scaled LLMs
Daniel Son; Sanjana Rathore; Andrew Rufail; Adrian Simon; Daniel Zhang; Soham Dave; Cole Blondin; Sean O’Brien; Kevin Zhu

Can LLMs Contribute to Social Inclusion? A Zero-Shot Analysis of Homelessness Bias Detection on Reddit
Jonathan A. Karr Jr.; Benjamin F. Herbst; Matthew Hauenstein; Georgina Curto; Nitesh V Chawla

Do LLMs Understand Wine Descriptors Across Cultures? A Benchmark for Cultural Adaptions of Wine Reviews
Chenye Zou, Xingyue Wen, Qian Janice Wang, Daniel Hershcovich

Direct Confidence Alignment: Aligning Verbalized Confidence with Internal Confidence In Large Language Models
Glenn Zhang; Treasure Mayowa; Jason Fan; Yicheng Fu; Aaron Sandoval; Sean O’Brien; Kevin Zhu

Overcoming Self-Imposed Limits: Five Words to Break an LLM’s Context Compression Barrier
Lin-Wei Chao; Kuang-Da Wang; Wen-Chih Peng