Important Dates
Pre-submission mentorship application |
|
Pre-submission mentorship feedback |
|
Submission deadline |
|
Reviews due |
|
Acceptance notification |
|
Camera-ready due |
|
Workshop | July 27-Aug 1st (With Main Conference) |
Archival
Advancing African-Accented English Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR Models
Bonaventure F. P. Dossou
Beyond the Gold Standard in Analytic Automated Essay Scoring
Gabrielle Gaudeau
Confidence and Stability of Global and Pairwise Scores in NLP Evaluation
Georgii Levtsov; Dmitry Ustalov
Zero-shot prompt-based classification: topic labeling in times of foundation models in German Tweets
Simon Münker; Kai Kugler; Achim Rettinger
Rethinking Full Finetuning from Pretraining Checkpoints in Active Learning for African Languages
Bonaventure F. P. Dossou; Ines Arous; Jackie CK Cheung
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren; Yihong Liu; Hinrich Schuetze
SEPSIS: I Can Catch Your Lies – A New Paradigm for Deception Detection
Anku Rani; Dwip Dalal; Shreya Gautam; Pankaj Gupta; Vinija Jain; Aman Chadha; Amit Sheth; Amitava Das
Can Multi-turn Self-refined Single Agent LMs with Retrieval Solve Hard Coding Problems?
Md Tanzib Hosain; Md Kishor Morol
Do Androids Question Electric Sheep? A Multi-Agent Cognitive Simulation of Philosophical Reflection on Hybrid Table Reasoning
Yiran Rex Ma
Grouped Sequency-arranged Rotation: Optimizing Rotation Transformation for Quantization for Free
Euntae Choi; Sumin Song; Woosang Lim; Sungjoo Yoo
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
Karahan Sarıtaş; Çağatay Yıldız
Transforming Brainwaves into Language: EEG Microstates Meet Text Embedding Models for Dementia Detection
Quoc-Toan Nguyen; Linh Le; Xuan-The Tran; Dorothy Bai; Nghia Duong-Trung; Thomas Do; Chin-teng Lin
Neuron-Level Language Tag Injection Improves Zero-Shot Translation Performance
Jay Orten; Ammon Shurtz; Nancy Fulda; Stephen D. Richardson
Voices of Dissent: A Multimodal Analysis of Protest Songs through Lyrics and Audio
Utsav Shekhar; Radhika Mamidi
Your Pretrained Model Tells the Difficulty Itself: A Self-Adaptive Curriculum Learning Paradigm for Natural Language Understanding
Qi Feng; Yihong Liu; Hinrich Schuetze
CausalGraphBench: a Benchmark for Evaluating Language Models capabilities of Causal Graph discovery
Nikolay Babakov; Ehud Reiter; Alberto Bugarín-Diz
Reasoning for Translation: Comparative Analysis of Chain-of-Thought and Tree-of-Thought Prompting for LLM Translation
Lam Nguyen; Yang Xu
iPrOp: Interactive Prompt Optimization for Large Language Models with a Human in the Loop
Jiahui Li; Roman Klinger
Evaluating Structured Output Robustness of Small Language Models for Open Attribute-Value Extraction from Clinical Notes
Nikita Neveditsin; Pawan Lingras; Vijay Kumar Mago
FaithfulSAE: Towards Capturing Faithful Features with Sparse Autoencoders without External Datasets Dependency
Seonglae Cho; Harryn Oh; Donghyun Lee; Luis Rodrigues Vieira; Andrew Bermingham; Ziad El Sayed
Translating Movie Subtitles by Large Language Models using Movie-meta Information
Ashmari Pramodya; Yusuke Sakai; Justin Vasselli; Hidetaka Kamigaito; Taro Watanabe
Pun2Pun: Benchmarking LLMs on Textual-Visual Chinese-English Pun Translation via Pragmatics Model and Linguistic Reasoning
Yiran Rex Ma; Shan Huang; Yuting Xu; Ziyu Zhou; Yuanxi Wei
Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages
Daniil Gurgurov; Ivan Vykopal; Josef van Genabith; Simon Ostermann
Exploring the Effect of Nominal Compound Structure in Scientific Texts on Reading Times of Experts and Novices
Isabell Landwehr; Marie-Pauline Krielke; Stefania Degaetano-Ortlieb
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks
Amir Saeidi; Shivanshu Verma; Md Nayem Uddin; Chitta Baral
From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems
Youngjoon Jang; Seongtae Hong; Junyoung Son; Sungjin Park; Chanjun Park; Heuiseok Lim
Quantifying the Influence of Irrelevant Contexts on Political Opinions Produced by LLMs
Samuele D’Avenia; Valerio Basile
Making Sense of Korean Sentences: A Comprehensive Evaluation of LLMs through KoSEnd Dataset
Seunguk Yu; Kyeonghyun Kim; JungMin Yun; YoungBin Kim
Towards Multi-Perspective NLP Systems: A Thesis Proposal
Benedetta Muscato
Enhancing Software Requirements Engineering with Language Models and Prompting Techniques: Insights from the Current Research and Future Directions
Moemen Ebrahim; Shawkat Guirguis; Christine Basta
Question Decomposition for Retrieval-Augmented Generation
Paul J. L. Ammann; Jonas Golde; Alan Akbik
Neural Machine Translation for Agglutinative Languages via Data Rejuvenation
Chen Zhao; Yatu Ji; Ren Qing-Dao-Er-Ji; Nier Wu; Lei Shi; Fu Liu; Yepai Jia
StRuCom: A Novel Dataset of Structured Code Comments in Russian
Maria Dziuba; Valentin Malykh
A Semantic Uncertainty Sampling Strategy for Back-Translation in Low-Resources Neural Machine Translation
Yepai Jia, Yatu Ji, Xiang Xue, Lei Shi, Qing-Dao-Er-Ji Ren, Nier Wu, Na Liu, Chen Zhao, Fu Liu
Spanish Dialect Classification: A Comparative Study of Linguistically Tailored Features, Unigrams and BERT Embeddings
Laura Zeidler; Chris Jenkins; Filip Miletić; Sabine Schulte im Walde
SequentialBreak: Large Language Models Can be Fooled by Embedding Jailbreak Prompts into Sequential Prompt Chains
Bijoy Ahmed Saiem; MD Sadik Hossain Shanto; Rakib Ahsan; Md Rafi Ur Rashid
A Dual-Layered Evaluation of Geopolitical and Cultural Bias in LLMs
Sean Kim; Hyuhng Joon Kim
MA-COIR: Leveraging Semantic Search Index and Generative Models for Ontology-Driven Biomedical Concept Recognition
Shanshan liu; Noriki Nishida; Rumana Ferdous Munne; Narumi Tokunaga; Yuki Yamagata; Kouji Kozaki; Yuji Matsumoto
LibVulnWatch: A Deep Assessment Agent System and Leaderboard for Uncovering Hidden Vulnerabilities in Open-Source AI Libraries
Zekun Wu; Seonglae Cho; Umar Mohammed; CRISTIAN ENRIQUE MUNOZ VILLALOBOS; Kleyton Da Costa; Xin Guan; Theo King; Ze Wang; Emre Kazim; Adriano Koshiyama
Interactive Text Games: Lookahead Is All You Need!
Hosein Rezaei; James Alfred Walker; Frank Soboczenski
Evaluating Credibility and Political Bias in LLMs for News Outlets in Bangladesh
Tabia Tanzin Prama; Md. Saiful Islam
The Evolution of Gen Alpha Slang: Linguistic Patterns and AI Translation Challenges
Ishita; Radhika Mamidi
Light-Weight Hallucination Detection using Contrastive Learning for Conditional Text Generation
Miyu Yamada; Yuki Arase
Fact from Fiction: Finding Serialized Novels in Newspapers
Pascale Feldkamp; Alie Lassche; Katrine Frøkjær Baunvig; Kristoffer Nielbo; Yuri Bizzoni
Cross-Genre Learning for Old English Poetry POS Tagging
Irene Miani; Sara Stymne; Gregory R. Darwin
A Computational Framework to Identify Self-Aspects in Text
Jaya Caporusso; Matthew Purver; Senja Pollak
Prompting the Muse: Generating Prosodically-Correct Latin Speech with Large Language Models
Michele Ciletti
Can a Large Language Model Keep My Secrets? A Study on LLM-Controlled Agents
Niklas Hemken; Sai Koneru; Florian Jacob; Hannes Hartenstein; Jan Niehues
Chart Question Answering from Real-World Analytical Narratives
Maeve Hutchinson; Radu Jianu; Aidan Slingsby; Jo Wood; Pranava Madhyastha
Low-Perplexity LLM-Generated Sequences and Where To Find Them
Arthur Wuhrmann; Andrei Kucharavy; Anastasiia Kucherenko
CoLeM: A framework for semantic interpretation of Russian-language tables based on contrastive learning
Kirill Tobola; Nikita Dorodnykh
Mitigating Hallucination by Integrating Knowledge Graphs into LLM Inference – a Systematic Literature Review
Robin Wagner; Emanuel Kitzelmann; Ingo Boersch
Semantic alignment in hyperbolic space for fine-grained emotion classification
Ashish Kumar; Durga Toshniwal
I Speak for the Árboles: Developing a Dependency Treebank for Spanish L2 and Heritage Speakers
Emiliana Pulido; Robert Pugh; Zoey Liu
Evaluating Tokenizer Adaptation Methods for Large Language Models on Low-Resource Programming Languages
Georgy Andryushchenko; Vladimir V. Ivanov
Learning and Enforcing Context-Sensitive Control for LLMs
Mohammad Albinhassan; Pranava Madhyastha; Mark Law; Alessandra Russo
When Will the Tokens End? Graph-Based Forecasting for LLMs Output Length
Grzegorz Piotrowski; Mateusz Bystroński; Mikołaj Hołysz; Jakub Binkowski; Grzegorz Chodak; Tomasz Jan Kajdanowicz
Only for the Unseen Languages, Say the Llamas: On the Efficacy of Language Adapters for Cross-lingual Transfer in English-centric LLMs
Julian Schlenker; Jenny Kunz; Tatiana Anikina; Günter Neumann; Simon Ostermann
HyILR: Hyperbolic Instance-Specific Local Relationships for Hierarchical Text Classification
Ashish Kumar; Durga Toshniwal
Are LLMs Truly Graph-Savvy? A Comprehensive Evaluation of Graph Generation
Ege Demirci; Rithwik Kerur; Ambuj Singh
Pragmatic Perspective on Assessing Implicit Meaning Interpretation in Sentiment Analysis Models
Rashid Mustafin
Foundations of PEERS: Assessing LLM Role Performance in Educational Simulations
Jasper Meynard Arana; Kristine Ann M. Carandang; Ethan Robert Casin; Christian Alis; Daniel Stanley Tan; Erika Fille Legara; Christopher Monterola
The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering
Yi-Jie Cheng; Oscar Chew; Yun-Nung Chen
Bridging the Embodiment Gap in Agricultural Knowledge Representation for Language Models
Vasu Jindal; Huijin Ju; Zili Lyu
Building Japanese Creativity Benchmarks and Applying them to Enhance LLM Creativity
So Fukuda; Hayato Ogawa; Kaito Horio; Daisuke Kawahara; Tomohide Shibata
Towards Robust Sentiment Analysis of Temporally-Sensitive Policy-Related Online Text
Charles Alba; Benjamin C Warner; Akshar Saxena; Jiaxin Huang; Ruopeng An
Is Partial Linguistic Information Sufficient for Discourse Connective Disambiguation? A Case Study of Concession
Takuma Sato; Ai Kubota; Koji Mineshima
Semantic Frame Induction from a Real-World Corpus
Shogo Tsujimoto; Kosuke Yamada; Ryohei Sasano
Lost and Found: Computational Quality Assurance of Crowdsourced Knowledge on Morphological Defectivity in Wiktionary
Jonathan Sakunkoo; Annabella Sakunkoo
Improving Explainability of Sentence-level Metrics via Edit-level Attribution for Grammatical Error Correction
Takumi Goto; Justin Vasselli; Taro Watanabe
Proposal: From One-Fit-All to Perspective Aware Modeling
Leixin Zhang
Controlling Language Confusion in Multilingual LLMs
Nahyun Lee; Yeongseo Woo; Hyunwoo Ko; Guijin Son
Grammatical Error Correction via Sequence Tagging for Russian
Regina Nasyrova; Alexey Sorokin
DRUM: Learning Demonstration Retriever for Large MUlti-modal Models
Ellen Yi-Ge; Jiechao Gao; Wei Han; Wei Zhu
GerMedIQ: A Resource for Simulated and Synthesized Anamnesis Interview Responses in German
Justin Hofenbitzer; Sebastian Schöning; Sebastian Belle; Jacqueline Lammert; Luise Modersohn; Martin Boeker; Diego Frassinelli
Unstructured Minds, Predictable Machines: A Comparative Study of Narrative Cohesion in Human and LLM Stream-of-Consciousness Writing
Nellia Dzhubaeva; Katharina Trinley; Laura Pissani
Exploiting contextual information to improve stance detection in informal political discourse with LLMs
Arman Engin Sucu; Yixiang Zhou; Mario A. Nascimento; Tony Mullen
A Framework for Fine-Grained Complexity Control in Health Answer Generation
Daniel Jorge Bernardo Ferreira; Tiago Almeida; Sérgio Matos
QA Analysis in Medical and Legal Domains: A Survey of Data Augmentation in Low-Resource Settings
Benedictus Kent Rachmat; Thomas Gerald; Zheng Zhang SLB; Cyril Grouin
Time-LlaMA: Adapting Large Language Models for Time Series Modeling via Dynamic Low-rank Adaptation
Juyuan Zhang; Jiechao Gao; Wenwen Ouyang; Wei Zhu; Hui Yi Leong
RusConText Benchmark: A Russian Language Evaluation Benchmark for Understanding Context
Andrey Chirkin; Svetlana Kuznetsova; Maria Volina; Anna Dengina
GenDLN: Evolutionary Algorithm-Based Stacked LLM Framework for Joint Prompt Optimization
Pia Chouayfati; Niklas Herbster; Ábel Domonkos Sáfrán; Matthias Grabmair
Sign Language Video Segmentation Using Temporal Boundary Identification
Kavu Maithri Rao; Yasser HAMIDULLAH; Eleftherios Avramidis
LIP-NER: Literal Patterns Benefit LLM-Based NER
Ruiqi Li; Li Chen
Testing English News Articles for Lexical Homogenization Due to Widespread Use of Large Language Models
Sarah Fitterer; Dominik Gangl; Jannes Ulbrich
Bridging the Data Gap in Financial Sentiment: LLM-Driven Augmentation
Rohit Kumar; Chandan Nolbaria
Non-Archival
Bias Amplification: Large Language Models as Increasingly Biased Media
Ze Wang, Zekun Wu, Jeremy Zhang, Xin Guan, Navya Jain, Skylar Lu, Saloni Gupta, Adriano Koshiyama
LayerNorm vs RMSNorm: A Geometric Perspective and the Case Against Mean Subtraction
Akshat Gupta; Atahan Ozdemir; Caoqinwei Gong; Gopala Anumanchipalli
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
Jaydeep Borkar, Matthew Jagielski, Katherine Lee, Niloofar Mireshghallah, David A. Smith, Christopher A. Choquette-Choo
CHENGYU-BENCH: Benchmarking Large Language Models for Chinese Idiom Understanding and Use
Yicheng Fu; Zhemin Huang; Liuxin Yang; Yumeng Lu; Zhongdongming Dai
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
Tiancheng Hu, Joachim Baumann, Lorenzo Lupo, Nigel Collier, Dirk Hovy, Paul Röttger
Adversarial Tokenization
Renato Geh; Zilei Shao; Guy Van den Broeck
Unwrapping Circularity: Can Transformers Learn Languages with Circular Schemes?
Xiutian Zhao; Aulia Rafi; Siying Chen; Xiulin Yang
Tree-of-Report: Table-to-Text Generation for Sports Game Reports with Tree-Structured Prompting
Shang-Hsuan Chiang; Tsan-Tsung Yang; Kuang-Da Wang; Wei-Yao Wang; An-Zi Yen; Wen-Chih Peng
From Directions to Cones: Multidimensional Representations of Propositional Facts in LLMs
Stanley Yu; Vaidehi Bulusu; Clayton Lau; Oscar S. Yasunaga; Cole Blondin; Vasu Sharma; Kevin Zhu; Sean O’Brien
NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts
Abhay Gupta, Michael Lu, Kevin Zhu, Sean O’Brien, Vasu Sharma
Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning
Seungjun Yi; Joakim Nguyen; Huimin Xu; Terence Lim; Andrew Well; Mia Markey; Ying Ding
Causal Language Control in Multilingual Transformers via Sparse Feature Steering
Cheng-Ting Chou; George Liu; Jessica Sun; Cole Blondin; Kevin Zhu; Vasu Sharma; Sean O’Brien
Semantic Convergence: Investigating Shared Representations Across Scaled LLMs
Daniel Son; Sanjana Rathore; Andrew Rufail; Adrian Simon; Daniel Zhang; Soham Dave; Cole Blondin; Sean O’Brien; Kevin Zhu
Can LLMs Contribute to Social Inclusion? A Zero-Shot Analysis of Homelessness Bias Detection on Reddit
Jonathan A. Karr Jr.; Benjamin F. Herbst; Matthew Hauenstein; Georgina Curto; Nitesh V Chawla
Do LLMs Understand Wine Descriptors Across Cultures? A Benchmark for Cultural Adaptions of Wine Reviews
Chenye Zou, Xingyue Wen, Qian Janice Wang, Daniel Hershcovich
Direct Confidence Alignment: Aligning Verbalized Confidence with Internal Confidence In Large Language Models
Glenn Zhang; Treasure Mayowa; Jason Fan; Yicheng Fu; Aaron Sandoval; Sean O’Brien; Kevin Zhu
Overcoming Self-Imposed Limits: Five Words to Break an LLM’s Context Compression Barrier
Lin-Wei Chao; Kuang-Da Wang; Wen-Chih Peng