In the last ten years, significant advances in deep learning models and the development of Large Language Models (LLMs) have revolutionized the fields of computational linguistics (CL) and natural language processing (NLP). In turn, this has led to a complete re-assessment of the language resources and evaluation practices necessary for training LLMs and analyzing their outputs.
In particular, the availability of very large amounts of unstructured data for training foundation models has come into focus, while the value of high-quality structured linguistic data, richly annotated at multiple levels of linguistic analysis, has been downplayed by comparison. However, as CL and NLP practitioners engage further with LLMs and debate their strengths and weaknesses, the importance of high-quality, structured linguistic data is being re-emphasized.
The proposed workshop is related to the Treebanks and Linguistic Theories (TLT) conference series and the more recent SyntaxFest venue. The Workshop on Structured Linguistic Data is conceived as both a continuation of that tradition and an adaptation to the new realities of an LLM-dominated research landscape.
Topics include but are not limited to:
Presentations: