Skip links

The Joint 3rd International Conference on Natural Language Processing for Digital Humanities & 8th International Workshop on Computational Linguistics for Uralic Languages NLP4DH & IWCLUL 2023

The 3rd International Conference on Natural Language Processing for Digital Humanities (NLP4DH 2023) will be organized together with the 8th International Workshop on Computational Linguistics for Uralic Languages (IWCLUL 2023). The proceedings of the joint event will be published in the ACL anthology. The conference will take place December 1-3, 2023 in Tokyo, Japan at Waseda University.

The focus of NLP4DH is on applying natural language processing techniques to digital humanities research. The topics can be anything of digital humanities interest with a natural language processing or generation aspect. A list of suitable topics includes but is not limited to:

  • Text analysis and processing related to humanities using computational methods
  • Thorough error analysis of an NLP system using (digital) humanities methods
  • Dataset creation and curation for NLP (e.g. digitization, digitalization, datafication, and data preservation).
  • Research on cultural heritage collections such as national archives and libraries using NLP
  • NLP for error detection, correction, normalization and denoising data
  • Generation and analysis of literary works such as poetry and novels
  • Analysis and detection of text genres

The purpose of IWCLUL is to bring together researchers working on computational approaches to Uralic languages (e.g. Finnish, Hungarian, Estonian, Võro, the Sámi languages, Komi (Zyrian, Permyak), Mordvin (Erzya, Moksha), Mari (Hill, Meadow), Udmurt, Nenets (Tundra, Forest), Enets, Nganasan, Selkup, Mansi, Khanty, Veps, Karelian (Olonets), Karelian, Ingrian (Izhorian), Votic, Livonian and Ludic). All Uralic languages exhibit rich morphological structure, which makes processing them challenging for state-of-the-art computational linguistic approaches, the majority also suffer from a lack of resources and many are endangered. Appropriate topics include (but are not limited to):

  • Parsers, analysers and processing pipelines of Uralic languages
  • Lexical databases, electronic dictionaries
  • Finished end-user applications aimed at Uralic languages, such as spelling or grammar checkers, machine translation or speech processing
  • Evaluation methods and gold standards, tagged corpora, treebanks
  • Reports on language-independent or unsupervised methods as applied to Uralic languages
  • Surveys and review articles on subjects related to computational linguistics for one or more Uralic languages
  • Any work that aims at combining efforts and reducing duplication of work
  • How to elicit activity from the language community, agitation campaigns, games with a purpose

Paper submission

We solicit original and unpublished work related to digital humanities and natural language processing (NLP4DH) and NLP approaches for Uralic languages (IWCLUL). Short papers can be up to 4 pages in length and long papers up to 8 pages. Both submission formats can have an unlimited number of pages for references. All submissions must follow the ACL stylesheet (Overleaf template).

The submissions must be anonymous and they will be peer-reviewed by our program committee. The peer review is double blinded.

Papers must be submitted using OpenReview by the submission deadline. At least one of the authors of an accepted paper must attend the event to present the paper. There will be no registration/publication fees.

Accepted papers (short and long) will be published in the proceedings that will appear in the ACL Anthology. Accepted papers will also be given an additional page to address the reviewers’ comments. The length of a camera ready submission can then be 5 pages for a short paper and 9 for a long paper with an unlimited number of pages for references.

The authors of the accepted papers will be invited to submit an extended version of their workshop paper to a special issue in the Journal of Data Mining & Digital Humanities.

Lightning talk submission

You may also contribute to the event by submitting a lightning talk. Lightning talks are submitted as 750-word abstracts. Lightning talks are suited for discussing ideas or presenting work in progress. The abstracts will not be published or indexed and will only be made available on the conference website.

Lightning talks must be submitted using this Google Form.

Remote attendance

We aim for an inclusive event and we understand that some people have difficulties to travel. If you have a valid reason why you cannot attend the event in person (visa issues, health issues etc.) you may present your paper remotely. Note: lightning talks are ineligible for remote presentation.

Important dates

  • Paper submission (full and short): October 8, 2023
  • Lightning talk submission: October 31, 2023
  • Notification of acceptance: November 3, 2023
  • Camera ready deadline: November 17, 2023
  • Registration deadline (authors): November 17, 2023
  • Registration deadline (participants) November 29, 2023
  • Conference: December 1-3, 2023

All times are Anywhere on Earth (AoE).


Conference room BF1 at Waseda University Research Innovation Center (Building No.121)

513, Wasedatsurumakicho, Shinjuku-ku, Tokyo, 162-0041, JAPAN

Organizers (NLP4DH)

Mika Hämäläinen, Rootroo Ltd and Metropolia University of Applied Sciences

Emily Öhman, Waseda University

So Miyagawa, National Institute for Japanese Language and Linguistics

Khalid Alnajjar, Rootroo Ltd

Yuri Bizzoni, Arhus University

Organizers (IWCLUL)

Flammie Pirinen, UiT The Arctic University of Norway

Niko Partanen, University of Helsinki

Jack Rueter, University of Helsinki

You can contact us by email

Program committee

Aynat Rubinstein, The Hebrew University of Jerusalem
Leo Leppänen, University of Helsinki
Kenichi Iwatsuki, KTTA
Lidia Pivovarova, University of Helsinki
Linda Wiechetek, University of Tromsø
Jouni Tuominen, University of Helsinki
Mikko Kurimo, Aalto University
Balázs Indig, Eötvös Lorand University
Pierre Magistry, Institut National des Langues et Civilisations Orientales
Yoshifumi Kawasaki, The University of Tokyo
Eetu Mäkelä, University of Helsinki
Timofey Arkhangelskiy, Universität Hamburg
Nicolas Gutehrlé, Université de Franche-Comté
Kaisla Kajava, Aalto University
Joshua Wilbur, University of Tartu
Pascale Moreira, School of Communication and Culture, Aarhus University
Miikka Silfverberg, University of British Columbia
Francis Tyers, Indiana University
Anna Dmitrieva, University of Helsinki
Somesh Mohapatra, Massachusetts Institute of Technology
Won Ik Cho, Samsung Advanced Institute of Technology
Shuo Zhang, Bose Corp
Pihla Toivanen, University of Helsinki
Antti Kanner, University of Turku
Jeremy Bradley, Universität Vienna
Aatu Liimatta, University of Helsinki
Sijia Ge, University of Colorado Boulder
Michael Rießler, University of Eastern Finland
Irene Russo, Consiglio Nazionale delle Ricerche
Gechuan Zhang, University College Dublin
Maria Antoniak, Allen Institute for Artificial Intelligence
Thomas Schmidt, Universität Regensburg
Juho Pääkkönen, University of Helsinki
Rogier Blokland, Uppsala University
Jenna Kanerva, University of Turku
Katerina Korre, University of Bologna
Mikko Aulamo, University of Helsinki
Mitsunori Ogihara, University of Miami
Miu Takagi, Waseda University
Quan Duong, University of Helsinki
Daniela Teodorescu, University of Alberta
Erkki Mervaala, Finnish Environment Institute,
Joachim Scharloth, Waseda University
Dimosthenis Antypas, Cardiff University
Ayana Niwa, Tokyo Institute of Technology, Tokyo Institute of Technology
Heiki-Jaan Kaalep, institute of computer science, University of Tartu
Shu Okabe, Univ. Paris-Saclay
Dmitry Nikolaev, University of Stuttgart, Universität Stuttgart
Ritwik Bose, The Institute for Human & Machine Cognition
Dongqi Pu, Universität des Saarlandes
Aina Garí Soler, Télécom-Paris
Nils Hjortnaes, Indiana University
László Fejes, Hungarian Research Centre for Linguistics
Ligeti-Nagy Noémi, MTA-PPKE
Tulika Bose, Vivoka
Allison Lahnala, Phillips-Universität Marburg
Gabriel Simmons, University of California, Davis
Vilja Hulden, University of Colorado at Boulder
Federico Boschetti, CNR-ILC