default search action
10th VarDial@EACL 2023: Dubrovnik, Croatia
- Yves Scherrer, Tommi Jauhiainen, Nikola Ljubesic, Preslav Nakov, Jörg Tiedemann, Marcos Zampieri:
Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@EACL 2023, Dubrovnik, Croatia, May 5, 2023. Association for Computational Linguistics 2023, ISBN 978-1-959429-50-0 - Galo Castillo-López, Arij Riabi, Djamé Seddah:
Analyzing Zero-Shot transfer Scenarios across Spanish variants for Hate Speech Detection. 1-13 - Vani Kanjirangat, Tanja Samardzic, Ljiljana Dolamic, Fabio Rinaldi:
Optimizing the Size of Subword Vocabularies in Dialect Classification. 14-30 - Olli Kuparinen:
Murreviikko - A Dialectologically Annotated and Normalized Dataset of Finnish Tweets. 31-39 - Verena Blaschke, Hinrich Schütze, Barbara Plank:
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages. 40-54 - Oksana Dereza, Theodorus Fransen, John P. McCrae:
Temporal Domain Adaptation for Historical Irish. 55-66 - Jonathan Dunn:
Variation and Instability in Dialect-Based Embedding Spaces. 67-77 - Sina Ahmadi, Milind Agarwal, Antonios Anastasopoulos:
PALI: A Language Identification Benchmark for Perso-Arabic Scripts. 78-90 - Taja Kuzman, Peter Rupnik, Nikola Ljubesic:
Get to Know Your Parallel Data: Performing English Variety and Genre Classification over MaCoCu Corpora. 91-103 - Hanna Fischer, Robert Engsterhold:
Reconstructing Language History by Using a Phonological Ontology. An Analysis of German Surnames. 104-112 - Peter Rupnik, Taja Kuzman, Nikola Ljubesic:
BENCHić-lang: A Benchmark for Discriminating between Bosnian, Croatian, Montenegrin and Serbian. 113-120 - Junlin Li, Bo Peng, Yu-Yin Hsu, Emmanuele Chersoni:
Comparing and Predicting Eye-tracking Data of Mandarin and Cantonese. 121-132 - Alfred Lameli, Andreas Schönberg:
A Measure for Linguistic Coherence in Spatial Language Variation. 133-141 - Gabriel Bernier-Colborne, Cyril Goutte, Serge Léger:
Dialect and Variant Identification as a Multi-Label Classification Task: A Proposal Based on Near-Duplicate Analysis. 142-151 - Aarohi Srivastava, David Chiang:
Fine-Tuning BERT with Character-Level Noise for Zero-Shot Transfer to Dialects and Closely-Related Languages. 152-162 - Aleksandra Miletic, Janine Siewert:
Lemmatization Experiments on Two Low-Resourced Languages: Low Saxon and Occitan. 163-173 - Ilia Afanasev:
The Use of Khislavichi Lect Morphological Tagging to Determine its Position in the East Slavic Group. 174-186 - Alan Ramponi, Camilla Casula:
DiatopIt: A Corpus of Social Media Posts for the Study of Diatopic Language Variation in Italy. 187-199 - Olli Kuparinen, Yves Scherrer:
Dialect Representation Learning with Neural Dialect-to-Standard Normalization. 200-212 - Fritz Hohl, Soh-eun Shim:
VarDial in the Wild: Industrial Applications of LID Systems for Closely-Related Language Varieties. 213-221 - Ankit Vaidya, Aditya Kane:
Two-stage Pipeline for Multilingual Dialect Detection. 222-229 - Mihaela Gaman:
Using Ensemble Learning in Language Variety Identification. 230-240 - Sang Yun Kwon, Gagan Bhatia, El Moatez Billah Nagoudi, Alcides Alcoba Inciarte, Muhammad Abdul-Mageed:
SIDLR: Slot and Intent Detection Models for Low-Resource Language Varieties. 241-250 - Noëmi Aepli, Çagri Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubesic, Kai North, Barbara Plank, Yves Scherrer, Marcos Zampieri:
Findings of the VarDial Evaluation Campaign 2023. 251-261
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.