default search action
VarDial@COLING 2020: Barcelona, Spain (Online)
- Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Yves Scherrer:
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2020, Barcelona, Spain (Online), December 13, 2020. International Committee on Computational Linguistics (ICCL) 2020, ISBN 978-1-952148-47-7 - Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubesic, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri:
A Report on the VarDial Evaluation Campaign 2020. 1-14 - Iuliia Nigmatulina, Tannon Kew, Tanja Samardzic:
ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German. 15-24 - Janine Siewert, Yves Scherrer, Martijn Wieling, Jörg Tiedemann:
LSDC - A comprehensive dataset for Low Saxon Dialect Classification. 25-35 - Amirhossein Tebbifakhr, Matteo Negri, Marco Turchi:
Machine-oriented NMT Adaptation for Zero-shot NLP tasks: Comparing the Usefulness of Close and Distant Languages. 36-46 - Michael Gasser, Binyam Ephrem Seyoum, Nazareth Amlesom Kifle:
Character Alignment in Morphologically Complex Translation Sets for Related Languages. 47-56 - Bharathi Raja Chakravarthi, Navaneethan Rajasekaran, Mihael Arcan, Kevin McGuinness, Noel E. O'Connor, John P. McCrae:
Bilingual Lexicon Induction across Orthographically-distinct Under-Resourced Dravidian Languages. 57-69 - Sina Ahmadi:
Building a Corpus for the Zaza-Gorani Language Family. 70-78 - Ainara Estarrona, Izaskun Etxeberria, Ricardo Etxepare, Manuel Padilla-Moyano, Ander Soraluze:
Dealing with dialectal variation in the construction of the Basque historical corpus. 79-89 - Chahan Vidal-Gorène, Victoria Khurshudyan, Anaïd Donabédian-Demopoulos:
Recycling and Comparing Morphological Annotation Models for Armenian Diachronic-Variational Corpus Processing. 90-101 - Maja Popovic, Alberto Poncelas, Marija Brkic, Andy Way:
Neural Machine Translation for translating into Croatian and Serbian. 102-113 - Sina Ahmadi:
A Tokenization System for the Kurdish Language. 114-127 - Badr M. Abdullah, Jacek Kudera, Tania Avgustinova, Bernd Möbius, Dietrich Klakow:
Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification. 128-139 - Aleksandra Miletic, Myriam Bras, Marianne Vergez-Couret, Louise Esher, Clamença Poujade, Jean Sibille:
A Four-Dialect Treebank for Occitan: Building Process and Parsing Experiments. 140-149 - Andrea Zugarini, Matteo Tiezzi, Marco Maggini:
Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language. 150-159 - Alyssa Hwang, William R. Frey, Kathleen R. McKeown:
Towards Augmenting Lexical Resources for Slang and African American English. 160-172 - Tommi Jauhiainen, Heidi Jauhiainen, Niko Partanen, Krister Lindén:
Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpora. 173-185 - Çagri Çöltekin:
Dialect Identification under Domain Shift: Experiments with Discriminating Romanian and Moldavian. 186-192 - Cristian Popa, Vlad Stefanescu:
Applying Multilingual and Monolingual Transformer-Based Models for Dialect Identification. 193-201 - Yves Scherrer, Nikola Ljubesic:
HeLju@VarDial 2020: Social Media Variety Geolocation with BERT Models. 202-211 - Petru Rebeja, Dan Cristea:
A dual-encoding system for dialect classification. 212-219 - Tommi Jauhiainen, Heidi Jauhiainen, Krister Lindén:
Experiments in Language Variety Geolocation and Dialect Identification. 220-231 - George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea:
Exploring the Power of Romanian BERT for Dialect Identification. 232-241 - Mihaela Gaman, Radu Tudor Ionescu:
Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets. 242-253 - Fernando Benites, Manuela Hürlimann, Pius von Däniken, Mark Cieliebak:
ZHAW-InIT - Social Media Geolocation at VarDial 2020. 254-264 - Andrea Ceolin, Hong Zhang:
Discriminating between standard Romanian and Moldavian tweets using filtered character ngrams. 265-272 - Gabriel Bernier-Colborne, Cyril Goutte:
Challenges in Neural Language Identification: NRC at VarDial 2020. 273-282 - Piyush Mishra:
Geolocation of Tweets with a BiLSTM Regression Model. 283-289
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.