TY - BOOK AU - Zampieri,Marcos AU - Nakov,Preslav TI - Similar languages, varieties, and dialects: a computational perspective T2 - Studies in natural language processing SN - 9781108565080 (ebook) AV - P120.V37 S556 2021 U1 - 417 23 PY - 2021/// CY - Cambridge PB - Cambridge University Press KW - Language and languages KW - Variation KW - Computational linguistics KW - Sociolinguistics KW - Dialectology N1 - Title from publisher's bibliographic system (viewed on 23 Aug 2021) N2 - Language resources and computational models are becoming increasingly important for the study of language variation. A main challenge of this interdisciplinary field is that linguistics researchers may not be familiar with these helpful computational tools and many NLP researchers are often not familiar with language variation phenomena. This essential reference introduces researchers to the necessary computational models for processing similar languages, varieties, and dialects. In this book, leading experts tackle the inherent challenges of the field by balancing a thorough discussion of the theoretical background with a meaningful overview of state-of-the-art language technology. The book can be used in a graduate course, or as a supplementary text for courses on language variation, dialectology, and sociolinguistics or on computational linguistics and NLP. Part 1 covers the linguistic fundamentals of the field such as the question of status and language variation. Part 2 discusses data collection and pre-processing methods. Finally, Part 3 presents NLP applications such as speech processing, machine translation, and language-specific issues in Arabic and Chinese UR - https://doi.org/10.1017/9781108565080 ER -