LEADER 02685nam 2200433Ka 450 001 996691917003316 005 20251225100014.3 010 $a3-11-220953-2 010 $a3-11-220821-8 035 $a(CKB)44145893800041 035 $a(ODN)ODN0012687691 035 $a(EXLCZ)9944145893800041 100 $a20251224d2025 uy 0 101 0 $aeng 135 $aurcn|---||||| 181 $ctxt$2rdacontent 182 $cc$2rdamedia 183 $acr$2rdacarrier 200 10$aHarmonizing language data $eStandards for linguistic resources. /$fPiotr Ba?ski 210 $aLaVergne $cDe Gruyter$d2025 215 $a1 online resource 225 0 $aDigital Linguistics,$v4. 300 $aTitle from eBook information screen.. 311 08$a3-11-914802-4 330 $aStandards function as safeguards to ensure that data remains interpretable, uniformly queryable, and archivable over time ? a critical challenge for digital humanists working with complex linguistic resources. This book provides an overview of essential standards for ensuring the sustainability of data in the Digital Humanities (DH). It addresses the selection of data encoding formats, methods of annotating primary data, and approaches to making resources findable and accessible. The focus is on various forms of linguistic data, such as texts, lexicons, or parallel arrangements (e.g., translations or transcribed recordings). The work explains the role of annotations and metadata in structuring and contextualizing data and examines the influence of diverse data formats, shaped by local academic or industrial practices. In contrast to neural language models, which often yield impressive but opaque results, DH projects aim for transparency, reproducibility, and sustainability. Achieving these goals requires interoperability ? the seamless interaction between data and tools. The book demonstrates how clear guidelines and best practices help ensure the long-term usability of data. It offers digital humanists practical approaches and well-founded standards to sustainably archive and efficiently utilize their data, making it an indispensable resource for the field. 606 $aNonfiction$2OverDrive 606 $aComputer Technology$2OverDrive 606 $aGrammar & Language Usage$2OverDrive 606 $aLanguage Arts$2OverDrive 615 17$aNonfiction. 615 7$aComputer Technology. 615 7$aGrammar & Language Usage. 615 7$aLanguage Arts. 686 $aCOM014000$aFOR000000$aLAN009000$2bisacsh 700 $aBa?ski$b Piotr$01897816 906 $aBOOK 912 $a996691917003316 996 $aHarmonizing language data$94553220 997 $aUNISA