The linguistic section of our homepage consists of five parts: 1) Text Corpus, 2) Concordance, 3) Transcription, Transliteration and Orthography, 4) e-Dictionaries and 5) e-Grammar.
•The text corpus includes glossed, analyzed and translated texts of all Ob-Ugric dialects covered in both projects.
•In the concordance subsection you can search for word forms and glosses and view them in context.
•In the subsection Transcription, Transliteration and Orthography you can find information on the phonological peculiarities of the Ob-Ugric languages and on their various transcriptions, together with correspondence tables that help when converting between transcriptions (e.g. FUT>IPA, Cyrillic>IPA). There is also a comparison of the Cyrillic traditions used in different publications, as well as notes on how our corpus texts were transcribed from several sources and on the characteristics of those sources.
•The e-dictionaries subsection contains a regular dictionary, a reverse dictionary and a thesaurus. All three are corpus-based. In addition, there is a digitized version of Munkácsi and Kálmán’s Mansi Dictionary and lists with ethnographic comments for several dialects.
•In the e-grammar you can find, for example, phoneme tables (IPA) and tables of inflected forms and affixes. The annotation subsection presents the semi-automatic annotation of functional, semantic and pragmatic roles.
•The Download section provides all language data in various data formats. The data have also been shared with The Language Archive (TLA).