GeTa tool
GeTa is a tool tailored to the annotation of Gǝʿǝz texts. It supports deep, fine-grained linguistic annotation as well as annotation at other levels.
The tool is written in Java and stores data in JSON format, while offering data export into several other formats, including export for the ANNIS linguistic visualization platform (http://corpus-tools.org/annis/; via an additional converter) and TEI-XML.
Each text can be annotated, in full or in part, at different levels. The main level is the detailed linguistic (part-of-speech) annotation (‘deep annotation’ in the project’s terminology), in which each word is linked to the corresponding dictionary entry. Named entities such as persons, places, dates, titles of works, or offices can also be annotated and linked to a metadata repository. Furthermore, the tool allows the text structure to be marked up (e.g. parts, chapters, sentences, verses). Special features related to the edition, such as editorial interventions like conjectures, are marked where they occur.
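The annotation levels described above can be pictured as layers of annotations over one list of tokens. The following Java sketch is purely illustrative and does not reproduce GeTa's actual classes or its JSON schema: the names AnnotationSketch, Level, Annotation, AnnotatedText, the token placeholders, and the reference strings such as "dict:entry-1" are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model: none of these names are taken from GeTa itself.
final class AnnotationSketch {

    // The four annotation levels mentioned in the description.
    enum Level { LINGUISTIC, NAMED_ENTITY, STRUCTURE, EDITORIAL }

    // One annotation over the half-open token range [start, end).
    // 'ref' is an assumed link target: a dictionary entry for LINGUISTIC,
    // a metadata-repository record for NAMED_ENTITY, empty otherwise.
    record Annotation(Level level, String type, int start, int end, String ref) {
        Annotation {
            if (start < 0 || end <= start) {
                throw new IllegalArgumentException("invalid token range");
            }
        }
    }

    // A text as a token list plus any number of annotations.
    // Tokens without annotations are allowed, so a text may be annotated in part.
    static final class AnnotatedText {
        private final List<String> tokens;
        private final List<Annotation> annotations = new ArrayList<>();

        AnnotatedText(List<String> tokens) {
            this.tokens = List.copyOf(tokens);
        }

        void add(Annotation a) {
            if (a.end() > tokens.size()) {
                throw new IllegalArgumentException("range exceeds token count");
            }
            annotations.add(a);
        }

        // All annotations recorded at one level, in insertion order.
        List<Annotation> atLevel(Level level) {
            return annotations.stream().filter(a -> a.level() == level).toList();
        }
    }

    public static void main(String[] args) {
        // Placeholder tokens stand in for the words of a real text.
        AnnotatedText text = new AnnotatedText(List.of("w1", "w2", "w3"));
        text.add(new Annotation(Level.LINGUISTIC, "pos", 0, 1, "dict:entry-1"));
        text.add(new Annotation(Level.NAMED_ENTITY, "person", 1, 2, "meta:person-1"));
        text.add(new Annotation(Level.STRUCTURE, "sentence", 0, 3, ""));
        text.add(new Annotation(Level.EDITORIAL, "conjecture", 2, 3, ""));

        for (Level level : Level.values()) {
            System.out.println(level + ": " + text.atLevel(level));
        }
    }
}
```

Keeping every level in the same token-range form is one possible design choice: it lets structural units such as sentences overlap freely with word-level annotations. Serialization to JSON is left out here because the stored format is not specified in this description.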
The current version of the GeTa user manual can be downloaded here.