Metadata

Both DiACL and TITUS 2.0 are rich in metadata. Because the two databases differ in nature (derived data versus corpus), their metadata differs as well.

DiACL metadata

In DiACL, metadata is given for each language and is displayed before the typological and lexical data. It includes language names, ISO 639-3 code, glottocode, timeframe, focal point, focus area, language area (i.e., Glottolog area), language reliability, and associated tree node (i.e., position in the language family tree). Metadata also gives access to polygon data, if available. A further important item is geographical presence, which corresponds to the language polygon data, described under Polygon data.

Screenshot of the language metadata for A’jie (Austronesian, Pacific)
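
For readers who process DiACL language metadata programmatically, the fields listed above can be pictured as a simple record. The following Python sketch is only an illustration: the class name, field names, field types, and the helper code_problems are assumptions made here and do not reflect DiACL's actual export format or API. The format checks assume that an ISO 639-3 code is three lowercase letters and that a glottocode is four lowercase alphanumeric characters followed by four digits. All values in the usage example are placeholders, not real DiACL data.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumed code shapes: ISO 639-3 = three lowercase letters;
# glottocode = four lowercase alphanumerics followed by four digits.
ISO_639_3_PATTERN = re.compile(r"[a-z]{3}")
GLOTTOCODE_PATTERN = re.compile(r"[a-z0-9]{4}[0-9]{4}")


@dataclass
class LanguageMetadata:
    """Hypothetical record mirroring the DiACL language metadata fields."""

    name: str
    iso_639_3: Optional[str]
    glottocode: Optional[str]
    timeframe: str                              # format is an assumption
    focal_point: Optional[Tuple[float, float]]  # assumed (latitude, longitude)
    focus_area: str
    language_area: str                          # i.e., Glottolog area
    reliability: str
    tree_node: str                              # position in the family tree
    has_polygon: bool = False                   # polygon data available?

    def code_problems(self) -> List[str]:
        """Return a list of format problems found in the language codes."""
        problems = []
        if self.iso_639_3 is not None and not ISO_639_3_PATTERN.fullmatch(self.iso_639_3):
            problems.append("ISO 639-3 code is not three lowercase letters")
        if self.glottocode is not None and not GLOTTOCODE_PATTERN.fullmatch(self.glottocode):
            problems.append("glottocode is not four alphanumerics plus four digits")
        return problems


# Usage with placeholder values only ("qaa" lies in the ISO 639-3 private-use range).
example = LanguageMetadata(
    name="Example language",
    iso_639_3="qaa",
    glottocode="abcd1234",
    timeframe="modern",
    focal_point=(0.0, 0.0),
    focus_area="Example focus area",
    language_area="Example language area",
    reliability="high",
    tree_node="Example family > Example branch",
    has_polygon=True,
)
print(example.code_problems())
```

Keeping the two language codes optional reflects the assumption that some languages, for example historical ones, may lack one of the identifiers.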

TITUS 2.0 metadata

TITUS 2.0 is a corpus, so its metadata is of a different type from that of the DiACL database. Here, metadata is given for each data set, under the “Metadata” tab in the right sidebar. It contains important information about the text: the title, the author, the contributors, the context of the digitization (for both the original version and the TITUS version), the source of the text, and details about the publication of the original text (if available).

Metadata in TITUS 2.0 complies with the Text+ standard, with a view to future integration and searchability of the metadata via that platform.