TITUS 2.0

NOTE: TITUS 2.0 is contiuously updated. New data sets are being transferred and made available on a daily basis.

TITUS 2.0 is an updated version of the corpora previously existing in the TITUS database. Databases of the TITUS corpus that have been developed into more modern and functional interfaces are not part of TITUS 2.0. See Corpora for an overview of these databases.

TITUS 2.0 provides corpora in the form of Datasets. These datasets (which have a unique ID) typically represents a text (for instance a compilation of manuscripts, a book, or similar) in a specific language. One example would be the dataset Alexandreis in Old Czech.

The datasets represented as datasets in TITUS 2.0 typically have an internal structure, for instance a text with an hierarchical organization into chapters and subchapters. In TITUS 2.0, this internal structure is given in a tab Table of contents in the right sidebar.

Another tab in the right sidebar, Metadata, gives important information about the text itself. It includes, first of all, information about the text itself: title, date, and author. Furthermore, it gives information about the text edition (editor, location, publisher, title, and year) and the process of digitization of the text, which includes names, place and year of the digitization, names, context and source of the version appearing in TITUS.

Datasets in TITUS 2.0 can be retrieved in by different entrances. From the Dataset page, they can be searched by ID, name of text, author of text, language, language group and type (e.g., text).

The page Sources gives and overview of publications used in the database. By clicking these sources, the associated datasets can be achieved.

Another way of finding the sources is by language, under the page Language.

All datasets can be downloaded in three formats (by icons in the right sidebar, located on top of the Metadata and Table of contents tabs): either as TEI files, as a CLDF format package, or as a .txt file.

Datasets in TITUS 2.0 are continuously improved, curated and provided with additional information. In a continuous flow, new data sets are uploaded and made available to the public.

For comments and questions about the database TITUS 2.0, please write to webmaster@compling.eu.