Languages

The resources of the CompLing platform contains data from a large number of languages. An important issue is to find languages by means of search tools of the databases. The platform consists of several types of data for languages, and we will describe here how data can be searched and found by language.

Derived data for large-scale studies of language patterns (grammar/typology, lexicon)

Grammatical, typological and lexical data that is organized into data sets suitable for quantitative analysis is provided via the DiACL database. Languages in DiACL can be searched through the following entrances:

By list of languages

By expandable language family trees

Language corpora of spoken and ancient languages

Language corpora are made available via the TITUS 2.0 corpus as well as in individual corpora not included in TITUS 2.0.

Languages in TITUS 2.0 can be searched via the Datasets tab under “Language”, reached via the tab Language, or by the menu to the left on the page, where corpora are organized by language families or subgroups.

Language metadata

Both DiACL and TITUS 2.0 are rich in language metadata.

In DiACL, language metadata is given by each language, and therefore searchable by Language (see above)

For TITUS 2.0, which is a corpus, metadata is of a different kind and searchable by data set. For further information about language and dataset metadata, visit the tab Metadata.