Welcome to CorpusHub
The Meta-Corpus of All Corporas
Featured
CorpusHub
is aiming to be a community-contributed meta-database of all natural language resources, including concordancers, corpora, dictionaries, databases, databases, texts and other language resources of either major or minor languages, for ease of access and use by the academic community.
-
Concordancers →
A concordancer can show word usage in context, often with search.
-
Corpora →
A corpus is a collection of texts, often with metadata and annotations.
-
Datasets →
A dataset or database is a structured collection of language data.
-
Dictionaries →
A dictionary or lexicon is a collection of words and their meanings.