Avar corpus

The page you are currently viewing is the web interface for the pilot version of the Avar corpus. The corpus currently contains about 2.3 million tokens. The texts have been annotated automatically with a morphological analyzer; about 76% of tokens have morphological annotation. The corpus is not disambiguated, i.e., each token is annotated with all of its possible analyses, regardless of context. The corpus was last updated on February 14th, 2016.
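
To illustrate what non-disambiguated annotation and annotation coverage mean, here is a minimal Python sketch. It is not the corpus's actual data format or tooling: the `Token` class, the `coverage` function, and the placeholder word forms and tags (`w1`, `LEMMA_A+N+ERG`, and so on) are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Token:
    """A corpus token with every analysis the analyzer proposed (hypothetical structure)."""
    form: str
    # All candidate analyses are kept; none is chosen based on context.
    analyses: list[str] = field(default_factory=list)


def coverage(tokens):
    """Return the share of tokens that received at least one analysis."""
    if not tokens:
        return 0.0
    annotated = sum(1 for t in tokens if t.analyses)
    return annotated / len(tokens)


# Placeholder data, not real Avar forms or real tags.
sample = [
    Token("w1", ["LEMMA_A+N+ERG", "LEMMA_B+V+PST"]),  # ambiguous: two analyses kept
    Token("w2", ["LEMMA_C+N+ABS"]),                   # one analysis
    Token("w3", []),                                  # not recognized by the analyzer
    Token("w4", ["LEMMA_D+ADJ"]),
]

print(f"{coverage(sample):.0%}")  # 3 of 4 tokens annotated: 75%
```

In this toy sample, the first token keeps both of its analyses, which is what the absence of disambiguation amounts to, and the third token counts against coverage because the analyzer returned nothing for it.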


Dmitry Ganenkov

Web interface

This corpus uses the search platform of the Eastern Armenian National Corpus (EANC). For instructions on making search queries, see the EANC help page.