THE CORPUS OF CONTACT-INFLUENCED RUSSIAN
of Northern Siberia and The Russian Far East
Irina Khomchenkova, Polina Pleshak, Natalia Stoynova. The corpus of contact-influenced Russian of Northern Siberia and The Russian Far East. (Available online at: http://web-corpora.net/ruscontact/).
If you want to share your query with someone, send them the text you see below. The person who loads this query will see the same results in the same order, unless the corpus has been re-indexed.
The plot below works as follows. The x axis shows frequency ranks, i.e. positions in the full list of all word forms or lemmata of the corpus, ordered by decreasing frequency. If several words or lemmata have the same frequency, they share a single rank equal to the average of their positions. For each frequency rank r, the y axis shows the proportion of words or lemmata that conform to your query (each word counts only once) among all words or lemmata with frequency rank less than or equal to r. The rightmost point therefore shows the total proportion of such words or lemmata among all types (distinct words) in the corpus. In the case of lemmata, every lemma that has at least one word form conforming to the query is counted.
The subcorpus constraints and all words in the query except the first one are not taken into account here.
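
The calculation described above can be illustrated with a small sketch. This is not the code that the corpus interface itself runs; it is a minimal Python illustration under stated assumptions. The function names `average_ranks` and `cumulative_proportion`, the toy frequency list, and the query predicate are all hypothetical and chosen only for the example.

```python
def average_ranks(freqs):
    """Map each type to its frequency rank.

    Types are ordered by decreasing frequency; types with the same
    frequency share the average of their 1-based positions.
    """
    ordered = sorted(freqs, key=lambda w: -freqs[w])
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        # Extend j past all types tied with the type at position i.
        while j < len(ordered) and freqs[ordered[j]] == freqs[ordered[i]]:
            j += 1
        # The tied types occupy 1-based positions i+1 .. j.
        avg = (i + 1 + j) / 2
        for w in ordered[i:j]:
            ranks[w] = avg
        i = j
    return ranks


def cumulative_proportion(freqs, matches):
    """Return (rank, proportion) points for the plot.

    For each rank r, the proportion is the number of types with
    rank <= r that satisfy `matches`, divided by the number of all
    types with rank <= r. Each type counts once.
    """
    ranks = average_ranks(freqs)
    points = []
    seen = 0
    hits = 0
    for r in sorted(set(ranks.values())):
        group = [w for w in ranks if ranks[w] == r]
        seen += len(group)
        hits += sum(1 for w in group if matches(w))
        points.append((r, hits / seen))
    return points


# Toy data (hypothetical): word form -> corpus frequency.
toy_freqs = {"i": 10, "on": 7, "ne": 7, "tam": 3, "tut": 3}
# Hypothetical query: word forms beginning with "t".
toy_points = cumulative_proportion(toy_freqs, lambda w: w.startswith("t"))
```

In the toy data, "on" and "ne" occupy positions 2 and 3 and therefore share rank 2.5, while "tam" and "tut" share rank 4.5. The last point of `toy_points` has a y value of 0.4, which is the proportion of matching types among all five types, as the description of the rightmost point states.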