CoRST News

Corpus of Russian Student Texts

Dec. 12, 2015

Anna Vishenkova presented a poster "Morphosyntax and Semantics of Russian Metalinguistic Comparatives" coauthored with Natalia Zevakhina and Svetlana Dzhakupova at the 14th annual conference of the Slavic Cognitive Linguistics Association “Crossing boundaries: taking a cognitive scientific perspective on Slavic languages and linguistics”, organized by the Universities of Sheffield and Oxford, UK, 9-13.12.2015. The poster was voted the best poster talk!

Nov. 2, 2015

The corpus has reached 3 million tokens. Nearly 15 000 errors have been annotated.

July 20, 2015

The Corpus was presented at the international conference "Corpus Linguistics 2015" in Lancaster, Great Britain.

June 26, 2015

Svetlana Puzhaeva gave a talk "Construction blending in non-standard variants of Russian in the Corpus of Russian Student Texts" coauthored with Natalia Zevakhina and Svetlana Dzhakupova at the 6th International Conference "Corpus Linguistics-2015" in Saint Petersburg, 22-26.06.2015.

May 30, 2015

Svetlana Dzhakupova gave a talk "Corpus of Russian student texts: design and prospects" coauthored with Natalia Zevakhina at the 21st International Conference on Computational Linguistics "Dialog" in Moscow, 27-30.05.2015.

Dec. 16, 2014

Natalia Zevakhina and Svetlana Dzhakupova gave a talk "Russian in the mirror of the Corpus of Russian Student Texts" at the Institute of Slavic Studies of the Russian Academy of Sciences (Moscow, 16.12.2014).

Nov. 29, 2014

Natalia Zevakhina and Svetlana Dzhakupova gave a talk "Russian Metalinguistic Comparatives: Towards the Typology" at the 11th Conference on Typology and Grammar for Young Researchers at the Institute for Linguistic Studies of the Russian Academy of Sciences (Saint Petersburg, 27-29.11.2014).

Nov. 12, 2014

The Corpus now has a new logo. The CoRST team thanks Alexandra Kozhukhar, a fourth-year NRU HSE student, for the beautiful graphic design.

Nov. 4, 2014

Texts with error markup have been uploaded to the corpus. More than 5 500 errors have been annotated.

Oct. 27, 2014

Corpus updated: about 300 000 tokens added. The corpus now contains more than 2.5 million tokens.

Sept. 14, 2014

The Corpus has been given a new name: Corpus of Russian Student Texts. New link: web-corpora.net/CoRST/

July 9, 2014

Corpus updated: about 420 000 tokens added.

April 12, 2014

Natalia Zevakhina and Alexander Letuchiy gave a talk "Corpus of mistakes in written texts of Russian native speakers" at the Seminar of the Corpus Research Laboratory at the Institute for the Russian Language in Moscow.

Nov. 10, 2013

Natalia Zevakhina, Svetlana Dzhakupova and Alexander Letuchiy gave a talk "Error annotation and metadata of Corpus of errors in written texts of Russian native speakers" at the Seminar of the Corpus Research Laboratory at the National Research University Higher School of Economics in Moscow.

Oct. 10, 2013

Alexander Letuchiy gave a talk "Error corpus in written texts of Russian speakers: goals, materials, error classification" coauthored with Natalia Zevakhina, Tim Arkhangelskiy, and Anna Plisetskaya at the conference "Corpus Technologies. Digital Humanities and contemporary knowledge" at the National Research University Higher School of Economics in Nizhniy Novgorod.

Feb. 1, 2013

The Corpus of Russian Student Texts has been established!