Q&A: TRok-Corpus

October 5th, 2009


What is TRok-Corpus?
TRok-Corpus is a private, not-for-profit, personal computational corpus belonging to Dr. Oktay Ahmed, intended solely for use in his own linguistic research.

What is the current status of TRok-Corpus?
At present, TRok-Corpus consists of 541 electronic books in plain-text format (all encoded in UTF-8), totaling over 100,000 pages.
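
For readers who keep a similar collection, a minimal sketch of how such figures could be checked is given here. It assumes the books sit as .txt files under a single folder; the folder layout, the file extension, and the function name corpus_summary are illustrative assumptions, not a description of how TRok-Corpus is actually stored. Page counts cannot be recovered from plain text, so the sketch reports files and characters only.

```python
from pathlib import Path


def corpus_summary(root):
    """Count the .txt files under `root` and check that each decodes as UTF-8."""
    n_files = 0
    n_chars = 0
    not_utf8 = []  # names of files that fail strict UTF-8 decoding
    for path in sorted(Path(root).rglob("*.txt")):
        n_files += 1
        try:
            text = path.read_bytes().decode("utf-8")
        except UnicodeDecodeError:
            not_utf8.append(path.name)
            continue
        n_chars += len(text)
    return {"files": n_files, "characters": n_chars, "not_utf8": not_utf8}
```

Calling corpus_summary on the corpus folder returns a small dictionary; an empty "not_utf8" list would suggest that every file is valid UTF-8.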

What is the language of the corpus?
Turkish.

What kind of books are they?
TRok-Corpus consists of books covering an extremely wide range of subjects, including fiction, non-fiction, translated books, computer books, travel, poetry, novels, psychology books, and medical books.

What software do you use to handle TRok-Corpus?
After many years of using XAIRA, the author currently uses AntConc 3.2.0u under Ubuntu Linux.
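
To give an idea of what a concordancer does with a Turkish text, here is a minimal keyword-in-context (KWIC) sketch. It is not how AntConc or XAIRA work internally; the function names turkish_lower and kwic and the width parameter are hypothetical, chosen for illustration. The one Turkish-specific detail is lowercasing: Python's str.lower() is not locale-aware, so dotted and dotless I are mapped by hand first.

```python
import re


def turkish_lower(s):
    # Map "I" -> "ı" and "İ" -> "i" before lowercasing the remaining letters.
    return s.replace("I", "ı").replace("İ", "i").lower()


def kwic(text, keyword, width=30):
    """Return (left, match, right) context tuples for each whole-word hit."""
    key = turkish_lower(keyword)
    hits = []
    for m in re.finditer(r"\w+", text):
        if turkish_lower(m.group()) == key:
            left = text[max(0, m.start() - width):m.start()]
            right = text[m.end():m.end() + width]
            hits.append((left, m.group(), right))
    return hits
```

For example, kwic(text, "ışık") would match "Işık", "IŞIK" and "ışık" alike, each with up to 30 characters of context on either side. Suffixed forms are not matched; that would need stemming or a wildcard search.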

Where did you find the books?
All materials in this corpus were downloaded from the internet. The author did not scan any books.

Do you plan to make this corpus publicly available?
Because the downloaded books are under copyright, the corpus will for now remain private and be used only for Dr. Oktay Ahmed's own research.

How can I contact you?
You can send me an e-mail by clicking here.

Last update: Oct 5, 2009.