When you find an interesting website that you would like to remember send us an-e-
mail and we check if the content is accessible and text is obtainable. When we will
crawl the website, we determine the region of origin and language and assign the
content to a corpus that will be assigned a knowledge domain.
In this example the website delpher.nl is a website with extension nl and language
is dutch. Its size 3.36 MB characters and based on its content assigned to
knowledge domain linguistic where it accessible for teacher and linguist.