Statement of Purpose: Our aim is implementing a software to track internet pages and index turkish pages for intelligent search.
Introduction: Besides the difficulties of reaching an information, Internet is the largest data source of the world. Unfortunetly most of the languages are lack of smart implementations to utilize this information. Almost 90% of the information in internet is in natural language and using computers to extract such an information is only available after the natural language processing studies. Heading a language study in an environment like Internet makes it necessary to declare a formal way of categorisation of Internet pages.
During this project We try to find a smart algorithm to categorize and index the natural language web pages.