P.Jananipriya, Dr.P.Vivekanandan, Mrs.A.Anitha
Internet forums are online discussion sites where people can do conversations in the form of messages. Each forum is having sub-forums and it contains different topics based on the people’s discussion. Crawling is the initial and the most important step during the Web searching procedure. Existing system presents a supervised web-scale forum crawler called Forum Crawler under Supervision (FoCUS). The goal of the FoCUS is to collect the forum pages with minimum overhead. During Crawling, the existing system uses only the single keyword method to crawl the web pages. It does not discover new threads and also does not refresh the crawled threads in a timely manner. The above two problems are rectified in the proposed system by using Ontology concept for Multi Keyword web crawling and Temporal database for discovering new threads. To improve the efficiency the proposed new crawler collects web pages for indexing from the web. By using Ontology concept, the crawling efficiency will be increased and also page coverage will be increased.