ACIS 1504 Lecture Notes - Lecture 12: Web Crawler, Referential Integrity, User Interface
Document Summary
Web crawler (not database)- program that goes out searching internet for new webpage. Does web crawling to find additional web pages. Job to go out and find new pages and bring a copy back to next component. Indexer (database)- takes those pages from the web crawler, extracts significant words in new pages and indexes them (no a, an and be, don"t matter) rank those words in importance. Query processor (database)- output side, what we see, made up of three components (matches our requested stuff with stored information) User interface- search box and go button, or advance search, allows you to identify search criteria. Engine- matches search criteria to index of terms previously stored, does matching, Results formatter- what match it decides for you to see, displays urls containing search terms, displayed in order by a ranking that is based on other data stored.