Name: Web Search Engine
Software: Java, C, C++, Perl, or any other software
Database: Access, MySQL

Main pieces of the project:
1. Inverted List Index
2. Store Index (built in step 1) in a database or in files
3. Searching the Index built in step 1
4. Ranking Algorithm
5. Input Screen and Parsing
6. Display Output

Brief explanation:
Project description: a domain-specific search engine.

5. Input Screen: Accept input from the user and parse the string to extract the keywords to search for.
1. Inverted List Index: This would run like a nightly process. Build an index database of all the keywords in the domain. The domain could be restricted to one website, such as www.biology.edu. Inverted list index algorithms must be used to build the index.
2. Store Index: The index built in step 1 has to be stored in a database or in a file. Sort it and keep it ready for the search process to acquire results.
3. Searching Index: The keywords extracted in step 5 should be used to search the index and get pointers or links (also stored in the database) to the location of each matching web page.
4. Ranking Algorithm: Boolean, vector, or probabilistic models can be used to rank the results of the search process.
6. Display Output: After the ranking is done, the output has to be displayed best match first.
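As a rough illustration of how steps 5, 1, 3, 4, and 6 could fit together, here is a minimal Java sketch. It is not a required design. The class name InvertedIndexSketch, its method names, and the page paths under www.biology.edu are all hypothetical, pages are passed in as plain strings instead of being crawled, and a simple sum of keyword counts stands in for the Boolean, vector, or probabilistic ranking models named in step 4.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: all names here are illustrative, not part of the request.
class InvertedIndexSketch {
    // keyword -> (page URL -> number of times the keyword occurs on that page).
    // A TreeMap keeps keywords sorted, which suits step 2 (store the sorted index).
    private final Map<String, Map<String, Integer>> index = new TreeMap<>();

    // Step 5: lower-case the input and split it into alphanumeric keywords.
    static List<String> parse(String text) {
        List<String> keywords = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                keywords.add(token);
            }
        }
        return keywords;
    }

    // Step 1: record every keyword of a page in the inverted index.
    void addPage(String url, String text) {
        for (String keyword : parse(text)) {
            index.computeIfAbsent(keyword, k -> new HashMap<>())
                 .merge(url, 1, Integer::sum);
        }
    }

    // Steps 3, 4, 6: look up each distinct query keyword, score each page by the
    // sum of its keyword counts (an assumed stand-in ranking), and return URLs
    // best match first. Ties are broken alphabetically by URL.
    List<String> search(String query) {
        Map<String, Integer> scores = new HashMap<>();
        for (String keyword : new LinkedHashSet<>(parse(query))) {
            Map<String, Integer> postings = index.get(keyword);
            if (postings == null) {
                continue; // keyword does not occur on any indexed page
            }
            postings.forEach((url, count) -> scores.merge(url, count, Integer::sum));
        }
        List<String> ranked = new ArrayList<>(scores.keySet());
        ranked.sort((a, b) -> {
            int byScore = Integer.compare(scores.get(b), scores.get(a));
            return byScore != 0 ? byScore : a.compareTo(b);
        });
        return ranked;
    }

    public static void main(String[] args) {
        InvertedIndexSketch idx = new InvertedIndexSketch();
        // Hypothetical pages; a real build would fetch them from the site.
        idx.addPage("http://www.biology.edu/cells.html",
                    "Cell biology: the cell membrane and cell wall.");
        idx.addPage("http://www.biology.edu/plants.html",
                    "Plant biology and the plant cell.");
        for (String url : idx.search("cell membrane")) {
            System.out.println(url);
        }
    }
}
```

In this sketch a page matches if it contains any of the query keywords. The in-memory map would still need to be written to a database table or a file to satisfy step 2.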
1) Complete and fully functional working program(s) in executable form, as well as complete source code of all work done.
2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request.
3) Complete ownership and distribution copyrights to all work purchased.