Before a search engine can begin to do its work it must be able to read a file and process it so it can make some determination about what files correspond to certain search terms. I need to create a simple version of this first step of the process. From the command line i need the option to: Read a single file Process a list of files Display a list of words in database Search for a term Exit the program It needs to read text streams from either files or the command line. It will break them apart in to individual words and perform a frequency count of the words, discarding simple words like a, an, the, of, etc. It will then update a lookup table in a binary file. It also has to provide a listing of files in a simple plain text file, one filename per line, and it can batch process the file list. It will need to throw exceptions if any file can not be opened or found, and continue with the next file in the list. It will need to provide a way to display the words within the database so that people will be able to know what terms they can use. It should give the option to display them alphabetically or by popularity. Finally, it should allow the user to query the database and provide a listing of all files that contain the term that they have requested. It will be provided support classes and a main driving class with the command line interface.
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request. 3) Complete ownership and distribution copyrights to all work purchased. 4) All work needs to be done in mostly simple java, no other languages
needs to work in java, has to compile and must be compatible with windows.