we need to develop an HTTP Robot in two levels:
The robot will index several web sources (blogs, newspaper...) in order to get content from deep pages (posts, news...). Final content pages will be selected based on uri patterns.
Robot have to take the content from that pages using regulars expresions for each different data (title, content, category...)
Content must be saved in a Database.
Web administrator with access restriction based in user/password.
Some functions: create web sources, create categories, create differentes IndexerProjects (one source can have more than one IndexProject: i.e. one per category)
System must run in linux with mysql and apache. We will provide more detailed information when choose the developer.
You must give us some information about how you're going to de the development: technologies, programming language, modules...
7 freelance font une offre moyenne de $686 pour ce travail
Hello! 100% guaranteed of high quality and professional work, as we are the experts in web and programming. Our company has been in the sphere of webdesign and programming for 7 years already, always providing its clie Plus
I have written and using a servlet (JAVA) robot which does what You need. Saves all the data in an SQL database.
Hello, I have wide area experience in open source technology like php/c++/linux/so files/fusebox AND WINDOW based ASP,COM,DCOM,MTS,VB,VC++,.NET. MY SUGGESTION IS USE Open Source software for your project, We can cus Plus