We have a Scrapy project already working on a Linux Server, extracting data from several e-commerce sites, and we need add 5 spiders more.
The site names will be showed after project acceptance.
1) The code should be integrated to a existent project.
2) The spider must store data in to a mongodb (already created).
The field to store are:
id (unique by site)
timestamp (date when the product is updated)
3) Before store an item to Db, you should verify if this same product already exist, if not, you should insert a new one, if yes you should update the existent. The way to know that is by, id and site_name.
4) One of the sites, requires signing before browse the catalog, the spider must manage this logging.
The project payment will be divided in 5 hitos. We consider as the final of each one, the end of crawling of one site.
All the web sites are in Spanish language.
Décerné à :
14 freelance font une offre moyenne de $230 pour ce travail
Hi sir, I am scraping expert, I have did too many similar projects, please check my feedback then you will know. Can you tell me more details? then I will provide demo data for you. Thanks, Kimi
I have performed similar work recently, so I will be able to deliver quickly. I am currently 100% available. I am a linux, python, scrapy and mongodb expert.
Hello, I have a lot of knowledge and experience for this job. If You hire me this project will be done efficiently and fast. Feel free to contact me if You have any questions. Kind regards, Nino Rasic
Hi, I have done 4 scrapping projects and got 5 starts. Please check my profile. I can deliver faster scrapper. Hope I would be your choice. Thank you Regards Vikas