Product Scraper for three big online stores
This scraper allows getting all product information from any category or folder of three big online food stores. All these sites have dynamic-loaded web pages, so Selenium Webdriver was used. It's possible to run the script in silent (headless) mode. It is a part of a big crawling project running on the AWS platform
- Data mining and web scraping using Python - Providing and using API (REST, GraphQL) - Any web automation, search automation using Selenium Webdriver - Data exploring and processing, tasks and processes automation using Python - Data converting from/to XLSX, JSON, XML, PDF, DOCX, etc. - Desktop applications, Windows applications using PyQT - SQL, NoSQL Database integration: MongoDB, MySql, MariaDB, etc. Data exploring & statistical analysis using R and Python: Scipy, Numpy, Pandas - Linear regression - Logistic regression - Pearson product-moment correlation coefficient - Spearman's rank correlation coefficient - Kendall tau rank correlation coefficient - Pearson's chi-squared test - Fisher's exact test - Student's t-test - Mann - Whitney U-test - Analysis of variance (ANOVA) - Kruskal-Wallis test - Cluster analysis - Principal component analysis, etc. I would be pleased to consider proposals for long-term projects!