Write scraper for data from html pages. Very basic stuff.
Prefer something in Simple_html_DOM PHP that i can expand if i need to later.
Pages to scrape are here:
[url removed, login to view]
Scraper must walk from page to page using the next link inside the articles. ( the 3 blue arrows at the top and bottom of the article ) In such a way that i can re-use the scraper on other section of the site by changing the opening page.
From each page i want the content being <h> <p> <div> and <li> elements to be collected in a MySQL table. I'm not interested in the surrounding website, only the content of the article.
pagenumber: number of the page, count from 1
elementnumber, order number of element, counter starting at 1 per page.
html content, text field with the html content.