I have a CSV file with 10,000 titles of articles from a single Mexican newspaper (Reforma). I need a script that will go through each title and 1. Search for it on the newspaper's home page, which brings up a results page with a link to the article, 2. download each article and save it with a unique name. The only hurdles are:
1. The newspaper's archive is all online but you need a login and password, which i have, but cannot distribute as it might get cancelled. once logged in, i believe you can search and download unlimitedly, though there may be a session timeout.
2. Usually title uniquely idenitifies the ariticle, but may need to dis-ambiguate some search results using article date or author (this data is also in the CSV file)
3. Need to save articles in a convenient, easily readable format (would be good to use iReader or something like that)
4. Website and article titles are, obviously, in Spanish.