We need code that puts Zillow "houses sold" data into an Excel file that can be converted for use in Stata or MATLAB. Copying and pasting each observation is tedious, so we are looking for something that can be applied to many different searches. For each house we need the address, the sale price in dollars, the number of bedrooms, the number of bathrooms, and the square footage. We need to do this for 20 different cities, each with ~100,000 observations. I'm not sure whether this can be coded, but hopefully it can!
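To make the requested fields concrete, here is a minimal sketch of one record and a CSV export, written in Python purely for illustration. The class name `SoldHouse`, the column names, and the sample record are my own assumptions, not anything Zillow defines. CSV is used because it opens in Excel and can be imported by both Stata and MATLAB.

```python
import csv
import io
from dataclasses import dataclass, asdict, fields


@dataclass
class SoldHouse:
    """One 'houses sold' observation (field names are hypothetical)."""
    address: str
    sale_price_usd: int
    bedrooms: int
    bathrooms: float
    square_feet: int


def write_csv(houses, out):
    """Write a header row plus one row per house to a text stream."""
    writer = csv.DictWriter(out, fieldnames=[f.name for f in fields(SoldHouse)])
    writer.writeheader()
    for house in houses:
        writer.writerow(asdict(house))


# Made-up sample record, only to show the output layout.
sample = [SoldHouse("123 Example St, Springfield", 250000, 3, 2.5, 1800)]
buffer = io.StringIO()
write_csv(sample, buffer)
csv_text = buffer.getvalue()
```

Because the address contains a comma, the writer quotes it, so the file still has exactly five columns per row.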
I need this written in PHP, with the data stored in a MySQL database. You can provide a .sql dump if necessary, along with the program to run on my server.
It should take a US zip code as input and save to the database whatever "houses sold" data Zillow shows in its search results. Zillow also provides an API for retrieving extra data, so it would be useful for the program to include a way to make a second pass over the database and add more details from the API results. Zillow's API is limited to 1000 requests per day, but if you deliver a PHP program we can keep it running for a few weeks to stay within the API limit.
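The second pass under the daily limit could be planned roughly as follows. This is a sketch in Python for illustration only, and the same logic would carry over to PHP. It assumes one API request per house, and the function names `days_needed` and `next_batch` are hypothetical. No actual API call is made here.

```python
import math

DAILY_LIMIT = 1000  # requests per day, as stated in the posting


def days_needed(pending, daily_limit=DAILY_LIMIT):
    """Days required to process `pending` houses at one request each."""
    return math.ceil(pending / daily_limit)


def next_batch(pending_ids, used_today, daily_limit=DAILY_LIMIT):
    """Return the record ids that can still be looked up today."""
    remaining = max(daily_limit - used_today, 0)
    return pending_ids[:remaining]


# Example: 998 requests already used today, so only two more ids fit.
todays_ids = next_batch([101, 102, 103, 104], used_today=998)
```

Under the one-request-per-house assumption, a city with ~100,000 observations would take `days_needed(100_000)`, i.e. 100 days, so the run time is worth checking against the "few weeks" estimate.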
Further information: this project has two parts. The first is scraping the addresses of houses sold from Zillow; the second is pulling information about those houses via the API.