have 500K resumes (CV's) in PDF, Word (doc & docx), Text and html formats (just those 4 formats) for the USA that need to be parsed to extract:
email address (if it exists and "none" indicated if it is not in the resume)
address (the most important piece of the address being the zip/postal code)
We are developing an internal Drupal site to provide retention, indexing and search for resumes/cv's.
The intention is clean up the 500K objects to ensure they are valid resumes (20% could be junk) before they are imported into the new system.
I expect there will be a lot more questions so feel free to ask.
7 freelance font une offre moyenne de $314 pour ce travail
Dear Customer,Greetings:):) We will be highly obliged to get an opportunity to work for you. My reviews on freelancer.com speaks about my credibility. I would love to start right away and can assure outstanding quality Plus
Me and my colleagues are very well-versed in parsing and could complete this project for you. Please send a private message if you have any further questions. Thank you for your consideration, Phil @ Forward S Plus
Definitely up for the challenge. Should be able to come up with some interesting parsing methods.