Basic Web Scrape

Completed. Posted on Mar 15, 2012. Paid on delivery.

I have a webpage that I want to scrape data from. I have already written the code to capture the HTML, so all I need is someone to implement the regex (or similar) to parse the page and populate an array with the data. Attached is the HTML data you will be parsing (the structure does not change, but the number of entries will).

INPUT:

(See attached.) I will pass the HTML in a variable called $html at the top of the script. For now you can save the attached file and use file_get_contents() to read it. I will later add cURL code to fetch this data from a form submission.
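As a minimal sketch of the input step described above: the helper name loadHtml() and the temporary file are assumptions made for illustration, since the posting only says the markup ends up in $html and is read with file_get_contents() for now.

```php
<?php
// Hypothetical helper: the posting only specifies that the HTML is
// available in $html; the function name and error handling are assumptions.
function loadHtml(string $path): string
{
    // file_get_contents() returns false on failure, so check explicitly.
    $html = @file_get_contents($path);
    if ($html === false) {
        throw new RuntimeException("Could not read HTML from {$path}");
    }
    return $html;
}

// Stand-in for the attached sample file, written to a temp location so the
// sketch runs on its own.
$path = tempnam(sys_get_temp_dir(), 'scrape_');
file_put_contents($path, '<td class="repTblCell">sample</td>');

$html = loadHtml($path);
unlink($path);

echo strlen($html), " bytes loaded\n";
```

Keeping the read behind one function means the later switch to cURL would only need to change how $html is filled, not the parsing code.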

OUTPUT:

A multidimensional array keyed by "Account Number", with each sub-array keyed by field name. Here is a complete example showing the fields I need, based on the first two records in the sample data.

$output =
Array
(
    [0162740110030] => Array
        (
            [Precinct Number] => 1
            [Sale Date (Sale Nbr)] => 04/03/2012 (1)
            [Cause Number] => 2008-71303
            [District Court] => 189
            [Case Style] => HARRIS COUNTY, ET AL VS WILL BAILEY, ET AL
            [Legal Description] => LT 30 BLK 11 HIGHLAND HEIGHTS
            [Physical Address] => 6633 TUSKEGEE ST 77091
            [Adjudged Value] => $6,000
            [Estimated Minimum Bid] => $6,000.00
            [Status] =>
        )

    [0833680000007] => Array
        (
            [Precinct Number] => 8
            [Sale Date (Sale Nbr)] => 04/03/2012 (1)
            [Cause Number] => 2010-20154
            [District Court] => 55
            [Case Style] => LA PORTE INDEPENDENT SCHOOL DISTRICT VS KELTON [url removed, login to view], ET AL
            [Legal Description] => LT 7 BLK 1A LAPORTE TERRACE
            [Physical Address] => 714 N 13TH ST 77571
            [Adjudged Value] => $62,063
            [Estimated Minimum Bid] => $4,295.86
            [Status] =>
        )

)

HINTS/TIPS:

1. You don't have to use this if you have another method you prefer, but this class makes parsing HTML tables pretty quick and easy: [url removed, login to view]

2. Parse in chunks: each listing begins with <td class="repTblCell">, each piece of data is in a <td class="repText">, and within that, the label to use as the array key is inside <span class="repTblPrompt">.

3. Note: The amount you bid is the amount you will be paid (upon successful completion). There are no tips or bonuses. I will thoroughly test the script and let you know of any problems BEFORE releasing escrow so that everything works 100% to specifications above before payment is made.
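The chunked parsing described in hint 2 could be sketched roughly as follows, using PHP's built-in DOMDocument and DOMXPath rather than regex or the class linked in hint 1. This is only a sketch under stated assumptions: the attached HTML is not reproduced here, so the sample markup, the nesting of repText cells inside each repTblCell, the trailing colon on labels, and the function name parseListings() are all hypothetical.

```php
<?php
// Hypothetical sample markup. Only the three class names come from the
// posting; the nesting and the "Label:" format are assumptions.
$sample = <<<'HTML'
<table>
<tr><td class="repTblCell"><table>
<tr><td class="repText"><span class="repTblPrompt">Account Number:</span> 0162740110030</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Precinct Number:</span> 1</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Adjudged Value:</span> $6,000</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Status:</span></td></tr>
</table></td></tr>
<tr><td class="repTblCell"><table>
<tr><td class="repText"><span class="repTblPrompt">Account Number:</span> 0833680000007</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Precinct Number:</span> 8</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Adjudged Value:</span> $62,063</td></tr>
<tr><td class="repText"><span class="repTblPrompt">Status:</span></td></tr>
</table></td></tr>
</table>
HTML;

function parseListings(string $html): array
{
    $doc = new DOMDocument();
    // Real-world HTML is rarely valid; collect parser warnings silently.
    libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    $output = [];

    // One chunk per listing.
    foreach ($xpath->query('//td[@class="repTblCell"]') as $cell) {
        $record = [];
        foreach ($xpath->query('.//td[@class="repText"]', $cell) as $field) {
            $prompt = $xpath->query('.//span[@class="repTblPrompt"]', $field)->item(0);
            if ($prompt === null) {
                continue; // no label, nothing to key the value on
            }
            $promptText = $prompt->textContent;
            $label = rtrim(trim($promptText), ':');

            // The value is whatever text follows the label inside the cell.
            $full = $field->textContent;
            $start = strpos($full, $promptText) + strlen($promptText);
            $record[$label] = trim((string) substr($full, $start));
        }

        // Promote the account number to the outer key.
        if (isset($record['Account Number'])) {
            $account = $record['Account Number'];
            unset($record['Account Number']);
            $output[$account] = $record;
        }
    }
    return $output;
}

$output = parseListings($sample);
print_r($output);
```

One thing to keep in mind with this layout: PHP casts array keys that look like canonical decimal integers to integers, so an account number without a leading zero would become an integer key, while the two sample account numbers above stay strings.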

HOW TO WIN THE PROJECT:

1. Show solid experience with scraping/data mining.

2. Experience on this website and positive feedback.

3. Low price and quick turnaround.

4. Bid early...I will likely NOT wait until the end of the bidding period if I find a developer that seems like a good fit.

Data Mining, PHP, Web Scraping

Project ID: #1505704

About the project

15 bids, remote project, active Mar 15, 2012

Awarded to:

inspire007

dear sir...i already developed your project...the demo link is given in PM...thank you

[amount not rendered] USD in 1 day
(118 reviews)
6.5

15 freelancers are bidding an average of $107 for this job

srinichal

I look forward to delivering the script.

$150 USD in 5 days
(109 reviews)
7.3
mantislin

Hi sir, Please check PM.

[amount not rendered] USD in 1 day
(189 reviews)
6.9
waelfree

Hi, I can do that ISA

[amount not rendered] USD in 1 day
(72 reviews)
6.5
inzaghi2006

Hi, I write many scripts that use regex. please contact me Thanks

$70 USD in 2 days
(128 reviews)
6.4
ansi2

Proposal details will follow. Thanks, 2ansi

$50 USD in 0 days
(87 reviews)
6.3
phpXpertbd

I specialize in similar projects. Please check PM for more details.

$130 USD in 3 days
(30 reviews)
6.2
tonykim100

Hello, I am ready to start now. Thanks.

[amount not rendered] USD in 1 day
(113 reviews)
6.1
Cueball61

What you ask for looks fairly simple to do, I have been using various scraping libraries for quite some time now, and have recently settled on the library you mentioned (very easy to use, I must say!). I can do this...

$120 USD in 3 days
(12 reviews)
5.3
procoder898

Hi, I am expert at Data Mining/Web Scraping and can surely satisfy you. Please check your inbox,

$69 USD in 2 days
(26 reviews)
5.2
ViliusSutkus

Hello. Check the PM.

[amount not rendered] USD in 1 day
(15 reviews)
4.7
atozinfosoft

Hello Respected Client, I have read your requirements and we are very experienced in this area. Please check the Message Board for more details. Thanks

$300 USD in 6 days
(0 reviews)
0.0
darioa

php scraper with knowledge of regex

[amount not rendered] USD in 1 day
(0 reviews)
0.0
falazar

Hi, I am a very experienced programmer, and will have no problem completing this project quickly. I deal with web scraping every day, from small issues, to some very large projects. A couple of recent projects I h...

$100 USD in 2 days
(0 reviews)
0.0