Site Scraping

We need an application that checks whether a phone number belongs to a "TeleTu" customer or not.

The Excel file will have 2 columns: one with the phone numbers and a second that will hold the result.
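The two-column layout described above can be sketched as follows. This is only an illustration: it assumes the file is exchanged as CSV (which Excel opens directly) rather than as a native .xlsx workbook, and the column headers `phone_number` and `result` are made-up names. Numbers are kept as text so that leading zeros such as in 0815035374 are not lost.

```python
import csv
import io


def write_results(rows, out_file):
    """Write (phone_number, result) pairs as two columns, header first."""
    writer = csv.writer(out_file)
    writer.writerow(["phone_number", "result"])  # hypothetical header names
    for number, result in rows:
        writer.writerow([number, result])


def read_numbers(in_file):
    """Return the first column as strings, skipping the header row."""
    reader = csv.reader(in_file)
    next(reader, None)  # skip header
    return [row[0] for row in reader if row]


# Round trip through an in-memory file to show the shape of the data.
buffer = io.StringIO()
write_results([("0815035374", "there is coverage")], buffer)
buffer.seek(0)
print(read_numbers(buffer))
```

With real files, open them with `newline=""` as the csv module documentation recommends.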

The website is [url removed, login to view]. The application has to send a POST request with a phone number so that the website returns a response, which will be saved next to the number tested.

The website blocks the IP after 5 attempts, so you need to use a proxy to be able to continue.
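A rough sketch of the POST plus proxy switching could look like this. Everything site-specific here is an assumption: `CHECK_URL` and `PHONE_FIELD` are placeholders because the real address and form field are not given in this posting, and `max_uses=4` simply stays under the 5-attempt limit. The rotation also wraps around to the first proxy again, which only works if the block expires in the meantime.

```python
import urllib.parse
import urllib.request

# Hypothetical placeholders: the real URL and form field name are not known.
CHECK_URL = "https://example.invalid/check"
PHONE_FIELD = "phone"


def proxy_for_request(request_index, proxies, max_uses=4):
    """Pick a proxy so each one serves max_uses consecutive requests."""
    return proxies[(request_index // max_uses) % len(proxies)]


def post_number(number, proxy, timeout=10):
    """POST one phone number through the given proxy and return the body."""
    data = urllib.parse.urlencode({PHONE_FIELD: number}).encode("ascii")
    handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    opener = urllib.request.build_opener(handler)
    with opener.open(CHECK_URL, data=data, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")
```

For example, with two proxies the first four requests go through the first proxy and the fifth through the second.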

What is required

1) I send you a file with 1 million numbers

2) You test the numbers on the site and save the response next to each number

3) The total number of files is 400, which means you will have tested 400 million numbers in the end

4) Delivery must be done file by file (1 million numbers at a time)

5) The bid must be for 1 file

6) For the first three files the delay must not exceed 4 days; starting from the 4th file, delivery must be 1 file per day

7) Payment will be made after verification of the results

Examples of site responses to be saved

0815035374 - "there is coverage" - site response (Sul numero 0815035374 puoi attivare le offerte TeleTu Bolletta Unica senza Canone Telecom. Scopri sotto quella che fa per te.)

04611800930 - "there is no adsl coverage" - site response (Per completare la verifica, inserisci l'indirizzo)

0110764333 - "is already customer" - site response (Sul numero 0110764333 è già cliente TeleTu. Puoi attivare queste offerte senza promozione.)
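Mapping the three example responses above to their labels could be sketched like this. The matching phrases are taken only from these three samples, so this is a guess at a workable rule, not a full description of what the site can return; anything else falls back to an "unknown" label, which is an addition for illustration.

```python
def classify_response(text):
    """Map a raw site response to one of the labels from the examples."""
    lowered = text.lower()
    # Check "already customer" first: that response also mentions offers.
    if "già cliente" in lowered:
        return "is already customer"
    if "puoi attivare le offerte" in lowered:
        return "there is coverage"
    if "inserisci l'indirizzo" in lowered:
        return "there is no adsl coverage"
    return "unknown"  # hypothetical fallback label, not from the posting


print(classify_response("Per completare la verifica, inserisci l'indirizzo"))
```

Saving the raw response next to the label makes later verification easier if the site changes its wording.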

- Each file contains 1 million numbers
- The bid must be per file
- The total project is 400 files, which means 400 million numbers

Payment will be made file by file, after verification of the responses.

Those who don't have experience need not bid:
This is not simple scraping. It involves IP blocking, and you must be able to deliver 1 million numbers in 2 or 3 days, which means you must be able to scrape about 350 numbers per minute
==> you must be able to use multi-threading with IP changing.
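The multi-threading part could be sketched with a thread pool as below. The `lookup` argument is a stand-in for whatever function actually posts a number through a proxy and returns the response; it is passed in so the sketch does not depend on the real site. The default of 32 workers is an arbitrary starting point, not a measured value.

```python
from concurrent.futures import ThreadPoolExecutor


def check_all(numbers, lookup, workers=32):
    """Run lookup(number) on a thread pool; results keep the input order."""
    numbers = list(numbers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in the same order as the inputs.
        return list(zip(numbers, pool.map(lookup, numbers)))


def fake_lookup(number):
    """Stand-in for the real request, used only to show the call shape."""
    return "checked " + number


print(check_all(["0815035374", "0110764333"], fake_lookup, workers=2))
```

Threads suit this job because each worker spends most of its time waiting on the network.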

You must be able to deliver a 1-hour test on numbers to be chosen, which means 350 * 60 = 21,000 numbers in 1 hour.
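The rates quoted above can be checked with a few lines of arithmetic. The sketch assumes scraping runs around the clock with no downtime; the function name is made up for this example.

```python
import math

MINUTES_PER_DAY = 24 * 60


def required_rate_per_minute(total_numbers, days):
    """Numbers per minute needed to finish total_numbers in the given days."""
    return math.ceil(total_numbers / (days * MINUTES_PER_DAY))


print(required_rate_per_minute(1_000_000, 2))  # 348, close to the 350 quoted
print(required_rate_per_minute(1_000_000, 3))  # 232
print(required_rate_per_minute(1_000_000, 1))  # 695, for 1 file per day
print(350 * 60)                                # 21000 numbers in the 1-hour test
```

So 350 numbers per minute matches the 2-day delivery, while the 1-file-per-day pace asked for from the 4th file onward needs roughly double that.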

So those who have not visited the site or run tests need not bid!

Skills: C Programming, Data Mining, Excel, Visual Basic, Web Scraping


About the employer:
(1 review) Sousse, Tunisia

Project ID: #2400746