Project Detail

two data scraper for italian websites  

two data scraper for italian websites is project number 263685
posted at Freelancer.com. Click here to post your own project.

 

| More Free Trial For New Buyers
 

Status:

Selected Providers: victory07

Budget: $30-250

Created: 06/05/2008 at 12:03 EDT

Bid Count: 15

Average Bid:
$ 157

06/20/2008 at 12:03 EDT

Project Creator: saverio10
Employer Rating: 10/1010/1010/1010/1010/1010/1010/1010/1010/1010/10 (5 reviews)

Bid On This Project
 

Description

I need two data scraper for the following sites:
www dot aziende dot it
login dot cercaziende dot it

The scraper needs to collect the following information
- category (eg plumbers etc)
- Business Name
- description (id="textDescriptor")
- All phone & fax numbers
- Address
- website address
- email address

A business may have more than one phone number and should be broken into the following fields.
- Ph
- Other
- Fax
- Mobile
- AH Contact

I also need the address broken into separate fields
- Street number and name
- Suburb
- State
- postcode
- Country

The script must be able to:
use a proxy server lists in round robin way, rotating them every 20 or 50 requests

use as input a file with the urls list

export the data to a csv.

A simple interface will allow me to start/stop the script and provide basic progress feedback.

automatically extract the data from the continuing pages i.e. 2, 3, 4 onwards to get the full data

I should be able to specify the max number of records to retrieve and the speed (delay) of retrieving


For the first web site the url that contains the links to the information
are like:
http://www dot aziende dot it/abbigliamento/index.php
http://www dot aziende dot it/casa-e-giardino/index.php

and so on.

For the second website the urls are like:
http://login dot cercaziende dot it/category/abbigliamento
http://login dot cercaziende dot it/category/auto-e-moto

and so on

for this site the info is all on the page, you do not have to follow other links beside the paging.

Messages Posted:0 View project clarification board Post message on project clarification board

Bid On This Project
 

If you are the project creator or one of the bidders Log In for more options

 

200

7 days

05-19-2008 14:25 EDT

Hello, please refer your PMB. Thank you.

help

 

150

3 days

06-05-2008 12:11 EDT

Hello,Ready to start.Thank you.

help

 

250

1 day

06-05-2008 12:06 EDT

Professional work.

help

 

225

4 days

05-19-2008 01:37 EDT

Please check PMB.

help

 

100

2 days

05-18-2008 21:11 EDT

Dear sir, I am very interested in your project, Please see PMB for more details. Thanks. Best Regards.

help

 

195

3 days

06-06-2008 04:28 EDT

Hello, Will be glad to help. Best Regards, Yousef

help

 

120

3 days

05-18-2008 23:01 EDT

Please see PMB

help

 

100

3 days

05-18-2008 21:05 EDT

Please refer PMB. Thanks.

help

 

100

2 days

05-18-2008 23:17 EDT

Please check PM. Thanks RC.

help

 

100

4 days

05-19-2008 06:27 EDT

Can do that on python. I have experience with scraping USA and BEL yellowpages, so I do not expect any unexpectedness.

help

 

120

2 days

05-21-2008 05:40 EDT

I'm interested in your projects. Regards, Federico

help

 

200

2 days

06-06-2008 00:44 EDT

Hi,please check PM.

help

 

249

5 days

06-05-2008 12:15 EDT

Plz see pmb :))

help

 

100

3 days

05-18-2008 23:02 EDT

(No Feedback Yet)

I can do it

help

 

150

5 days

05-19-2008 00:38 EDT

(No Feedback Yet)

I'm very much interested consider me as one of your personnel online. thanks

help


    Bid on this Project