Project Detail

Extract Data From Website : Web Crawler  

Extract Data From Website : Web Crawler is project number 545228,
posted at Freelancer.com.

 

 

Status:

Selected Providers: xamdam91

Budget: $30-250

Created: 11/06/2009 at 21:42 EST

Bid Count: 19

Average Bid: $142

11/16/2009 at 21:42 EST

Project Creator: IIcucucoolio
Employer Rating: 10/10 (1 review)

 

Description

I need someone who can write a program, script, or crawler of some sort that can crawl through an existing
website and extract a large amount of data from it. The data that I need to collect is very well organized
on the website, so this should not be hard to accomplish. The finished product should produce at least 700
Excel files, organized in folders on a server that I will specify later. Once the master list is organized,
I will need to update the data inside the Excel files whenever the information on the website changes. New
Excel files will also need to be added when one does not yet exist at the time of scanning for updates.
Here is how it needs to go (of course, this can change if you know a more efficient way of doing it):





1. Go to url: http://www.xxxxxxxx.com

2. Login (we will provide)

3. Choose State in middle

4. Choose School in middle

5. Choose Courses on Right

6. (Record the list of departments that the school offers. These will be in alphabetical order; example below.)


ACCT Accounting
ADED Adult Education
AERO Aerospace Engineering
AFRI Africana Studies
AGEC Agric Economics
FORY Forestry
FOUN Foundations Of Educ
FOWS Forestry & Wildlife Sci.
GDES Graphic Design



7. Choose First department in middle

8. (record the list of classes for each department: example below)


2110 Principles Of Financial Accounting
2117 Honors Principles Of Financial Accounting
2210 Principles Of Managerial Accounting
2810 Fundamentals Of Accounting

9. Repeat the process until every school, every department, and every department's classes have been crawled.
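
The traversal in steps 3 through 9 amounts to nested loops over schools, departments, and classes. Below is a minimal Python sketch, assuming each listing arrives as plain text in the "CODE Name" form shown in the examples above. The names parse_listing, crawl, fetch_departments, and fetch_classes are hypothetical, and the login and page fetching are left to the caller because the real site is not named here. The list of schools is assumed to have been collected per state already.

```python
from typing import Callable, Dict, List, Tuple

Entry = Tuple[str, str]  # (code, name), e.g. ("ACCT", "Accounting")


def parse_listing(text: str) -> List[Entry]:
    """Split lines such as 'ACCT Accounting' into (code, name) pairs."""
    entries: List[Entry] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between entries
        code, _, name = line.partition(" ")
        entries.append((code, name.strip()))
    return entries


def crawl(
    schools: List[str],
    fetch_departments: Callable[[str], str],   # hypothetical: school -> listing text
    fetch_classes: Callable[[str, str], str],  # hypothetical: (school, dept code) -> listing text
) -> Dict[str, Dict[Entry, List[Entry]]]:
    """Visit every school, every department, and every department's class list."""
    result: Dict[str, Dict[Entry, List[Entry]]] = {}
    for school in schools:
        departments = parse_listing(fetch_departments(school))
        result[school] = {
            dept: parse_listing(fetch_classes(school, dept[0]))
            for dept in departments
        }
    return result
```

Passing the fetch functions in as arguments keeps the parsing and looping testable without access to the site; the real versions would wrap whatever HTTP client and login session the chosen tool uses.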


As I said before, this is what needs to happen; I really don't care how it is programmed. The final product
should produce Excel files or update existing ones. If, when updating an Excel file, you find that it is no
longer listed in the master list, keep the previous Excel file rather than deleting it. A text file should
also be created that summarizes what was completed, including whether each Excel file was updated or newly
added. We need to know when an Excel file is added, updated, or deleted. This is very important, considering
that this list will have at least 700 separate files.
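
The update rules above can be sketched as a comparison between the previous crawl and the current one. This is only an illustration under stated assumptions: each Excel file is represented here as a list of row strings keyed by file name, the status labels and the function names summarize_changes and merge_snapshots are hypothetical, and actually reading or writing Excel files is not shown.

```python
from typing import Dict, List

Snapshot = Dict[str, List[str]]  # file name -> rows of that file (assumed representation)


def summarize_changes(previous: Snapshot, current: Snapshot) -> List[str]:
    """Return one report line per file: ADDED, UPDATED, UNCHANGED, or MISSING."""
    report: List[str] = []
    for name in sorted(set(previous) | set(current)):
        if name not in previous:
            report.append(f"ADDED: {name}")
        elif name not in current:
            # No longer listed on the site: the old file is kept, only flagged.
            report.append(f"MISSING: {name} (previous file kept)")
        elif previous[name] != current[name]:
            report.append(f"UPDATED: {name}")
        else:
            report.append(f"UNCHANGED: {name}")
    return report


def merge_snapshots(previous: Snapshot, current: Snapshot) -> Snapshot:
    """Current data wins; files absent from the new crawl keep their old rows."""
    merged = dict(previous)
    merged.update(current)
    return merged
```

Writing the returned lines to a text file with "\n".join(report) would give the requested summary, and merge_snapshots reflects the rule that a file missing from the site is never deleted.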


A finished example Excel file will be provided. We need this to be done by some type of program: doing it
by hand normally takes about 3-4 hours per Excel file, which is both unproductive and very boring.
We would like to get this done in one week. Please let us know if you have any questions. If you are really
interested, please use the code IIcucucoolio in the message you send us; all other messages will be ignored.
We don't want auto-generated messages. We will provide the username and password needed to enter the site,
and will send the website address in a private message.


Additional files submitted:
Augusta+State+University+DONE.xls

Messages Posted: 2


 

Bids

Bid: $200 | Delivery: 7 days | Posted: 11-07-2009 03:40 EST
Hello, please refer to your PMB. Thank you.

Bid: $150 | Delivery: 5 days | Posted: 11-07-2009 08:45 EST
Hello, please refer to your PMB. Thank you.

Bid: $200 | Delivery: 3 days | Posted: 11-07-2009 05:23 EST
We can help with your project; please check PMB to see our related experience.

Bid: $60 | Delivery: 0 days | Posted: 11-06-2009 22:20 EST
Looking forward to working with you...

Bid: $195 | Delivery: 3 days | Posted: 11-07-2009 00:23 EST
Hi, I'm interested in doing your project. Please contact me by PM if you would like to discuss. Best Regards, Yousef

Bid: $60 | Delivery: 4 days | Posted: 11-06-2009 22:15 EST
I can do this job for you. See PM for details.

Bid: $250 | Delivery: 8 days | Posted: 11-07-2009 03:23 EST
Please check PM. Thanks.

Bid: $100 | Delivery: 3 days | Posted: 11-07-2009 02:39 EST
Hi, please check PM. Thanks!

Bid: $150 | Delivery: 2 days | Posted: 11-07-2009 16:17 EST
I can do this in Perl and deliver it to you soon.

Bid: $100 | Delivery: 2 days | Posted: 11-06-2009 23:03 EST
Check PMB.

Bid: $225 | Delivery: 5 days | Posted: 11-07-2009 09:57 EST
Dear sir, I am ready to start the work immediately. Sincerely, Rajendra

Bid: $100 | Delivery: 3 days | Posted: 11-07-2009 02:09 EST
Please check PM for more details.

Bid: $250 | Delivery: 5 days | Posted: 11-07-2009 07:30 EST
Contact us for details.

Bid: $100 | Delivery: 5 days | Posted: 11-06-2009 22:40 EST
Hi, I can do this job. Please see the PMB.

Bid: $55 | Delivery: 0 days | Posted: 11-07-2009 01:16 EST | (No Feedback Yet)
I have been an expert in Python for 8+ years and have a lot of web scraping experience. I can do this job.

Bid: $100 | Delivery: 10 days | Posted: 11-06-2009 23:22 EST | (No Feedback Yet)
Ready to start here.

Bid: $50 | Delivery: 30 days | Posted: 11-07-2009 00:06 EST | (No Feedback Yet)
I have a lot of experience with Excel. I can do this type of job properly within a short time.

Bid: $200 | Delivery: 5 days | Posted: 11-07-2009 00:38 EST | (No Feedback Yet)
Please check my PMB.

Bid: $145 | Delivery: 3 days | Posted: 11-07-2009 04:10 EST | (No Feedback Yet)
Please visit PMB.

