Freelancer

Screen Scrape to XML to Save Local file  

Screen Scrape to XML to Save Local file is project number 547331
posted at Freelancer.com. Click here to post your own project.


| More Free Trial For New Buyers
 

Status: Awarded

Selected Providers: Arenabpo

Budget: $30-250

Created: 11/22/2009 at 3:55 EST

Bid Count: 26

Average Bid:
$ 174

12/02/2009 at 3:55 EST

Project Creator: marketmike
Employer Rating: 10/1010/1010/1010/1010/1010/1010/1010/1010/1010/10 (5 reviews)

Bid On This Project
 

Description

I am looking to screeb scrape a specific site http://offender.fdle.state.fl.us/ to collect data on registered sex offenders. The criteria for the search url to return the records I am interested in is as follows: http://offender.fdle.state.fl.us/offender/offenderSearchNav.do?county=marion&link=doSearch&commaSeparatedOffenderStatus=1,6,7,8,9&stateStatus=1&offenderType=3

However, hitting that URL directly seems to redirect you back to the homepage unless you already have an active session on the site. I suspect this is the first tricky spot as a session or something needs to be set with the parsers.

Once you do get the results, you will notice in a hidden field that all the IDs exist for the results. I anticipated using those ids to build the urls for the next part of the scrape where the offenders record would be built. It is a hidden field. <input type="hidden" name="commaSeparatedPersonIdsALL"

From these ids, the url to the respective record can be formed: http://offender.fdle.state.fl.us/offender/flyer.do?personId=16687 using the ID for the personID.

From this form I would like the following data scraped, including a url to the image and combined into an XML feed which will later be imported into our database (the DB import is not part of this project).

From right of photo....
-------------------------
Designation: Sexual Offender
Name: Samuel E Ackerson
Status: Released - Required to Register
Department of Corrections #: D93831
Search the Dept of Corrections Website
Date of Birth: 05/28/1975
Race : White
Sex: Male
Hair: Blond
Eyes: Blue
Height: 5'10"
Weight: 153 lbs

Below Photo....
--------------------
Samuel E Ackerson
Date Of Photo: 11/03/2009

Aliases
Scars, Marks & Tattoos

From Address Information I would like the first Address and Address Source Information> I would also want longitutde and latitude extracted from the map link for the address being imported. This will be stored in db on import for Geo coding on map.

From Crime Information - Qualifying Offenses I would like all the information brought into the feed as a table using the same headers as the page but without color or formatting.



Again, this data should all be produced into an XML file that I will later use to import into the DB. The XML file should be stored on each run when completed and named with time/date stamp. The process should be setup to be able to be run via windows task manager so maybe php curl from command line or something similar... not my area of expertise.


Additional information submitted:

11/23/2009 at 0:33 EST:
Also note that I will need personID in the XML output for each record.


Messages Posted:1 View project clarification board Post message on project clarification board

Bid On This Project
 

If you are the project creator or one of the bidders Log In for more options

 

225

3 days

11-10-2009 02:16 EST

VALUEONWEB is a customer-specific service oriented company has got a Professional and creative team. We are the Professional Web Development Company having rich experience in Web design and development. We have expertise and experience in ecommerce site development, flash and animation. Our team can work on HTML,DHTML, JAVA SCRIPT, FLASH , ASP,ASP.NET,PHP,MY SQL, SQL Server, MS SQL. Our Bid includes development , deployment and testing. We are ready to start and send daily reports. we communicate through email,messenger,phone and skype.We will a deliver a professional looking design(s) with proposed functionality to match your concept.

help

 

250

7 days

11-09-2009 23:35 EST

Hello, please refer your PMB. Thank you.

help

 

200

7 days

11-09-2009 23:50 EST

Hello,Please refer your PMB.Thank you.

help

 

220

3 days

11-09-2009 23:52 EST

Hi, please check PMB.

help

 

250

7 days

11-28-2009 03:51 EST

Hello, I am scraper expert, i have test the site, it use session, but it's no problem for me, thank you! (as we agreed, i add the plugin revision part into it, and add $70 on bid)

help

 

225

3 days

11-22-2009 08:46 EST

We can help in your project, please check PMB to see our related experience.

help

 

150

2 days

11-10-2009 07:11 EST

We are very good in session based scraping. Please check pmb for more details.

help

 

250

3 days

11-09-2009 23:10 EST

Hi, Please check pmb

help

 

60

2 days

11-22-2009 07:25 EST

I can do this job for you. See PM for details.

help

 

175

10 days

11-23-2009 00:26 EST

Hi please check Pm thanks jasbir

help

 

150

4 days

11-22-2009 08:33 EST

Please check PM. Thanks.

help

 

145

5 days

11-23-2009 12:52 EST

I specialize in data scrapping. Please check PMB for more info.

help

 

80

1 day

11-10-2009 07:52 EST

Hi sir, Please check PM for more details, thanks, Kimi.

help

 

185

3 days

11-09-2009 23:50 EST

pls chk pmb

help

 

250

3 days

11-10-2009 08:10 EST

Please see PMB.

help

 

200

5 days

11-23-2009 00:57 EST

Hi! Please view the PMB for details. Cheers, -Cam.

help

 

140

7 days

11-10-2009 06:39 EST

kindly check the pmb.

help

 

100

0 days

11-09-2009 23:19 EST

See PMB for details

help

 

250

0 days

11-23-2009 05:29 EST

Hello, Please have a look in PMB. Regards, Bhavik

help

 

90

5 days

11-10-2009 02:34 EST

I can do this for you quickly. Please contact for more details.

help

 

160

5 days

11-22-2009 07:22 EST

we have 10 years experience with PHP/MYSQL. we can gaurantee you good service

help

 

100

3 days

11-22-2009 04:45 EST

Please check my PM!

help

 

230

5 days

11-10-2009 04:12 EST

(No Feedback Yet)

hello there we are SolutionBytes please have a luk on our work portfolio....check PM...regards ankita.

help

 

86

2 days

11-10-2009 04:36 EST

(No Feedback Yet)

hi this is my preliminary bid see pm. do this alle time man

help

 

150

20 days

11-22-2009 05:02 EST

(No Feedback Yet)

Hello sir, we are able to do this plz contact us...thx LOZIX SOLUTIONS TEAM

help

 

200

1 day

11-22-2009 10:49 EST

(No Feedback Yet)

Hi, I'm techPista from bangalore, india working as a PHP web developer for past 10 years.I have sound knowledge in building Framework based web sites. I have very strong knowledge in the following areas 1. PHP 2.HTML & DHTML 3.CSS 4.JavaScript 5.CodeIgniter 6.Ajax 7.Flex 8.Mysql 9.Manual Testing 10. XML 11. RSS Feeds 12. SimpleTest Opera Widget Development: Also I have very strong knowledge in Opera Widget development. Currently i have been developing widgets for windows mobile. Almost i have developed 100 widgets for windows mobiles regards techPista

help


    Bid on this Project