Project Detail

Screen Scrape to XML to Save Local file  

Screen Scrape to XML to Save Local file is project number 547331
posted at GetAFreelancer.com. Click here to post your own project.

 

| More
Free Trial For New Buyers
 

Status: Closed
(Selected Service Provider)

Selected Providers: mantislin

Budget: $30-250

Created: 11/09/2009 at 21:35 EST

Bid Count: 17

Average Bid:
$ 183

11/14/2009 at 21:35 EST

Project Creator: marketmike View PM Post PM
Employer Rating: 10/1010/1010/1010/1010/1010/1010/1010/1010/1010/10 (1 reviews)

Bid On This Project
 

Description

I am looking to screeb scrape a specific site http://offender.fdle.state.fl.us/ to collect data on registered sex offenders. The criteria for the search url to return the records I am interested in is as follows: http://offender.fdle.state.fl.us/offender/offenderSearchNav.do?county=marion&link=doSearch&commaSeparatedOffenderStatus=1,6,7,8,9&stateStatus=1&offenderType=3

However, hitting that URL directly seems to redirect you back to the homepage unless you already have an active session on the site. I suspect this is the first tricky spot as a session or something needs to be set with the parsers.

Once you do get the results, you will notice in a hidden field that all the IDs exist for the results. I anticipated using those ids to build the urls for the next part of the scrape where the offenders record would be built. It is a hidden field. <input type="hidden" name="commaSeparatedPersonIdsALL"

From these ids, the url to the respective record can be formed: http://offender.fdle.state.fl.us/offender/flyer.do?personId=16687 using the ID for the personID.

From this form I would like the following data scraped, including a url to the image and combined into an XML feed which will later be imported into our database (the DB import is not part of this project).

From right of photo....
-------------------------
Designation: Sexual Offender
Name: Samuel E Ackerson
Status: Released - Required to Register
Department of Corrections #: D93831
Search the Dept of Corrections Website
Date of Birth: 05/28/1975
Race : White
Sex: Male
Hair: Blond
Eyes: Blue
Height: 5'10"
Weight: 153 lbs

Below Photo....
--------------------
Samuel E Ackerson
Date Of Photo: 11/03/2009

Aliases
Scars, Marks & Tattoos

From Address Information I would like the first Address and Address Source Information> I would also want longitutde and latitude extracted from the map link for the address being imported. This will be stored in db on import for Geo coding on map.

From Crime Information - Qualifying Offenses I would like all the information brought into the feed as a table using the same headers as the page but without color or formatting.



Again, this data should all be produced into an XML file that I will later use to import into the DB. The XML file should be stored on each run when completed and named with time/date stamp. The process should be setup to be able to be run via windows task manager so maybe php curl from command line or something similar... not my area of expertise.

Messages Posted:1 View project clarification board Post message on project clarification board

Bid On This Project
 

If you are the project creator or one of the bidders Log In for more options

 
View PM Post PM

250

7 days

11-09-2009 23:35 EST

Hello, please refer your PMB. Thank you.

help

 
View PM Post PM

225

3 days

11-10-2009 02:16 EST

VALUEONWEB is a customer-specific service oriented company has got a Professional and creative team. We are the Professional Web Development Company having rich experience in Web design and development. We have expertise and experience in ecommerce site development, flash and animation. Our team can work on HTML,DHTML, JAVA SCRIPT, FLASH , ASP,ASP.NET,PHP,MY SQL, SQL Server, MS SQL. Our Bid includes development , deployment and testing. We are ready to start and send daily reports. we communicate through email,messenger,phone and skype.We will a deliver a professional looking design(s) with proposed functionality to match your concept.

help

 
View PM Post PM

200

7 days

11-09-2009 23:50 EST

Hello,Please refer your PMB.Thank you.

help

 
View PM Post PM

220

3 days

11-09-2009 23:52 EST

Hi, please check PMB.

help

 
View PM Post PM

180

2 days

11-10-2009 04:53 EST

Hello, I am scraper expert, i have test the site, it use session, but it's no problem for me, thank you!

help

 
View PM Post PM

225

3 days

11-10-2009 08:44 EST

We can help in your project, please check PMB to see our related experience.

help

 
View PM Post PM

150

2 days

11-10-2009 07:11 EST

We are very good in session based scraping. Please check pmb for more details.

help

 
View PM Post PM

250

3 days

11-09-2009 23:10 EST

Hi, Please check pmb

help

 
View PM Post PM

185

3 days

11-09-2009 23:50 EST

pls chk pmb

help

 
View PM Post PM

80

1 day

11-10-2009 07:52 EST

Hi sir, Please check PM for more details, thanks, Kimi.

help

 
View PM Post PM

250

3 days

11-10-2009 08:10 EST

Please see PMB.

help

 
View PM Post PM

100

0 days

11-09-2009 23:19 EST

See PMB for details

help

 
View PM Post PM

250

5 days

11-09-2009 23:15 EST

Hello, Interesting job, please check PM...

help

 
View PM Post PM

90

5 days

11-10-2009 02:34 EST

I can do this for you quickly. Please contact for more details.

help

 
View PM Post PM

140

7 days

11-10-2009 06:39 EST

kindly check the pmb.

help

 
View PM Post PM

230

5 days

11-10-2009 04:12 EST

(No Feedback Yet)

hello there we are SolutionBytes please have a luk on our work portfolio....check PM...regards ankita.

help

 
View PM Post PM

86

2 days

11-10-2009 04:36 EST

(No Feedback Yet)

hi this is my preliminary bid see pm. do this alle time man

help


    Bid on this Project