Project Detail

...... Scrape files from site......  

...... Scrape files from site...... is project number 194797
posted at Freelancer.com.

 

 

Status:

Selected Providers: chama

Budget: $30-100

Created: 11/12/2007 at 14:57 EST

Bid Count: 12

Average Bid: $71

01/11/2008 at 14:57 EST

Project Creator: honestmoney
Employer Rating: 10/10 (17 reviews)

 

Description

I need to scrape site content (MP3 files, to be precise).

Site structure:

All files to scrape are located in ONE directory:

www.site.com/mp3/1000.mp3


www.site.com/mp3/250000.mp3

All files here can easily be downloaded using, for example, the WGET command in a simple loop, and that is not a problem for me.
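The download loop described above could be sketched roughly as follows. This is only an illustration, not the poster's actual command: the `http://` scheme, the function names `mp3_urls` and `fetch`, and the choice to skip numbers that return an HTTP error are all assumptions. The numeric range is taken from the two example URLs.

```python
import shutil
import urllib.error
import urllib.request
from pathlib import Path

# Assumed base URL: the post shows "www.site.com/mp3/" without a scheme.
BASE_URL = "http://www.site.com/mp3/"


def mp3_urls(first=1000, last=250000, base=BASE_URL):
    """Yield one URL per numeric file name, first..last inclusive."""
    for number in range(first, last + 1):
        yield f"{base}{number}.mp3"


def fetch(url, dest_dir):
    """Download url into dest_dir; return the local path, or None on an HTTP error."""
    dest = Path(dest_dir) / url.rsplit("/", 1)[-1]
    try:
        # urlopen is evaluated first, so no empty file is created on failure.
        with urllib.request.urlopen(url) as response, open(dest, "wb") as out:
            shutil.copyfileobj(response, out)
    except urllib.error.HTTPError:
        return None  # assumed policy: skip numbers that do not exist
    return dest
```

Calling `fetch(url, "downloads")` for each URL from `mp3_urls()` would mirror the WGET loop; the files still carry their numeric names at this point.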

The problem is that I want them in the format:

www.my_site.com_Artist_Name-Song_Name.mp3

So your script must extract this information from the HTML code (I believe this part is dead easy because the TITLE tag looks like this):

<title>Ain't To Fun, AC DC MP3 xxxxxxx</title>

Where: Ain't To Fun is the song name, AC DC is the artist name, and xxxxxxx is text that is not important.
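A minimal sketch of the title extraction, assuming every page follows the layout of the single example above ("song name, artist name MP3 other text"). The function name `parse_title` is hypothetical, and the rule of splitting on the first comma is an assumption that would misread a song name containing a comma.

```python
import re
from html import unescape

# Matches the contents of the first <title> element on the page.
TITLE_RE = re.compile(r"<title>(.*?)</title>", re.IGNORECASE | re.DOTALL)


def parse_title(html):
    """Return (artist, song) from the page title, or None if the layout differs."""
    match = TITLE_RE.search(html)
    if match is None:
        return None
    title = unescape(match.group(1)).strip()
    # Assumption: the first comma separates the song name from the rest.
    song, comma, rest = title.partition(",")
    if not comma:
        return None
    # The artist name is whatever precedes the standalone word "MP3".
    parts = re.split(r"\s+MP3\b", rest, maxsplit=1)
    if len(parts) != 2:
        return None
    artist, song = parts[0].strip(), song.strip()
    if not artist or not song:
        return None
    return artist, song
```

For the example title, `parse_title` returns `("AC DC", "Ain't To Fun")`; a page whose title does not fit the pattern yields `None`, so it can be logged and checked by hand.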

But the tricky part is that the HTML code calls the MP3 file through JavaScript, not through a direct URL to the file. For example, all songs by AC DC have different numerical names without any logical order (I believe the author uses a HASH function to make direct scraping a “little” more difficult).

So, simply put, I want to RENAME each file from its NUMBER to www.my_site.com_Artist_Name-Song_Name.mp3

So for this example, the script must WGET the file to my local HDD, find the proper artist name and song name in the HTML code, and rename the file.

So, for example, the file 128908.mp3 must be downloaded and renamed to www.my_site.com_AC_DC-Ain't_To_Fun.mp3 (obviously the script must REPLACE spaces in names with underscores so I can upload the file to my server).
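The naming rule could be sketched like this. The helper names `to_token` and `target_name` are hypothetical, and replacing "/" as well as spaces is an assumption that goes beyond the post (it keeps the result a single file name if an artist is written as "AC/DC").

```python
import re

# Prefix taken from the example file name in the description.
PREFIX = "www.my_site.com"


def to_token(text):
    """Turn each run of whitespace into one underscore."""
    text = text.replace("/", "_")  # assumption: not requested in the post
    return re.sub(r"\s+", "_", text.strip())


def target_name(artist, song, prefix=PREFIX):
    """Build the new file name in the form prefix_Artist-Song.mp3."""
    return f"{prefix}_{to_token(artist)}-{to_token(song)}.mp3"
```

With the example values, `target_name("AC DC", "Ain't To Fun")` gives `www.my_site.com_AC_DC-Ain't_To_Fun.mp3`. The apostrophe is kept because the example file name keeps it.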

I will put the money in an escrow account and release it once the script is working.

For obvious reasons I can’t reveal the site address here.

Messages Posted: 0

 


 

100

2 days

11-12-2007 17:18 EST

Quality work. Thanks


 

100

1 day

11-12-2007 17:53 EST

Hello, sir. I am very interested in your project; please see my PMB for more detail.


 

80

1 day

11-12-2007 19:48 EST

Hi, Please read the message I've sent you.


 

100

1 day

11-12-2007 15:35 EST

Hi sir, please check your PMB right now, important, thank you.


 

50

1 day

11-13-2007 03:04 EST

We are suitable for this project, as we have already done the same type of project. We can give you exactly the solution you are asking for. Please check your PMB for more details. Thanks


 

30

1 day

11-12-2007 19:08 EST

I can do this job for you. See PM for details.


 

100

1 day

11-12-2007 16:56 EST

If you could reveal the site's address in PMB, it would be perfect.


 

80

0 days

11-13-2007 08:56 EST

Please check PMB for details.


 

100

1 day

11-13-2007 06:58 EST

Hi, Please check PM. Thanks.


 

50

1 day

11-12-2007 18:03 EST

Hi. I can make the scraper in a few hours. I can make you a demo for free. I've scraped sites like wwitv.com and shoutcast.com. Escrow first is not necessary.


 

30

1 day

11-13-2007 11:46 EST

I can make it in bash. It would be easy to run under Linux and Windows (using Cygwin).


 

30

1 day

11-13-2007 09:50 EST

(No Feedback Yet)

Please advise further on PMB, thanks.


