GetAFreelancer.com
 
Find projectsSearch
Sign Up | Log in | Top Rated Users | Browse projects | Post Project | RSS feeds | Articles
 

Wanted: A Perl guru to write a perl script.

   Click here to post similar project

Wanted: A Perl guru to write a perl script. is project number 53701 posted at GetAFreelancer.com. Click here to post your own project.

Status: Closed (Selected Service Provider)
Selected Providers: selected service provider gasparch
Budget: $30-100
Created: 04/06/2006 at 15:29 EDT
Bidding Ends: 04/11/2006 at 15:29 EDT
Project Creator: emailnotvalid View PM Post PM
Buyer Rating: 10.00/1010.00/1010.00/1010.00/1010.00/1010.00/1010.00/1010.00/1010.00/1010.00/10
(11 reviews)
Description: looking for a perl programmer that will write a script that will parse data from an 800mb-1.5GB text file.

this script will need to

* be readable and well commented. i can write some perl, but i'm far from an expert. i would like the ability to add on and use this script far into the future. this perl script will be run via a cron job at night.

* be extremely fast. we will test this on 2-3 gig files daily.
i have my own script, but it performs very slow and inefficiently. i need someone to write something from scratch that will be much faster. if my script runs faster than your script, i will send it back to you for reworking. obviously, that'll be a waste of time, so think carefully about the algorithm and your approach before writing anything.

* accept the data filename from the commandline
e.g. ./script.pl DATA.txt

* each chunk of data in the text file is separated by a special group of characters on its own line. the chunk of data is variable in length. We will forward u an example text file upon project start. in the meantime, here's a generic data layout. there are 3 data fields in each chunk. the first two data fields are each on their own single line. the third data field is multi-line and can range from 1 line to 40 lines or more. the special group of characters then separate the data chunk from the next one.

* the script will match for various keywords regardless of case in the third data field. i've done this by uppercasing the text and matching for these. i will provide these keywords to you. these keywords are subject to later change and future modification, so i need the ability to add/remove/edit them easily.

* each single keyword is part of a single keyword group and we will need certain things done for each group. a match of a keyword in a single group means that part of the first field in the data chunk will be copied and written to a single text file which i can name. i imagine each keyword group will have 10-30 keywords/phrases to match for.

* the script will then proceed to the next group with it's own unique and different keywords and check the entire text file as well. it will then write the first field of the data chunk to another text file. i would like only part of first data field to be written, although i'd like the option to include the other fields.

* the script will proceed to another group of keywords and copy the first field to another text file. i would like only part of the first data field to be written, although i'd like the option to include the other fields.


* lastly, the script will run through and look for data chunks which haven't matched any of the keywords in any of the keyword groups and write all 3 fields to another text file.


* after the scripts have finished running, i would like a report that will send via sendmail a report to me on the total number of data chunks matched in each group and keyword. i do not want to install additional perl modules so please use as few perl modules as possible. in addition, i'd like a copy of the report on the server, just in case email doesn't get sent or doesn't arrive. it may be overwritten everytime the script is run.

for example

e.g.

Group 1: 100
Match1: 12
March2: 34
Match3: 34

Group 2: 1232
Match 1: 23
Match 2: 343
Match 3: 123
Match 4: 23


Group 3: 13343


No Match Group: 1232


Before payment will be sent, I will test this on real data for verification and speed. Testing will take a day or two depending on how fast your script runs...

in your bid, please state your experience with perl, programming experience, ability to write clean and fast executing code.

payment will be via paypal or GAF.

this project is not difficult for a perl guru and shouldn't take too much time.

thanks.

feel free to ask questions.
Report violation
Job Type:
  • Perl/CGI
Database: (None)
Operating system: Linux
Bid count: 7
Average bid: N/A

 

Related project
 
Renegade AdvertPro Install Upgrade Featured
posted by microon
Budget: $30-250
 

FREE Trial project for new buyers!

 

View Project Message Board     Post Message on Project Message Board
Messages Posted: 0

If you are the project creator or one of the bidders Log in as project creator or bidder for more options

Bid on This Project

Service Providers PMB Bid Delivery Within Time of Bid Provider Rating
Bids are hidden by project creator. Log in as project creator or bidder as the project creator or as one of the bidders to view bids.
Bid on This Project

 

[ Website Design ] [ EU Freelance ] [ Web Forms ] [ Secure Forms ]

What is GetAFreelancer.com? ( Read about the company )

Outsource projects and save a lot of money. Getting affordable freelance work, freelance programming and custom web design done for your website has never been easier. Our escrow feature is developed to protect both buyers and sellers. If you never tried to contract work to another part of the world you have your chance today. Sign up free! Our mission is to find the best possible freelance workers at the best possible price. Companies of any size have used our web services.

Bookmark our homepage to make sure you don't forget about our website next time you need to develop an IT project for yourself or your company. Outsourcing is the process of subcontracting network operations and support to an organization outside your own company. Try it free today! Buy services with help from our secure escrow system. We have thousands of satisfied clients around the world. We provide a safe escrow environment and you don't release the money until project is completed. GetAFreelancer.com helps webmasters, web designers, programmers, software developers and business owners to develop their projects.

Find Webmaster Resources and Webmaster Forum. Take a look at Search Engine Submission.