Project Detail

Import Wiktionary XML Dump Into SQL Server 2005 DB  

Import Wiktionary XML Dump Into SQL Server 2005 DB is project number 123621
posted at Freelancer.com. Click here to post your own project.

 

| More Free Trial For New Buyers
 

Status:

Selected Providers: vip

Budget: $30-100

Created: 02/06/2007 at 0:09 EST

Bid Count: 10

Average Bid:
$ 77

02/26/2007 at 0:09 EST

Project Creator: endsounds
Employer Rating: (No Feedback Yet)

Bid On This Project
 

Description

I wanted to know if someone could help me with a script that I can run to load the most recent and freely available complete Wikitionary dump (could be several files) into a SQL Server DB.

Essentially I would like to make a data clone of the Wiktionary dumps. The set of files is updated periodically by Wikimedia and the script needs to be able to work on a schedule (I can set the server to execute a page at an interval, e.g. X days, or you can include a job in a SQL Server DB).

Here are some sample files (there are more up to date files on the download.wikimedia.org site but they have the same format)


http://download.wikimedia.org/lawiktionary/20060909/lawiktionary-20060909-pages-articles.xml.bz2

http://download.wikimedia.org/lawiktionary/20060909/


http://download.wikimedia.org/

The first step would be extracting the bz2 file (I do not know how to do that on a Windows machine) and then reading the XML file into a DB.


Deliverables will be a script and a set of instructions that are a recreatable process to import the Wiktionary dumps in a SQL Server 2005 DB.


Write the documentation from the standpoint of a user with Windows XP Professional, MS Access 2003, and SQL Server 2005 installed on their computer and an available Windows server with SQL Server 2005.

Please include links to additional free software that the user must download to decompress the bz2 dump on a Windows machine and the steps involved in that (e.g. open command prompt and enter this command to decompress file X.bz2).


1) Complete step-by-step instructions on how to download a Wiktionary dump in bz2 format, decompress that file, and import the resulting XML file into a SQL Server 2005 database.
2) Instructions/documentation must be delivered in Word format along with a fully-functional SQL Server 2005 database file containing the content of Wiktionary dump.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables).
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

Messages Posted:0 View project clarification board Post message on project clarification board

Bid On This Project
 

If you are the project creator or one of the bidders Log In for more options

 

30

10 days

02-06-2007 02:39 EST

Hi, We would like to introduce ourselves as a company of professionals who are driven by the philosophy of customer satisfaction through QUALITY and INNOVATION. We specialize in web-related technologies and software development, 2D and 3D modeling and animation. Why 6STech? • Use of our tried and tested techniques to deliver what you want as per your expectations. • Really quick turnaround time • We are available on AIM, YAHOO messenger , MSN messenger , Phone, Email ... Please check your PMB Looking forward for favorable reply. Thank you & Regards, 6STech Team

help

vip

 

100

5 days

02-06-2007 06:47 EST

Dear Sir/Madam, I have all required knowledge and experience to achieve success of this project with best quality and professionalism. Please see my profile. Please see my detailed offer in PMB. Best Regards. Yurii.

help

 

100

5 days

02-06-2007 07:24 EST

Hi, I can make a fully automated data import tool for this! Please see PM for details.

help

 

70

25 days

02-06-2007 00:55 EST

(No Feedback Yet)

Detail of bid The budget provided for the project is just approx. Finaly it will be decided after knowing the actual requirement of the project. Give us an opportunity, we will provide you the best solution. Our employees come together with a wide variety of skills and backgrounds to create talented teams of problem-solvers. We help clients become high-performance businesses We use modern project management methodologies, combining through functional and non-functional requirements analysis, careful reference architecture elaboration, and early risks clearance, out of the box components selection and integration, and an interactive and incremental suite of deliveries towards the final release. We also know when and how to downsize the methodology to fit smaller projects, keeping key best practices while focusing on speed and code production. In all cases, we select with each client the methodology best suited to his cost and quality targets at the very start of each project. We are familiar with many programming and modeling languages including but not limited to .Net platforms, C, C++,XML, ASP and UML. What we can provide you: - High quality of outsource service. - Bug fixing (these are discusses before starting work) - Experienced outsource team - Knowledge of new technologies what we need from you - Work Maybe you want to discuss something? Feel free to contact us with any questions.

help

 

70

20 days

02-06-2007 03:53 EST

(No Feedback Yet)

The budget provided for the project is just approx. Finaly it will be decided after knowing the actual requirement of the project. Give us an opportunity, we will provide you the best solution. Our employees come together with a wide variety of skills and backgrounds to create talented teams of problem-solvers. We help clients become high-performance businesses We use modern project management methodologies, combining through functional and non-functional requirements analysis, careful reference architecture elaboration, and early risks clearance, out of the box components selection and integration, and an interactive and incremental suite of deliveries towards the final release. We also know when and how to downsize the methodology to fit smaller projects, keeping key best practices while focusing on speed and code production. In all cases, we select with each client the methodology best suited to his cost and quality targets at the very start of each project. We are familiar with many programming and modeling languages including but not limited to .Net platforms, C, C++,XML, ASP and UML. What we can provide you: - High quality of outsource service. - Bug fixing (these are discusses before starting work) - Experienced outsource team - Knowledge of new technologies what we need from you - Work Maybe you want to discuss something? Feel free to contact us with any questions.

help

 

100

5 days

02-06-2007 11:50 EST

(No Feedback Yet)

we are from new delhi(india).we have 3-5 years experience of web development with different lanuages and platforms like .NET and PHP etc.With good exp.. of XML and SOAP .pls gone through the profile attached and please reply soon if yoy are interested. thanks and regards Rohit Barla

help

 

70

25 days

02-07-2007 00:05 EST

(No Feedback Yet)

The budget provided for the project is just approx. Finaly it will be decided after knowing the actual requirement of the project. Give us an opportunity, we will provide you the best solution. Our employees come together with a wide variety of skills and backgrounds to create talented teams of problem-solvers. We help clients become high-performance businesses We use modern project management methodologies, combining through functional and non-functional requirements analysis, careful reference architecture elaboration, and early risks clearance, out of the box components selection and integration, and an interactive and incremental suite of deliveries towards the final release. We also know when and how to downsize the methodology to fit smaller projects, keeping key best practices while focusing on speed and code production. In all cases, we select with each client the methodology best suited to his cost and quality targets at the very start of each project. We are familiar with many programming and modeling languages including but not limited to .Net platforms, C, C++,XML, ASP and UML. What we can provide you: - High quality of outsource service. - Bug fixing (these are discusses before starting work) - Experienced outsource team - Knowledge of new technologies what we need from you - Work Maybe you want to discuss something? Feel free to contact us with any questions.

help

 

80

15 days

02-09-2007 10:55 EST

(No Feedback Yet)

i am a new comer for outsourcing work .i am able to do this kind of work .So ,i want to get chance from you. you will get your work properly.Thanking you

help

 

60

3 days

02-21-2007 11:44 EST

(No Feedback Yet)

I am a sql dba dealing with such uploads on a daily basis.

help

 

90

7 days

02-22-2007 04:43 EST

(No Feedback Yet)

Hi there, I actually worked on this similar project for my company recently and its working perfectly, all the zipping and unzipping, dumping xml data into sql server, I can be definately handy for you. Give it a try.

help


    Bid on this Project