Project Detail

Web scraper using ASP.NET, C# & SQL Server   Trial project

Web scraper using ASP.NET, C# & SQL Server is project number 299699
posted at GetAFreelancer.com.

 

 

Status: Closed
(Selected Service Provider)

Selected Providers: Mohitkatariya

Budget: $30-250

Created: 08/18/2008 at 20:59 EDT

Bid Count: 14

Average Bid: $ 201

08/25/2008 at 20:59 EDT

Project Creator: apergiel
Employer Rating: (No Feedback Yet)

 

Description

Web scraper using ASP.NET, C# & SQL Server
This is for my own learning; it is the first time I have posted a project.
The following is also attached in the file "spec.doc".

SPECIFICATION
Three (3) tables to be used
1st_Table) URL
Field     Example_Data
-----     ------------
ID        1
URL       http://news.google.com
ARGUMENT  /news?source=ig&hl=en&um=1&tab=wn&q=
REGEX     (?<=\babout<b>\b\s*)[0-9]*(?=\s*\b</b> for\b)

2nd_Table) TERM
Field     Example_Data
-----     ------------
ID        1
TERM      radon

3rd_Table) RESULT
Field     Example_Data
-----     ------------
ID        1
URL       http://news.google.com
ARGUMENT  /news?source=ig&hl=en&um=1&tab=wn&q=
URL_ARG   http://news.google.com/news?source=ig&hl=en&um=1&tab=wn&q=radon
REGEX     (?<=\babout<b>\b\s*)[0-9]*(?=\s*\b</b> for\b)
TERM      radon
RESULT    553
DATETIME  August 05, 2008 4:00PM
COUNT     1


Loop through every combination of rows from the two tables (URL & TERM), run the regular expression against the fetched page, and append the match (and the other data) to the 3rd table (RESULT).
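
The loop over the two tables might be sketched as follows. This is only an illustration under stated assumptions: `TargetBuilder` and `BuildTargets` are hypothetical names, in-memory collections stand in for the URL and TERM tables, and the term is URL-escaped before it is appended, which the spec does not mention.

```csharp
using System;
using System.Collections.Generic;

public static class TargetBuilder
{
    // Builds URL + ARGUMENT + TERM for every (URL row, TERM row) pair,
    // i.e. the value shown as URL_ARG in the RESULT table.
    public static List<string> BuildTargets(
        IEnumerable<(string Url, string Argument)> urlRows,
        IEnumerable<string> terms)
    {
        var targets = new List<string>();
        foreach (var row in urlRows)
        {
            foreach (string term in terms)
            {
                // Assumption: escape the term so spaces and symbols survive in a query string.
                targets.Add(row.Url + row.Argument + Uri.EscapeDataString(term));
            }
        }
        return targets;
    }
}
```

With the example rows from the tables above, the single combination yields the URL_ARG value shown in the RESULT table.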

1st_Step) Fetch the 'source' of the URL into a text file
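
A minimal sketch of this step, assuming `HttpClient` is available (it postdates this posting) and that the caller picks the file path. `PageSaver`, `SaveSourceAsync`, and the optional `fetch` parameter are hypothetical names introduced here; the parameter only exists so the download can be swapped out when trying the code offline.

```csharp
using System;
using System.IO;
using System.Net.Http;
using System.Threading.Tasks;

public static class PageSaver
{
    private static readonly HttpClient Client = new HttpClient();

    // Default download: returns the response body of the URL as a string.
    public static Task<string> DownloadAsync(string url)
    {
        return Client.GetStringAsync(url);
    }

    // Fetches the page source, writes it to a text file, and returns it for parsing.
    public static async Task<string> SaveSourceAsync(
        string url, string path, Func<string, Task<string>> fetch = null)
    {
        if (fetch == null)
        {
            fetch = DownloadAsync;
        }
        string source = await fetch(url);
        File.WriteAllText(path, source);
        return source;
    }
}
```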

2nd_Step) Parse this text using Regex
// C# EXAMPLE
// Requires: using System.Text.RegularExpressions;
// The pattern below comes from the REGEX field of the URL table.
Regex regex = new Regex(@"(?<=\babout<b>\b\s*)[0-9]*(?=\s*\b</b> for\b)", RegexOptions.IgnoreCase);
// Run the regex over the downloaded page source (text)
MatchCollection matches = regex.Matches(text);
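
One way to walk the MatchCollection and pull out the number is sketched below. It assumes the pattern was meant to contain the escapes `\b` and `\s` (the backslashes appear to have been lost in the posting), and the sample text in the usage note is made up for illustration. `CountExtractor` and `ExtractFirstNumber` are hypothetical names.

```csharp
using System;
using System.Text.RegularExpressions;

public static class CountExtractor
{
    // Returns the first non-empty numeric match, or null when nothing usable is found.
    public static int? ExtractFirstNumber(string text, string pattern)
    {
        var regex = new Regex(pattern, RegexOptions.IgnoreCase);
        MatchCollection matches = regex.Matches(text);

        // A MatchCollection is enumerated one Match at a time.
        foreach (Match match in matches)
        {
            // [0-9]* may match an empty string, so skip empty values.
            if (match.Value.Length > 0 && int.TryParse(match.Value, out int number))
            {
                return number;
            }
        }
        return null;
    }
}
```

For example, calling `ExtractFirstNumber("of about<b>553</b> for <b>radon</b>", @"(?<=\babout<b>\b\s*)[0-9]*(?=\s*\b</b> for\b)")` should give 553, since the lookbehind and lookahead pin the digits between "about<b>" and "</b> for".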

3rd_Step) Save findings and other info to "RESULT" table
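
As a rough sketch of this step, the code below collects findings in an in-memory `DataTable` shaped like the RESULT table; writing it to SQL Server (for instance with a data adapter or bulk copy) is left out. `ResultTable`, `Create`, and `AddResult` are hypothetical names, the column types are guesses from the example data, and the meaning of COUNT is not defined in the spec, so it is simply passed through.

```csharp
using System;
using System.Data;

public static class ResultTable
{
    // Creates an empty table with the columns listed for RESULT in the spec.
    public static DataTable Create()
    {
        var table = new DataTable("RESULT");
        DataColumn id = table.Columns.Add("ID", typeof(int));
        id.AutoIncrement = true;      // mirrors the self-incrementing identity field
        id.AutoIncrementSeed = 1;
        id.AutoIncrementStep = 1;
        foreach (string name in new[] { "URL", "ARGUMENT", "URL_ARG", "REGEX", "TERM" })
        {
            table.Columns.Add(name, typeof(string));
        }
        table.Columns.Add("RESULT", typeof(int));
        table.Columns.Add("DATETIME", typeof(DateTime));
        table.Columns.Add("COUNT", typeof(int));
        return table;
    }

    // Appends one finding; URL_ARG is URL + ARGUMENT + TERM as in the example row.
    public static DataRow AddResult(DataTable table, string url, string argument,
        string term, string regex, int result, DateTime when, int count)
    {
        DataRow row = table.NewRow();
        row["URL"] = url;
        row["ARGUMENT"] = argument;
        row["URL_ARG"] = url + argument + term;
        row["REGEX"] = regex;
        row["TERM"] = term;
        row["RESULT"] = result;
        row["DATETIME"] = when;
        row["COUNT"] = count;
        table.Rows.Add(row);
        return row;
    }
}
```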

Note:
1) The attached example code "almost" works for a single URL; I got stuck on MatchCollection.
2) Liberal use of comments is appreciated.
3) The "ID" field is an identity field that auto-increments, but it is not used for anything now.
4) I would like the use of Regex & MatchCollection, unless a better method is known.
5) A simple ASP.NET page allowing the add/edit/delete of rows in the 3 tables will be needed.
5a) Two buttons, one to import from and one to export to an Excel file, are needed for all three tables.
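
For the export button, one simple stand-in is to write CSV, which Excel can open; this is not a native Excel file and it does not cover the import side. The sketch below assumes each table has been loaded into a `DataTable`; `CsvExporter` and `ToCsv` are hypothetical names.

```csharp
using System;
using System.Data;
using System.Linq;
using System.Text;

public static class CsvExporter
{
    // Quotes a field when it contains a comma, a quote, or a line break.
    private static string Escape(string field)
    {
        if (field.IndexOfAny(new[] { ',', '"', '\n', '\r' }) < 0)
        {
            return field;
        }
        return "\"" + field.Replace("\"", "\"\"") + "\"";
    }

    // Writes a header line with the column names, then one line per row.
    public static string ToCsv(DataTable table)
    {
        var sb = new StringBuilder();
        sb.Append(string.Join(",",
            table.Columns.Cast<DataColumn>().Select(c => Escape(c.ColumnName))));
        sb.Append("\r\n");
        foreach (DataRow row in table.Rows)
        {
            sb.Append(string.Join(",",
                row.ItemArray.Select(v => Escape(Convert.ToString(v)))));
            sb.Append("\r\n");
        }
        return sb.ToString();
    }
}
```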



Additional files submitted:
Default.aspx
Default.aspx.cs
spec.doc

Messages Posted: 2


250

10 days

08-22-2008 15:03 EDT

Hello Sir, We are interested. Please check PMB


200

5 days

08-18-2008 23:40 EDT

(No Feedback Yet)

www.thuedia.com


250

8 days

08-19-2008 00:11 EDT

(No Feedback Yet)

We are an India-based website and software development firm. Please have a look at our website, www.gnwebsoft.com, for more information.


230

6 days

08-19-2008 00:50 EDT

(No Feedback Yet)

I can do it; see your PM.


200

5 days

08-19-2008 02:56 EDT

(No Feedback Yet)

Hi, this is Tonmoy. I have seen your job posting, and I may be a fit for this job. Please see PM for the details. Thanks. Regards, Tonmoy


150

3 days

08-19-2008 07:15 EDT

(No Feedback Yet)

Hello Sir, this is Randhir Singh from India. I am very much interested in doing your work. I have 5 years of experience in ASP.NET, AJAX, HTML/CSS, and Flash Communication Server. I have experience in e-commerce, content management systems, picture galleries, web chat, screen scraping, community, and audio and video recording. My portfolio is as follows: http://www.shiplanes.com/ http://www.ucgsga.net/ http://www.medicalab.co.in http://59.94.224.240/dogsinbritain/LogIn.aspx http://59.94.224.240/Supery/User/frmHomePage.aspx I look forward to hearing from you. Thank you for your consideration.


250

20 days

08-19-2008 16:16 EDT

(No Feedback Yet)

This is easily done. Let me know when to start! BTW, How can something like this: (?<=baboutbs*)[0-9]*(?=s*b forb) be called Regular?!?


200

5 days

08-20-2008 08:51 EDT

(No Feedback Yet)

We are interested and ready to develop this application as soon as possible, as per your requirements and convenience.


180

3 days

08-20-2008 10:12 EDT

(No Feedback Yet)

I am ready to start working. Trust me.


150

8 days

08-20-2008 11:22 EDT

(No Feedback Yet)

Hi, I am interested in your job posting. This may be my first project, as I am new to the freelance world :) Regards, Amit


150

2 days

08-21-2008 06:34 EDT

(No Feedback Yet)

I will write this scraper for you.


202

30 days

08-23-2008 03:22 EDT

(No Feedback Yet)

Dear Sir, I have 4 years of experience in Microsoft technologies. In this span I have developed various intranet and web-based projects using ASP.NET/C#/AJAX/Cuyahoga framework/NHibernate, with MS SQL Server as the backend. I have also created static and dynamic design models. Having excellent analytical, logical and programming skills, I am capable of analysing the problem domain and providing a firm solution that can be maintained with your business agility. I have a group who can work with me to complete this project on time. My candidature matches the applied job. Expertise: ASP.NET 2.0, 3.5, C#, AJAX, Cuyahoga framework, NHibernate, SQL Server 2005/2008, UML. Regards, Atul Chaturvedi


150

10 days

08-24-2008 04:04 EDT

(No Feedback Yet)

I have 2 years of experience in .NET technologies and programming. As a freelancer I have done two web scraping projects, so I have good exposure to using Regex and other .NET classes for scraping. One of those web scraping projects was basically a reverse engineering project, in which I had to extract data and create a database from it. I have hands-on experience with e-commerce web sites and desktop applications in Microsoft technologies.


250

30 days

08-24-2008 10:45 EDT

(No Feedback Yet)

Please View PM.
