crawl website and pull data, how?

cgchris99

I have been able to program an app to load a website and crawl through a certain area of it.

But now I need to be able to pull specific pieces of information from it.

For example, the site shows the part number, description, quantity on hand, and price.

Can anyone help with parsing this information out and putting it into a database? I would just cut and paste into Excel, but it's over 100 pages' worth.

It is NOT XML; I wish it was. It's just plain HTML, and the part number information is spread across two rows.

It does have <A Href="#" name="skus" onClick="javascript.... at the beginning of the line, so maybe this will help.

Thanks for any help
 
Well, I wrote a web spider to retrieve some information and images from a webpage. I started by finding similarities in how the data is displayed, then wrote regular expressions to retrieve the specific data I was looking for.
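
To give a rough idea of what that can look like, here is a minimal sketch in Python. The sample markup, the `show(...)` handler, the two-row layout, and the names `SAMPLE_HTML`, `PART_PATTERN`, and `extract_parts` are all assumptions made up for illustration; only the `<A Href="#" name="skus"` anchor comes from the description above, so the pattern would need adjusting to the real page.

```python
import re

# Hypothetical markup: the real site's layout is unknown. This assumes the
# part number and description sit in one row and quantity/price in the next.
SAMPLE_HTML = """
<table>
<tr><td><A Href="#" name="skus" onClick="javascript:show('AB-100')">AB-100</A></td>
    <td>Widget, small</td></tr>
<tr><td>12</td><td>$4.50</td></tr>
<tr><td><A Href="#" name="skus" onClick="javascript:show('CD-200')">CD-200</A></td>
    <td>Gadget, large</td></tr>
<tr><td>0</td><td>$19.99</td></tr>
</table>
"""

# Anchor on the name="skus" link, then capture the four fields across two rows.
PART_PATTERN = re.compile(
    r'<a\s+href="#"\s+name="skus"[^>]*>(?P<part>[^<]+)</a>\s*</td>\s*'
    r'<td>(?P<desc>[^<]*)</td>\s*</tr>\s*'
    r'<tr>\s*<td>(?P<qty>\d+)</td>\s*<td>\$?(?P<price>[\d.,]+)</td>',
    re.IGNORECASE,
)

def extract_parts(html):
    """Return one dict per part found in the given HTML text."""
    rows = []
    for m in PART_PATTERN.finditer(html):
        rows.append({
            "part": m.group("part").strip(),
            "desc": m.group("desc").strip(),
            "qty": int(m.group("qty")),
            "price": float(m.group("price").replace(",", "")),
        })
    return rows

for row in extract_parts(SAMPLE_HTML):
    print(row)
```

Regular expressions like this tend to be brittle if the site changes its markup, so it is worth checking the match count against what you see on a page or two before running it over all 100+ pages.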

http://regular-expressions.info
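
As for the database part of the question, once each part is reduced to a row of four values, loading them is mostly boilerplate. Below is a rough sketch using Python's built-in `sqlite3` module; the table name `parts`, the column names, the helper `save_parts`, and the sample rows are all hypothetical, and any other database would work along the same lines.

```python
import sqlite3

def save_parts(conn, rows):
    """Insert (part_number, description, qty_on_hand, price) tuples."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS parts ("
        "part_number TEXT PRIMARY KEY, description TEXT, "
        "qty_on_hand INTEGER, price REAL)"
    )
    # INSERT OR REPLACE keeps a re-crawl from creating duplicate part numbers.
    conn.executemany(
        "INSERT OR REPLACE INTO parts VALUES (?, ?, ?, ?)", rows
    )
    conn.commit()

# Made-up rows standing in for whatever the parser produces.
sample_rows = [
    ("AB-100", "Widget, small", 12, 4.50),
    ("CD-200", "Gadget, large", 0, 19.99),
]

conn = sqlite3.connect(":memory:")  # use a file path to keep the data
save_parts(conn, sample_rows)

for record in conn.execute("SELECT * FROM parts ORDER BY part_number"):
    print(record)
```

Using the part number as the primary key is just one choice; if the site lists the same part more than once with different prices, a separate auto-increment key may fit better.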
 
I am going to look at using regular expressions.

However, I want to look at Mine the Web first. It looks like it may have everything I need already.
 