Vietspider Web Data Extractor (VDER) implement the Website Parse Template concept, a web 3.0 crawling technology extract data from website. Software extract data from website, output to XML. Support database MS SQL Server, MySQL, Oracle, Postgres,...
Jan 19, 2013 14:04:07
Windows 95, Windows 98, Windows ME, Windows 2000, Windows XP, Windows Vista, Unix, Mac OS
Description: The web crawler is a program that automatically traverses the web by downloading the pages and following the links from page to page. A general purpose of web crawler is to download any web page that can be accessed through the links.
This process is called web crawling or spidering. Many sites, in particular search engines, use spidering as a means of providing up-to-date data. Web crawlers are mainly used to create a copy of all the visited pages for later processing by a search engine that will index the downloaded pages to provide fast searches. Crawlers can also be used for automating maintenance tasks on a website, such as checking links or validating HTML code. Also, crawlers can be used to gather specific types of information from Web pages, such as harvest ing e-mail addresses (usually for spam).
A web crawler is one type of bot, or software agent. In general, it starts with a list of URLs to visit, called the seeds. As the crawler visits these URLs, it identifies all the hyperlinks in the page and adds them to the list of URLs to visit, called the crawl frontier. URLs from the frontier are recursively visited according to a set of policies.