|
For Authors: |
|
For Readers: |
|
|
|
Article Details :: |
|
Article Name : | | VARIOUS APPROACHES FOR DEEP WEB DATA EXTRACTION | Author Name : | | SHILPA DESHMUKH AND NEHA CHOPADE | Publisher : | | Ashok Yakkaldevi | Article Series No. : | | ISRJ-119 | Article URL : | | | Author Profile | Abstract : | | World Wide Web has vast useful Web databases which are difficult to extract relevant data from various sources. The number of Web databases has reached 50 millions according to a recent survey. These web databases can be searched through their web query interfaces. The web pages resulted are said to be surface web which can be accessed by search engines without accessing web databases and deep web refers to the web page that is not indexed by the general search engine. Deep web can be accessed only by websites interfaces. So it is inaccessible to search engines. So extracting data from deep page is critical problem.This paper studies some deep web data extraction techniques. A different way for deep web data extraction to overcome limitations of previous works is using visual approach. Visual features of deep web pages are used as primary concern to extract contents from deep web pages. It includes both data record extraction and data item extraction. Visual wrapper gets generated for web database to which a given deep web page belongs. | Keywords : | | - various approaches
- techniques
- web databases
- deep web data extraction.
|
|
|
|
|
|