Ontology Based Data Unit Similarity With Combining Tag And Value For Data Extraction And Alignment

##plugins.themes.academic_pro.article.main##

K. Jeyalakshmi
Anitha J

Abstract

Web database extraction is used to retrieve relevant information from the query result page. By combining tag and value one can extracts data from query result pages by first identifying and segmenting the query result records (QRRs) in the query result pages and then aligning the segmented QRRs into a table. But combining tag and value similarity measure doesn't handle non-contiguous QRR. To overcome this problem a novel method is proposed to display the most distinct query records from user's query result pages. In this method, First distinct tags are extracted from the result records to build the tag vector table, and then the similarity between each record is found using several similarity methods. Finally the values of similar records are combined and aligned using ontology based alignment.

                

##plugins.themes.academic_pro.article.details##