A Multidimensional Sequence Approach to Measuring Tree Similarity

Abstract: The tree is one of the most common and well-studied data structures in computer science. Measuring the similarity of such structures is key to analyzing this type of data. However, measuring tree similarity is not trivial because of the inherent complexity of trees and the resulting large search space. The tree kernel, a state-of-the-art similarity measure for trees, represents trees as vectors in a feature space and measures similarity in that space. When different features are used, different algorithms are required. Tree edit distance is another widely used similarity measure for trees. It measures similarity through the edit operations needed to transform one tree into another. Without any restrictions on the edit operations, the computational cost is too high for large volumes of data. To improve efficiency, approximations of tree edit distance have been introduced, but their effectiveness can be compromised. In this paper, a novel approach to measuring tree similarity is presented. Trees are represented as multidimensional sequences, and their similarity is measured on the basis of their sequence representations. Multidimensional sequences have sequential dimensions and spatial dimensions. We measure sequential similarity by the all common subsequences similarity measure or the longest common subsequence measure, and measure spatial similarity by dynamic time warping. We then combine them to give a measure of tree similarity. A brute-force algorithm for calculating the similarity would have a high computational cost. In the spirit of dynamic programming, two efficient algorithms with quadratic time complexity are designed for calculating the similarity. The new measures are evaluated in terms of classification accuracy with two popular classifiers (k-nearest neighbor and support vector machine) and in terms of search effectiveness and efficiency in k-nearest neighbor similarity search, using three different data sets from natural language processing and information retrieval. Experimental results show that the new measures outperform the benchmark measures consistently and significantly.
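
For readers unfamiliar with the two building blocks named in the abstract, here is a minimal Python sketch of the textbook dynamic programs for longest common subsequence length and dynamic time warping on plain one-dimensional sequences. It is not the paper's algorithm: the abstract does not say how trees are turned into multidimensional sequences or how the sequential and spatial scores are combined, so none of that is attempted here. The function names `lcs_length` and `dtw_distance`, and the absolute-difference local cost used for warping, are assumptions made for illustration. Both routines fill a table of size len(a) x len(b), which is where quadratic running time comes from.

```python
from typing import Hashable, Sequence


def lcs_length(a: Sequence[Hashable], b: Sequence[Hashable]) -> int:
    """Length of the longest common subsequence of a and b (textbook DP)."""
    # prev[j] holds the LCS length of the processed prefix of a and b[:j].
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                # Extend the best match that ends just before both symbols.
                curr.append(prev[j - 1] + 1)
            else:
                # Otherwise drop the last symbol of one sequence or the other.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def dtw_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """Dynamic time warping cost between two numeric sequences.

    The local cost |x - y| is an illustrative assumption.
    """
    inf = float("inf")
    # Row 0 of the DP table: only the empty-vs-empty alignment costs 0.
    prev = [0.0] + [inf] * len(q)
    for x in p:
        curr = [inf]
        for j, y in enumerate(q, start=1):
            cost = abs(x - y)
            # Best of: advance in p only, advance in q only, advance in both.
            curr.append(cost + min(prev[j], curr[j - 1], prev[j - 1]))
        prev = curr
    return prev[-1]
```

A short usage note: `lcs_length("ABCBDAB", "BDCABA")` compares two symbol sequences, and `dtw_distance([1, 2, 3], [1, 2, 2, 3])` compares two numeric sequences of different lengths; warping lets the repeated 2 align with a single 2 at no cost.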

