Natural Language Inference for Arabic Using Extended Tree Edit Distance with Subtrees

M. Alabbas; A. Ramsay

doi:10.1613/jair.3892

Published: Oct 11, 2013

Abstract

Many natural language processing (NLP) applications require computing the similarity between pairs of syntactic or semantic trees. Many researchers have used tree edit distance for this task, but the technique has the drawback that it handles single-node operations only. We have extended the standard tree edit distance algorithm to handle subtree transformation operations as well as single-node operations. The extended algorithm, TED+ST, is more effective and flexible than the standard algorithm, especially for applications that pay attention to relations among nodes (e.g., in linguistic trees, deleting a modifier subtree should be cheaper than the sum of the costs of deleting its components individually). We describe the use of TED+ST for checking entailment between two Arabic text snippets. Preliminary results with TED+ST were encouraging when compared with two string-based approaches and with the standard algorithm.
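
The contrast between node-level and subtree-level operations can be illustrated with a minimal sketch. This is not the TED+ST algorithm from the paper: it is a plain memoized forest recursion for ordered tree edit distance with unit node costs, plus a hypothetical whole-subtree delete/insert operation. The tuple-based tree encoding, the function names, and the `extra_node_cost` pricing (1.0 for the subtree root plus a discounted cost per remaining node) are all assumptions made for illustration.

```python
from functools import lru_cache

# Assumed encoding: a tree is (label, children), where children is a tuple of trees.


def size(tree):
    """Number of nodes in a tree."""
    return 1 + sum(size(child) for child in tree[1])


def make_distance(extra_node_cost=None):
    """Build a tree distance function.

    extra_node_cost=None allows single-node operations only (unit costs).
    Otherwise a whole subtree may be deleted or inserted in one step for
    1.0 + extra_node_cost * (size - 1); this pricing is a hypothetical choice.
    """

    def subtree_cost(tree):
        return 1.0 + extra_node_cost * (size(tree) - 1)

    @lru_cache(maxsize=None)
    def dist(f1, f2):
        # f1 and f2 are forests: tuples of trees, compared right to left.
        if not f1 and not f2:
            return 0.0
        options = []
        if f1:
            v = f1[-1]
            # Delete the root of v; its children rejoin the forest.
            options.append(dist(f1[:-1] + v[1], f2) + 1.0)
            if extra_node_cost is not None:
                # Delete the whole subtree rooted at v in one operation.
                options.append(dist(f1[:-1], f2) + subtree_cost(v))
        if f2:
            w = f2[-1]
            # Insert the root of w (mirror image of node deletion).
            options.append(dist(f1, f2[:-1] + w[1]) + 1.0)
            if extra_node_cost is not None:
                # Insert the whole subtree rooted at w in one operation.
                options.append(dist(f1, f2[:-1]) + subtree_cost(w))
        if f1 and f2:
            v, w = f1[-1], f2[-1]
            relabel = 0.0 if v[0] == w[0] else 1.0
            # Map root v to root w, then align their children and the rest.
            options.append(dist(f1[:-1], f2[:-1]) + dist(v[1], w[1]) + relabel)
        return min(options)

    return lambda t1, t2: dist((t1,), (t2,))


# Toy dependency-style trees: the second drops the subtree under "man".
t1 = ("saw", (("man", (("the", ()), ("old", ()))), ("dog", ())))
t2 = ("saw", (("dog", ()),))

standard = make_distance()                        # single-node operations only
with_subtrees = make_distance(extra_node_cost=0.5)

node_only_distance = standard(t1, t2)
subtree_distance = with_subtrees(t1, t2)
```

On this toy pair, the node-only distance is 3.0 (three single deletions), while the variant with subtree operations yields 2.0, because the three-node subtree is removed in one discounted step. How subtree operations should actually be priced for entailment is a modelling decision that this sketch does not settle.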

Issue

Vol. 48 (2013)

Section

Articles
