Title: A New Feature to Improve Moore’s Sentence Alignment Method
Authors: Trieu, Hai-Long
Nguyen, Phuong-Thai
Nguyen, Le-Minh
Keywords: Sentence Alignment; Parallel Corpora; Word Clustering; Natural Language Processing
Issue Date: 2015
Publisher: H. : ĐHQGHN
Citation: p. 32-44
Series/Report no.: Vol. 31, No. 1
Abstract: The sentence alignment approach proposed by Moore (2002), M-Align, is an effective method that achieves relatively high performance by combining length-based and word-correspondence methods. Nevertheless, despite its high precision, M-Align usually has low recall, especially when dealing with sparse data. We propose an algorithm that not only exploits the advantages of M-Align but also overcomes this weakness of the baseline method by using a new feature in sentence alignment: word clustering. Experiments show an improvement over the baseline method of up to 30% in recall while precision remains reasonable.
ISSN: 0866-8612
Appears in Collections: Computer Science and Communication Engineering

Files in This Item:

  • File : document.pdf
  • Size : 432.82 kB
  • Format : Adobe PDF
