A New Feature to Improve Moore’s Sentence Alignment Method
The sentence alignment approach proposed by Moore (2002), M-Align, is an effective method that achieves relatively high performance by combining length-based and word-correspondence features. Nevertheless, despite its high precision, M-Align usually has low recall, especially when the data are sparse. We propose an algorithm that not only exploits the advantages of M-Align but also overcomes this weakness of the baseline by using a new feature in sentence alignment: word clustering. Experiments show that recall improves by up to 30% over the baseline while precision remains reasonable.
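The abstract does not spell out how word clustering enters the alignment model, so the following is only a minimal sketch of one plausible reading: when a word pair is missing from the bilingual lexicon (the sparse-data case), fall back to a score shared by the pair's word clusters. Every name here (`build_cluster_lexicon`, `pair_score`, the `floor` value, and the toy data) is a hypothetical illustration, not the authors' implementation.

```python
from collections import defaultdict


def build_cluster_lexicon(lexicon, src_cluster, tgt_cluster):
    """Average word-pair scores over each (source cluster, target cluster) pair.

    Assumption: averaging is an illustrative choice, not taken from the paper.
    """
    buckets = defaultdict(list)
    for (src, tgt), prob in lexicon.items():
        cs, ct = src_cluster.get(src), tgt_cluster.get(tgt)
        if cs is not None and ct is not None:
            buckets[(cs, ct)].append(prob)
    return {key: sum(vals) / len(vals) for key, vals in buckets.items()}


def pair_score(src, tgt, lexicon, src_cluster, tgt_cluster,
               cluster_lexicon, floor=1e-6):
    """Score a word pair, backing off to cluster level when the pair is unseen."""
    direct = lexicon.get((src, tgt))
    if direct is not None:
        return direct  # the word pair itself is in the lexicon
    cs, ct = src_cluster.get(src), tgt_cluster.get(tgt)
    if cs is not None and ct is not None:
        # Unseen pair: reuse evidence from other words in the same clusters.
        return cluster_lexicon.get((cs, ct), floor)
    return floor  # no word-level or cluster-level evidence


# Toy data (hypothetical): "home"/"domicile" never co-occur in the lexicon,
# but they share clusters with "house"/"maison".
lexicon = {("house", "maison"): 0.8}
src_cluster = {"house": 1, "home": 1}
tgt_cluster = {"maison": 7, "domicile": 7}
cluster_lexicon = build_cluster_lexicon(lexicon, src_cluster, tgt_cluster)

seen = pair_score("house", "maison", lexicon, src_cluster,
                  tgt_cluster, cluster_lexicon)
unseen = pair_score("home", "domicile", lexicon, src_cluster,
                    tgt_cluster, cluster_lexicon)
```

In this sketch the unseen pair receives a non-trivial score through its clusters instead of the floor value, which is the kind of effect that could raise recall on sparse data.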
Title: | A New Feature to Improve Moore’s Sentence Alignment Method |
Authors: | Trieu, Hai-Long; Nguyen, Phuong-Thai; Nguyen, Le-Minh |
Keywords: | Sentence Alignment; Parallel Corpora; Word Clustering; Natural Language Processing |
Issue Date: | 2015 |
Publisher: | H. : ĐHQGHN |