A New Feature to Improve Moore’s Sentence Alignment Method

The sentence alignment approach proposed by Moore, 2002 (M-Align) is an effective method which gets a rela-tively high performance based on mbination of length-based and word correspondences. Nevertheless, despite the high precision, M-Align usually gets a low recall especially when dealing with sparse data problem. We pro-pose an algorithm which not only exploits advantages of M-Align but overcomes the weakness of this baseline method by using a new feature in sentence alignment, word clustering. Experiments shows an mprovement on the baseline method up to 30% recall while precision is reasonable.

Title: A New Feature to Improve Moore’s Sentence Alignment Method
Authors: Trieu, Hai-Long
Nguyen, Phuong-Thai
Nguyen, Le-Minh
Keywords: Sentence Alignment
Parallel Corpora
Word Clustering
Natural Language Processing
Issue Date: 2015
Publisher: H. : ĐHQGHN

Nhận xét

Bài đăng phổ biến từ blog này

Tích hợp kiến thức Di truyền học trong dạy học Tiến hóa (Sinh học 12)

Những vấn đề lý luận và thực tiễn về phạm tội chưa đạt theo luật hình sự Việt Nam

Coupled Resonator Induced Transparency (CRIT) Based on Interference Effect in 4x4 MMI Coupler