The sentence alignment approach proposed by Moore (2002), M-Align, is an 
effective method that achieves relatively high performance by combining 
length-based alignment with word correspondences. Nevertheless, despite 
its high precision, M-Align usually has low recall, especially when 
dealing with sparse data. We propose an algorithm that not only exploits 
the advantages of M-Align but also overcomes the weakness of this 
baseline method by using a new feature in sentence alignment: word 
clustering. Experiments show an improvement over the baseline method of 
up to 30% in recall, while precision remains reasonable.

http://repository.vnu.edu.vn/handle/VNU_123/965