Sentence Alignment for Spanish-Basque Bitexts: Word Correspondences vs. Markup Similarity.
Arantza CasillasIdoia FernándezRaquel Martínez-UnanuePublished in: CICLing (2004)
Keyphrases
- sentence similarity
- word level
- language identification
- sentence pairs
- document images
- sentence level
- machine translation system
- semantic similarity
- word pairs
- partially overlapping
- word similarity
- noun phrases
- syntactic information
- similarity measure
- syntactic analysis
- parallel corpus
- partial matching
- word alignment
- document analysis
- n gram
- machine translation
- multi document summarization
- language independent
- lexico syntactic
- question answering
- pairwise
- protein structure alignment
- word segmentation
- dynamic time warping
- text corpus
- distance measure
- natural language
- linguistic features
- training corpus
- markup language
- sentiment analysis
- co occurrence
- keywords
- word order
- matching cost
- dependency relations
- recognizing textual entailment
- geometric constraints
- syntactic categories