Lexical and Semantic Features for Cross-lingual Text Reuse Classification: an Experiment in English and Latin Paraphrases.
Maria MoritzDavid StedingPublished in: LREC (2018)
Keyphrases
- cross lingual
- semantic features
- text classification
- machine translation
- syntactic features
- word sense
- linguistic features
- mono lingual
- text mining
- cross language
- wordnet
- document clustering
- text documents
- language modeling
- natural language processing
- semantic information
- language independent
- sentiment classification
- sentence level
- feature set
- feature selection
- machine translation system
- classification accuracy
- machine learning
- information retrieval
- relation extraction
- natural language
- keywords
- bilingual dictionaries
- word sense disambiguation
- news articles
- bag of words
- query translation
- information extraction
- labeled data
- text categorization
- language model
- source language
- n gram
- translation model
- parse tree
- wikipedia articles
- transfer learning