Using Word Embedding for Cross-Language Plagiarism Detection.
Jérémy Ferrero, Laurent Besacier, Didier Schwab, Frédéric Agnès
Published in: EACL (2) (2017)
Keyphrases
- plagiarism detection
- cross-language
- spoken document retrieval
- text retrieval
- question answering
- document retrieval
- information access
- cross-language information retrieval
- cross-lingual
- document collections
- text categorization
- co-occurrence
- vector space
- n-gram
- source code
- query translation
- keywords
- duplicate detection
- document clustering
- semi-supervised learning
- text classification
- natural language processing