Sentence-Based Plagiarism Detection for Japanese Document Based on Common Nouns and Part-of-Speech Structure.
Takeru YokoiPublished in: SoMeT (Selected Papers) (2014)
Keyphrases
- part of speech
- plagiarism detection
- word sense disambiguation
- text documents
- noun phrases
- natural language processing
- tf idf
- n gram
- wordnet
- parse tree
- syntactic categories
- information retrieval
- source code
- keywords
- semantic information
- web documents
- document clustering
- text classification
- document retrieval
- semantic similarity
- information retrieval systems
- named entity recognition
- cross language
- semi supervised
- information extraction
- web pages
- artificial intelligence
- machine learning