External Plagiarism Detection: N-Gram Approach Using Named Entity Recognizer - Lab Report for PAN at CLEF 2010.
Parth GuptaSameer RaoPrasenjit MajumderPublished in: CLEF (Notebook Papers/LABs/Workshops) (2010)
Keyphrases
- digital libraries
- n gram
- plagiarism detection
- named entity recognizer
- cross language
- information access
- character n grams
- document collections
- language model
- relation extraction
- language modeling
- named entities
- document retrieval
- source code
- cross lingual
- text classification
- named entity recognition
- question answering
- natural language processing
- query expansion
- text categorization
- text retrieval
- test collection
- information extraction
- cross language information retrieval
- part of speech
- information retrieval
- semantic features
- general purpose
- maximum entropy
- domain specific
- automatic extraction
- artificial intelligence