Extending English ACE 2005 Corpus Annotation with Ground-truth Links to Wikipedia.
Luisa BentivogliPamela FornerClaudio GiulianoAlessandro MarchettiEmanuele PiantaKateryna TymoshenkoPublished in: PWNLP@COLING (2010)
Keyphrases
- ground truth
- wikipedia articles
- annotated corpus
- relation extraction
- named entities
- parallel corpora
- manually annotated
- named entity recognizer
- link structure
- computing semantic relatedness
- broad coverage
- person names
- link grammar
- semantic relations
- linguistic features
- english words
- information extraction
- semantic relatedness
- statistical machine translation
- semantic features
- open domain
- natural language text
- parse tree
- named entity disambiguation
- high quality
- wide coverage
- parallel corpus
- hand crafted
- natural language
- named entity recognition
- machine translation
- wordnet
- world knowledge
- semi automatically
- training corpus
- active learning
- question answering
- co occurrence
- semantic annotation
- sentence pairs
- text corpus
- metadata
- multiword
- semantic roles
- document corpus
- natural language processing
- unknown words
- cross lingual
- gold standard
- automatic extraction
- penn treebank
- word sense
- explicit semantic analysis
- semantic information
- machine translation system
- automatic annotation
- link analysis
- word sense disambiguation
- image annotation
- image retrieval
- entity ranking
- text mining
- text classification
- query translation
- cross language information retrieval
- web pages