MC4WEPS: a multilingual corpus for Web people search disambiguation.
Soto MontalvoRaquel MartínezLeonardo CampillosAgustín D. DelgadoVíctor FresnoFelisa VerdejoPublished in: Lang. Resour. Evaluation (2017)
Keyphrases
- web people search
- parallel corpus
- cross language information retrieval
- web pages
- manually annotated
- comparable corpora
- named entity disambiguation
- digital libraries
- text corpora
- open domain
- spoken dialog
- query translation
- linguistic features
- feature selection
- chinese english
- reference resolution
- keywords
- supervised machine learning
- information extraction
- language independent
- test set
- language model
- natural language processing