A French corpus annotated for multiword expressions and named entities.
Marie CanditoMathieu ConstantCarlos RamischAgata SavaryBruno GuillaumeYannick ParmentierSilvio CordeiroPublished in: J. Lang. Model. (2020)
Keyphrases
- multiword
- named entities
- annotated corpus
- genia corpus
- named entity recognition
- relation extraction
- context sensitive
- lexical units
- information extraction
- co occurrence
- natural language processing
- text mining
- question answering
- text documents
- language model
- text clustering
- natural language
- automatic annotation
- part of speech
- document representation
- wordnet
- unsupervised learning
- bayesian networks
- feature selection
- semantic knowledge
- domain specific
- named entity disambiguation