Construction conjointe d'un corpus et d'un classifieur pour les registres de langue en français (Joint building of a corpus and a classifier for language registers in French).
Gwénolé Lecorvé, Hugo Ayats, Benoît Fournier, Jade Mekki, Jonathan Chevelu, Delphine Battistelli, Nicolas Béchet
Published in: CORIA-TALN-RJC (TALN) (2018)
Keyphrases
- natural language
- open domain
- manually annotated
- spanish language
- decision trees
- co-occurrence
- learning algorithm
- training samples
- language learning
- text corpora
- text classification
- spoken dialog
- lexical features
- multiword
- natural language processing
- training set
- support vector
- training data
- feature extraction
- feature selection