An open diachronic corpus of historical Spanish: annotation criteria and automatic modernisation of spelling.
Felipe Sánchez-MartínezIsabel Martínez-SempereXavier Ivars-RibesRafael C. CarrascoPublished in: CoRR (2013)
Keyphrases
- manual annotation
- hand crafted
- manually annotated
- automatic annotation
- spanish language
- automatic indexing
- fully automatic
- annotated corpus
- labor intensive
- semi automatic
- active learning
- semantic annotation
- evaluation criteria
- selection criteria
- semi automatically
- open domain
- tutoring system
- test set
- social networks
- neural network
- historical data
- text classification
- metadata
- information retrieval
- machine learning