Using PiTagger for Lemmatization and PoS Tagging of a Spontaneous Speech Corpus: C-Oral-Rom Italian.
Alessandro PanunziEugenio PicchiMassimo MonegliaPublished in: LREC (2004)
Keyphrases
- pos tagging
- speech corpus
- part of speech
- automatic speech recognition
- speech synthesis
- dependency parsing
- word segmentation
- domain adaptation
- machine translation
- speech recognition
- word sense disambiguation
- spoken document retrieval
- hidden markov models
- feature extraction
- bayesian networks
- text categorization
- semantic role labeling
- document analysis
- named entity recognition
- text documents
- n gram