Spoken Tunisian Arabic Corpus "STAC": Transcription and Annotation.
Inès Zribi, Mariem Ellouze, Lamia Hadrich Belguith, Philippe Blache
Published in: Res. Comput. Sci. (2015)
Keyphrases
- annotated corpus
- handwriting recognition
- speech recognition
- spontaneous speech
- automatic transcription
- hand-crafted
- conversational speech
- automatic annotation
- semantic annotation
- unknown words
- spoken language
- automatic speech recognition
- test set
- inter-annotator agreement
- Arabic language
- metadata
- manual annotation
- manually annotated
- relation extraction
- image retrieval
- morphological analysis
- training corpus
- language identification
- automatic image annotation
- named entity recognition
- handwritten text recognition
- visual features
- Arabic handwriting recognition
- human-machine interaction
- language understanding
- broadcast news
- image annotation