Script-description Pair Extraction from Text Documents of English as Second Language Podcast.
Hyungjong Noh, Minwoo Jeong, Sungjin Lee, Jonghoon Lee, Gary Geunbae Lee
Published in: CSEDU (1) (2010)
Keyphrases
- text documents
- information extraction
- language learning
- text mining
- computer assisted language learning
- text classification
- foreign language
- machine translation
- text categorization
- extraction patterns
- news articles
- keywords
- topic models
- tf idf
- document clustering
- wordnet
- text collections
- natural language processing
- bag of words
- natural language
- language skills
- english as a foreign language
- information retrieval
- automatic extraction
- named entities
- cross lingual
- machine learning
- pairwise
- automatic text categorization
- em algorithm
- knowledge discovery
- text corpora
- comparable corpora
- training set
- information extraction systems
- expert systems
- object recognition