Building a conversation corpus by text derivation from "germ dialogs".
Naoki AsanomaSetsuo YamadaOsamu FuruseMasahiro OkuPublished in: EAMT (2005)
Keyphrases
- conversational agent
- broad coverage
- supervised machine learning
- open domain
- dialog systems
- text corpora
- text data
- text collections
- natural language text
- document corpus
- text mining
- text documents
- newspaper articles
- text corpus
- natural language generation
- sentence level
- plain text
- information extraction systems
- free text
- natural language
- spontaneous speech
- topic segmentation
- lexical features
- conversational speech
- keywords
- information retrieval
- word pairs
- noun phrases
- training corpus
- english words
- textual features
- temporal expressions
- syntactic features
- recognizing textual entailment
- linguistic patterns
- text summarization
- named entity disambiguation
- linguistic information
- information extraction
- named entity recognition
- linguistic features
- scientific papers
- annotated corpus
- conversational agents
- document level